COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
Sphere Neural-Networks for Rational ReasoningAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Challenger Mishra. This is an up-to-date introduction to Sphere Neural-Network. The last one can be viewed at http://www.youtube.com/watch?v=mZMwYtwymm0&t=1s The success of Large Language Models (LLMs), e.g., ChatGPT, is witnessed by their planetary popularity, their capability of human-like communication, and also by their steadily improved reasoning performance. However, it remains unclear whether LLMs reason. It is an open problem how traditional neural networks can be qualitatively extended to go beyond the statistic paradigm and achieve high-level cognition. Here, we present a novel qualitative extension by generalising computational building blocks from vectors to spheres. We propose Sphere Neural Networks (SphNNs) for human-like reasoning through model construction and inspection, and develop SphNN for syllogistic reasoning, a microcosm of human rationality. SphNN is a hierarchical neuro-symbolic Kolmogorov-Arnold geometric GNN , and uses a neuro-symbolic transition map of neighbourhood spatial relations to transform the current sphere configuration towards the target. SphNN is the first neural model that can determine the validity of long-chained syllogistic reasoning in one epoch without training data, with the worst computational complexity of O(N). SphNN can evolve into various types of reasoning, such as spatio-temporal reasoning, logical reasoning with negation and disjunction, event reasoning, neuro-symbolic unification, and humour understanding (the highest level of cognition). All these suggest a new kind of Herbert A. Simon’s scissors with two neural blades. SphNNs will tremendously enhance interdisciplinary collaborations to develop the two neural blades and realise deterministic neural reasoning and human-bounded rationality and elevate LLMs to reliable psychological AI. This work suggests that the non-zero radii of spheres are the missing components that prevent traditional deep-learning systems from reaching the realm of rational reasoning and cause LLMs to be trapped in the swamp of hallucination. This talk is part of the ml@cl-math series. This talk is included in these lists:Note that ex-directory lists are not shown. |
Other listsTwo cultures: can policy makers and academic institutions ever work together effectively? Cambridge Technology & New Media Research Cluster CCIMI SeminarsOther talksARC39.1 Launch Event & Keynote: Entangled or Entrapped; Human Induced Changes to the Beetle Fauna of the UK How to Think about God the Creator Today Fragmentations with erasure TBC Seminars in Cancer TBA |