The success of large language models (LLMs) like OpenAI’s GPTs has amazed the world with their extraordinary performance in tasks such as standardized tests, advanced mathematical reasoning, encyclopedic knowledge retrieval, and human-like language generation. However, these models fall short when it comes to embodied reasoning, physical and social understanding, and real-world strategic planning.
In this seminar, Prof. Xing will introduce “PAN”, a novel architecture designed for general and purposeful reasoning in complex, real-world environments. PAN combines a new World Model for steerable simulations of potential outcomes and a dynamic Agent Model capable of planning, learning, and adapting to achieve goals across diverse contexts.
He will contrast this approach with current architectures, addressing common misconceptions around agent and world models, and discuss choices of data, representations, and training strategies. The talk will also feature theoretical insights and a preview of an upcoming model release. Prof. Xing will conclude with a broader reflection on concepts such as agency and artificial general intelligence (AGI), informed by a renewed philosophical lens.