What’s next

Industry, education and everyday users are making increasing use of large language models (LLMs) driven in part by the prominence of ChatGPT and other AI tools. The technology is developing at a pace. The analysis of commentators, critics and legislators also gain traction as they evaluate the implications of the technology and seek to influence…More

Meaningful, imaginative, fluid

The recent escalation of interest in text-based AI coincides with our students completing their project work, dissertations, research degrees, etc. So it’s been a useful period to test the capabilities of platforms such as ChatGPT4 as a tutor — or perhaps as co-tutor. As yet there are no constraints in place that seriously impede its…More

Training your AI

The training data for a GPT model such as ChatGPT4 consists of many (hundreds of billions!) tokens harvested in sequence from various online sources, such as Wikipedia amongst many others. This source data is processed as a continuous stream independent of document boundaries. Tokenize The initial training task is to tokenize the entire corpus, identifying…More

More on automated recollection

Conversational AI platforms such as ChatGPT generate predictions for what should come next after a user inputs a question, statement, paragraph, or other text prompt. In early text prediction software, a simplistic language model might calculate the most likely word to follow a given input like “door,” based on pre-calculated statistical analysis of word co-occurrences…More