A Very Small Language Model

By Jez Higgins

There are lots of things in software that seem kind of amazing, even to people who work in software. Compilers & interpreters, operating systems, windowing systems are obvious choices, as are lots of games stuff. These are, however, all pretty large topics. You can’t write an operating system in even a 90 minute session. I can’t anyway.

My phone keyboard keeps asking me again and again if I want to turn on some kind of AI assistant to "boost productivity" and "unlock creativity". I continue to say no, but it got me thinking. This rebranding of "next word prediction" as "AI assistance" groups this tech in the same bucket as the ChatGPTs and the Claudes and what have you and stokes up the idea that text prediction is a difficult problem, something best left to the big brains at Google or Apple or Microsoft with their shiny offices and free lunches in Silicon Valley and Redmond.

I reject that idea. I think this technology is something any programmer can build. We can write a text prediction engine from a standing start in an hour. In this talk, I’ll show attendees how to do just that, as well examining ways we can enhance our very small language model, and maybe even "boost productivity" and "unlock creativity".





Your Privacy

By clicking "Accept Non-Essential Cookies" you agree ACCU can store non-essential cookies on your device and disclose information in accordance with our Privacy Policy and Cookie Policy.

Current Setting: Non-Essential Cookies REJECTED


By clicking "Include Third Party Content" you agree ACCU can forward your IP address to third-party sites (such as YouTube) to enhance the information presented on this site, and that third-party sites may store cookies on your device.

Current Setting: Third Party Content EXCLUDED



Settings can be changed at any time from the Cookie Policy page.