Beyond Token Prediction: the post-Pretraining journey of modern LLMs
(This blog post, as most of my recent ones, is written with AI assistance and augmentation)
(This blog post, as most of my recent ones, is written with AI assistance and augmentation)
The breakneck speed of innovation in the artificial intelligence sector has naturally steered my writing towards AI-centric themes on this blog. Regular readers, however, will recall that my passi...
Exploring the AI-Driven Future of Software Development
In my view, the concept of Artificial General Intelligence (AGI) as it’s commonly understood might be a misnomer. Human intelligence itself is not ‘general’; it is inherently constrained by our sen...
The Big Sur Marathon is considered the most beautiful marathons in the US and top ten in the world. However, it did not make it into my top 10 most epic runs
In the landscape of Generative AI (GenAI), we often find ourselves amazed at the rapidity and scale of advancements. GPT-4 stands as a shining example, pushing the boundaries of linguistic understa...
The layers of GenAI development
(I recently turned this guide into a paper. You can find it here)
In this post I summarize the main advances in the area of LLM models, and particularly open source LLMs (including Falcon, LlaMa2, and Free Willy). I describe different leaderboards and what their ...
DALL-E 2: An old professor with a notebook in his hand talking to a futuristic looking robot. 4k. Professional photo. Photorealistic