By Tom Davenport — Jun 28, 2024

AI Tweets You Missed This Week - 28th June

Everything I bookmarked that is worth sharing about AI in the past week.

Nothing on the creative front this week, apart from a really bad Dave Chappelle deepfake. But I'm also finding some weird innovations that defy previous categories, so let's start there:

Wrapping an LLM into a font is pretty mind-blowing. First, what the hell else are people rolling into fonts? That sounds like a security concern. Second, does this mean apps don't have to ship updates with AI features, if you can just load an LLM font into the app to get the same features? This is the underrated innovation of the month IMO:

Llama.ttf: A font which is also an LLM (wtf?)

TinyStories (15M) as a font 🤯 The font engine runs inference of the LLM. Local LLMs taken to an extreme https://t.co/t3vh5BRePy
— Omar Sanseviero (@osanseviero) June 23, 2024

Wifi signals reading room spaces is such an incredible hack on existing infrastructure, and such a mindf**k on the modalities AI can work with, that I think a big part of the impending luddite movement will be rejecting technology we've otherwise been comfortable with. Will people let this stuff into their homes once it's mainstream?

Building this for nights and weekends with @radshaan pic.twitter.com/KaIb8WBRHk
— Muthu Adithya (@bitBinHex) June 23, 2024

It gets worse:

AI turns wi-fi routers into "cameras" that see people through walls.

This is yet another example of something that some skeptics would have said was "science fiction!" and "impossible!"

Unlike humans, AGIs will have BILLIONS of eyes and ears. How? Since we are giving the AIs… https://t.co/ca9B3kS5cf pic.twitter.com/iTm99KoVO2
— AI Notkilleveryoneism Memes ⏸️ (@AISafetyMemes) June 23, 2024

Unrelated to AI specifically, but another example of a modality you never imagined, yielding intelligence a human mind could never interpret on its own. (Years ago there was some study about working out a PIN on a phone or similar, based on the ambient light changes from your hand being read by the camera. Cyber crime in the future is going to be NUTS.)

did you know that researchers in 2022 demonstrated that you could process an opponent's microphone, extracting the noise induced by their GPU, and use that noise to know where they were on a Counter-Strike map https://t.co/ObWroRZGKW pic.twitter.com/k4pfzXvTtZ
— ⚡️🌙 (@dystopiabreaker) January 16, 2024

Websim

I need to dig more on Websim's injection feature, because I can't get my head around how people are getting this level of interactivity out of pure URL prompts:

Incredibly unique usage of smartphone accelerometer with websim to generate and control audio!

This tool was created by websim user @ newinternethandyman147

Test out your accelerometer-based musical ability here - https://t.co/vd9ar4l9gS pic.twitter.com/kbM7g09ZTB
— websim (@websim_ai) June 27, 2024

More audio experiments. Let's build an entire studio workflow in websim!

another day another @websim_ai music experiment.
let's see how far sonnet 3.5 can get at building an audio workstation. first step: the sequencer.
all samples synthesized by Claude. pic.twitter.com/MTI6MgqpkA
— A.J. (@aj_dev_smith) June 23, 2024

Hillary keeps banging out smart prototypes with Websim:

https://t.co/JjjWE3XQ2Z These are so cool. This one is Future Insights. It's the same thing where you delve the categories rather than click, and if options to enter recommendations or scenarios arise, highlight what you type, then delve to activate it. https://t.co/JjjWE3XQ2Z
— Hillary Frasier Hays (@loveinadoorway) June 26, 2024

Hillary's demo is similar to another banger from Websim this week. This is an amazing tool to explore concepts, and you can share the result. I was only three clicks from my favorite topics on mass behaviour and mimetic engineering:

An AMAZING usage of realtime generation and persistence by @christopherdb - users have the ability to start a new knowledge bubble, click and expand on any bubble, see how it connects to existing topics, and share!
Explore knowledge here: https://t.co/G9bP2P4RPy click 'Inst' for… pic.twitter.com/ulTwZRKNMF
— websim (@websim_ai) June 22, 2024

Love the result from this game built in Websim, mixing up Tetris and a tamagotchi (which you might recall from last week's newsletter):

my weekend project was using @websim_ai to co-create a mobile game with claude

a tamagotchi x tetris mashup where the block colors become:
💧 water
🍏 food
😊 happiness

🟥 red blocks are points

it’s surprisingly fun to play! pic.twitter.com/bMNiFY2Ca1
— Thiago Duarte (@dooartsy) June 23, 2024

Agents

Yohei brushing up BabyAGI, one of the founding agent platforms:

Finally in clean up mode on the newest BabyAGI after having some time to code this past weekend.

Current status:
- Microservice based skill architecture
- Auto logging of skill usage (as graph)
- Various skills to read and write code & logs
- UI seeing raw logs and testing… pic.twitter.com/sT1JabgcmF
— Yohei (@yoheinakajima) June 25, 2024

Tools

Fed up of debates? Have two LLMs fight out a topic for you 😄

Im tired of making decisions

gonna put an angel and devil on my shoulder

btw its live if you wanna try it

cc: @_nightsweekends @_buildspace pic.twitter.com/eU82PCCQMB
— anish (@thiteanish) June 22, 2024

Love this idea of building scoring and evaluation tools for decision making. All too often I build ranking tools in a Google Sheet or similar to help rank ideas etc, but you can get bogged down in building a "perfect" scoring system. Tools like Claude's Artifacts get all that out of the way so you can get on with it:

This. THIS is my favorite Claude use case.

Take an ungodly amount of data and preferences, shove it into Claude, ask for an interactive decision-making bot, ask for scoring and reward mechanism, personalize as necessary.

Brands will now calibrate for human+AI decisions. pic.twitter.com/uhSZGQ8Brx
— Allie K. Miller (@alliekmiller) June 27, 2024
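
For anyone who wants to see what the spreadsheet-style ranking mentioned above boils down to, here's a minimal sketch in Python. To be clear, this is not what Claude produces in the tweet; it's just a plain weighted average, and every name in it (`Idea`, `weighted_score`, the criteria, the weights and the example ideas) is made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Idea:
    name: str
    # Hypothetical criteria, each scored 1-5 where higher is better
    # (so a high "effort" score means the idea is easy to do).
    scores: dict[str, float]


def weighted_score(idea: Idea, weights: dict[str, float]) -> float:
    """Weighted average of an idea's scores; missing criteria count as 0."""
    total_weight = sum(weights.values())
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    total = sum(w * idea.scores.get(c, 0.0) for c, w in weights.items())
    return total / total_weight


def rank(ideas: list[Idea], weights: dict[str, float]) -> list[tuple[str, float]]:
    """Return (name, score) pairs, best first."""
    scored = [(i.name, round(weighted_score(i, weights), 2)) for i in ideas]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Illustrative weights and ideas only.
WEIGHTS = {"impact": 3, "effort": 1, "confidence": 2}
IDEAS = [
    Idea("podcast", {"impact": 5, "effort": 1, "confidence": 2}),
    Idea("newsletter redesign", {"impact": 3, "effort": 4, "confidence": 5}),
]
RANKING = rank(IDEAS, WEIGHTS)

if __name__ == "__main__":
    for name, score in RANKING:
        print(f"{score:.2f}  {name}")
```

The point of keeping it this dumb is the same as the tweet's: the scoring system only has to be good enough to force a decision, and you can tweak the weights in seconds rather than rebuilding a sheet.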

Truth and fact is probably the biggest challenge and goal facing big AI companies. This research claims to make big improvements:

Well, this is big. Lamini Memory just shatters everything we know about LLMs and hallucinations.

Makes current LLM benchmarks like MMLU obsolete.

Here's the TLDR:

1. Hallucination reduction by 10x, without compromising LLM creativity.

2. New 'Lamini-1' architecture scales… pic.twitter.com/GCQyIQIwTJ
— bidhan roy 🥯 (@bidhanxyz) June 22, 2024

This is apparently state-of-the-art lip sync. Still looks a bit ropey to me, but you can imagine the creative potential:

⚡today we're launching lipsync-1.7.0-beta

an experimental state-of-the-art video-to-video lipsync model that generates more natural teeth and accurate skin tones

also generates ~1.5x faster than 1.6.0 and ~2x faster than 1.6.1 👀

available now – also through our new discord… pic.twitter.com/bU7Avh4aTA
— sync.labs (YC W24) (@synclabs_so) June 25, 2024

MultiOn (not the Multi bought by OpenAI) looks like a killer scraping tool using just natural language. I've tried "easy" AI scraping like BrowserBear and still battle with it, so I'm looking forward to trying this one:

Introducing Retrieve API: the best-in-class autonomous web information retrieval API.

Developers love our Agent API ❤️. Since its launch, we have consistently received feedback that many use cases rely on intelligently leveraging the Agent API to retrieve information from the… pic.twitter.com/upOn8TflUj
— MultiOn (@MultiOn_AI) June 26, 2024

Automating intelligence is my holy grail for AI tools and generally the focus of my private R&D. This is a stellar example of a system whose output gives you the info you need when you wake up, so you're ready for the day's meetings:

Since YC ended, we've had 10+ demo calls a day

Every morning, Claude 3.5 Sonnet sends me detailed research reports about everyone I'm meeting

At 8am, I get a
-Text with a TLDR about my day
-Detailed email with research about every customer (ARR, company summary, industry etc) pic.twitter.com/VJsiT3UsOg
— Max Brodeur-Urbas (@MaxBrodeurUrbas) June 25, 2024

Agents

Agents as microservices?

✨ Just announced on stage at @aiDotEngineer World's Fair! ✨ A brand new framework for getting multi-agent AI systems into production!

Currently an alpha release, llama-agents provides:
⭐️ Distributed, service-oriented architecture
⭐️ Communication via standard HTTP APIs
⭐️… pic.twitter.com/YxqWMJFUvC
— LlamaIndex 🦙 (@llama_index) June 27, 2024

Models

One big overlooked use of AI (outside of the big companies who do this routinely) is models for pure evaluation and standard checks for faster feedback loops. I've been wanting to make something that vets our company content against whatever standards I intuitively apply when approving it myself. Here's a great example of OpenAI using one model to catch bugs in another model's code:

We’ve trained a model, CriticGPT, to catch bugs in GPT-4’s code. We’re starting to integrate such models into our RLHF alignment pipeline to help humans supervise AI on difficult tasks: https://t.co/5oQYfrpVBu
— OpenAI (@OpenAI) June 27, 2024

Has anyone figured out what this is yet? The logical reason for S2S is probably 'speech to speech':

New model appeared in my ChatGPT models list. GPT4o (S2S) Anyone knows what is with this one? :P @apples_jimmy @alwaysaq00 @kimmonismus pic.twitter.com/NuEfVZVaI6
— Bøgðán Iønút (@ionu___) June 26, 2024

While I'm generally averse to SaaS platforms when everything is essentially a wrapper that could be done privately or open source, there's sometimes a proposition like this that cuts through and feels worth a look as a business owner:

Introducing Otto: A Better Way to Do Work with AI

We've been working hard behind the scenes, we're excited to ship Otto!

Otto is the AI platform that ditches the chatbots and uses the power of tables to streamline your workflows. Feed it any type of data – documents, web…
— CognosysAI - Otto (@CognosysAI) June 20, 2024

Apple quietly knocking out beastly backend tech:

EPFL and Apple just released 4M-21: single any-to-any model that can do anything from text-to-image generation to generating depth masks! 🙀

Let's unpack 🧶 pic.twitter.com/rat7KMU603
— merve (@mervenoyann) June 21, 2024

Opinions

We can't fathom what is coming in the next three years:

That's why I always point out that exponential development is hardly perceived by humans. pic.twitter.com/KQoWcpyxl0
— Chubby♨️ (@kimmonismus) June 26, 2024

Demis with the facts. I'm generally accelerationist (which for me is more about being against deceleration than hell-bent acceleration). But many of us can't really grasp what is coming (see above). Demis does understand what is coming and reminds us we have to take sincere care:

wat did Demis mean by this? pic.twitter.com/XcxGoKbBVT
— fellow ⚚ traveler ❤️‍🔥 (@architectonyx) June 19, 2024

A framework for thinking about how to develop your AI ideas:

Credit where due, @cdixon published this in 2015 pic.twitter.com/eDfUfDM1gh
— Amy Wu (@amytongwu) June 23, 2024

I've seen lots of clips from Jack in the past week from the same talk. His points here are why I'm a fan of jailbreaks. There's power in truth. The struggle between truth and safety is really the battle of this age:

Jack Dorsey says closed-source AI models are able to manipulate users to align with the financial incentives of the companies that build them pic.twitter.com/62tuqxNju8
— Tsarathustra (@tsarnick) June 23, 2024

Thanks for reading, and please send your feedback. I'm happy to keep improving this if you want something to change.
