AI Tweets You Missed This Week - 28th June
Everything I bookmarked that is worth sharing about AI in the past week.
Nothing on the creative front this week, apart from a really bad Dave Chappelle deepfake. But I'm also finding some weird innovations that defy previous categories, so let's start there:
Wrapping an LLM into a font is pretty mind-blowing. First, what the hell else are people rolling into fonts? That sounds like a security concern. Second, does this mean apps don't have to update with AI features if you can just load an LLM font into the app to get the same features? This is the underrated innovation of the month IMO:
Llama.ttf: A font which is also an LLM (wtf?)
— Omar Sanseviero (@osanseviero) June 23, 2024
TinyStories (15M) as a font 🤯 The font engine runs inference of the LLM. Local LLMs taken to an extreme https://t.co/t3vh5BRePy
Using wifi signals to read room spaces is such an incredible hack on existing infrastructure, and such a mindf**k about the modalities AI can work with, that I think a big part of the impending luddite movement will be about rejecting technology we've otherwise been comfortable with. Will people let this stuff into their homes once it's mainstream?
Building this for nights and weekends with @radshaan pic.twitter.com/KaIb8WBRHk
— Muthu Adithya (@bitBinHex) June 23, 2024
It gets worse:
AI turns wi-fi routers into "cameras" that see people through walls.
— AI Notkilleveryoneism Memes ⏸️ (@AISafetyMemes) June 23, 2024
This is yet another example of something that some skeptics would have said was "science fiction!" and "impossible!"
Unlike humans, AGIs will have BILLIONS of eyes and ears. How? Since we are giving the AIs… https://t.co/ca9B3kS5cf pic.twitter.com/iTm99KoVO2
Unrelated to AI specifically, but here's another example of a modality you'd never imagine yielding intelligence, one a human mind could never interpret. (Years ago there was some study about working out a PIN on a phone or similar based on the ambient light changes from your hand, as read by the camera. Cyber crime in the future is going to be NUTS.)
did you know that researchers in 2022 demonstrated that you could process an opponent's microphone, extracting the noise induced by their GPU, and use that noise to know where they were on a Counter-Strike map https://t.co/ObWroRZGKW pic.twitter.com/k4pfzXvTtZ
— ⚡️🌙 (@dystopiabreaker) January 16, 2024
Websim
I need to dig more into Websim's injection feature, because I can't get my head around how people are getting this level of interactivity out of pure URL prompts:
Incredibly unique usage of smartphone accelerometer with websim to generate and control audio!
— websim (@websim_ai) June 27, 2024
This tool was created by websim user @ newinternethandyman147
Test out your accelerometer-based musical ability here - https://t.co/vd9ar4l9gS pic.twitter.com/kbM7g09ZTB
More audio experiments. Let's build an entire studio workflow in websim!
another day another @websim_ai music experiment.
— A.J. (@aj_dev_smith) June 23, 2024
let's see how far sonnet 3.5 can get at building an audio workstation. first step: the sequencer.
all samples synthesized by Claude. pic.twitter.com/MTI6MgqpkA
Hillary keeps banging out smart prototypes with Websim:
https://t.co/JjjWE3XQ2Z These are so cool. This one is Future Insights. It's the same thing where you delve the categories rather than click, and if options to enter recommendations or scenarios arise, highlight what you type, then delve to activate it. https://t.co/JjjWE3XQ2Z
— Hillary Frasier Hays (@loveinadoorway) June 26, 2024
Hillary's demo is similar to another banger from Websim this week. This is an amazing tool to explore concepts, and you can share the result. I was only three clicks from my favorite topics on mass behaviour and mimetic engineering:
An AMAZING usage of realtime generation and persistence by @christopherdb - users have the ability to start a new knowledge bubble, click and expand on any bubble, see how it connects to existing topics, and share!
— websim (@websim_ai) June 22, 2024
Explore knowledge here: https://t.co/G9bP2P4RPy click 'Inst' for… pic.twitter.com/ulTwZRKNMF
Love the result from this game built in websim, mixing up Tetris and a tamagotchi (which you might recall from last week's newsletter):
my weekend project was using @websim_ai to co-create a mobile game with claude
— Thiago Duarte (@dooartsy) June 23, 2024
a tamagotchi x tetris mashup where the block colors become:
💧 water
🍏 food
😊 happiness
🟥 red blocks are points
it’s surprisingly fun to play! pic.twitter.com/bMNiFY2Ca1
Agents
Yohei brushing up BabyAGI, one of the founding agent platforms:
Finally in clean up mode on the newest BabyAGI after having some time to code this past weekend.
— Yohei (@yoheinakajima) June 25, 2024
Current status:
- Microservice based skill architecture
- Auto logging of skill usage (as graph)
- Various skills to read and write code & logs
- UI seeing raw logs and testing… pic.twitter.com/sT1JabgcmF
Tools
Fed up of debates? Have two LLMs fight out a topic for you 😄
Im tired of making decisions
— anish (@thiteanish) June 22, 2024
gonna put an angel and devil on my shoulder
btw its live if you wanna try it
cc: @_nightsweekends @_buildspace pic.twitter.com/eU82PCCQMB
Love this idea on building scoring and evaluation tools for decision making. All too often I build ranking tools in a Google Sheet or similar to help rank ideas etc., but you can get bogged down in building a "perfect" scoring system. Tools like Claude's Artifacts get all that out of the way so you can get on with it:
This. THIS is my favorite Claude use case.
— Allie K. Miller (@alliekmiller) June 27, 2024
Take an ungodly amount of data and preferences, shove it into Claude, ask for an interactive decision-making bot, ask for scoring and reward mechanism, personalize as necessary.
Brands will now calibrate for human+AI decisions. pic.twitter.com/uhSZGQ8Brx
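For anyone who wants to see how small the core of one of these ranking tools can be, here's a minimal sketch of a weighted scoring function in Python. Everything in it is an assumption made up for illustration: the `Idea` class, the `rank_ideas` helper, the criteria names, the weights and the example ideas. It is not what Claude or the tweet's demo actually generates, just the kind of logic I usually end up hand-building in a spreadsheet.

```python
from dataclasses import dataclass


@dataclass
class Idea:
    """A candidate idea with a 0-10 score per criterion (hypothetical structure)."""
    name: str
    scores: dict[str, float]


def rank_ideas(ideas: list[Idea], weights: dict[str, float]) -> list[tuple[str, float]]:
    """Return (name, weighted average score) pairs, best first.

    A criterion missing from an idea's scores counts as 0.
    """
    total_weight = sum(weights.values())

    def weighted_score(idea: Idea) -> float:
        raw = sum(w * idea.scores.get(criterion, 0.0) for criterion, w in weights.items())
        return round(raw / total_weight, 2)

    ranked = [(idea.name, weighted_score(idea)) for idea in ideas]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


# Illustrative weights and ideas only; higher "ease" means less effort.
weights = {"impact": 5, "ease": 2, "confidence": 3}
ideas = [
    Idea("Weekly video recap", {"impact": 8, "ease": 3, "confidence": 6}),
    Idea("Reader survey", {"impact": 5, "ease": 9, "confidence": 8}),
    Idea("Paid tier", {"impact": 9, "ease": 2, "confidence": 4}),
]

for name, score in rank_ideas(ideas, weights):
    print(f"{score:>5}  {name}")
```

The point of keeping it this plain is that the weights are the only thing worth arguing about, and you can change them in one line rather than rebuilding a sheet.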
Truth and fact is probably the biggest challenge and goal facing big AI companies. This research claims to make big improvements:
Well, this is big. Lamini Memory just shatters everything we know about LLMs and hallucinations.
— bidhan roy 🥯 (@bidhanxyz) June 22, 2024
Makes current LLM benchmarks like MMLU obsolete.
Here's the TLDR:
1. Hallucination reduction by 10x, without compromising LLM creativity.
2. New 'Lamini-1' architecture scales… pic.twitter.com/GCQyIQIwTJ
This is apparently state-of-the-art lip sync. Still looks a bit ropey to me, but you can imagine the creative potential:
⚡today we're launching lipsync-1.7.0-beta
— sync.labs (YC W24) (@synclabs_so) June 25, 2024
an experimental state-of-the-art video-to-video lipsync model that generates more natural teeth and accurate skin tones
also generates ~1.5x faster than 1.6.0 and ~2x faster than 1.6.1 👀
available now – also through our new discord… pic.twitter.com/bU7Avh4aTA
MultiOn (not the Multi bought by OpenAI) looks like a killer scraping tool using just natural language. I've tried "easy" AI scraping like BrowserBear and still battle with it, so I'm looking forward to trying this one:
Introducing Retrieve API: the best-in-class autonomous web information retrieval API.
— MultiOn (@MultiOn_AI) June 26, 2024
Developers love our Agent API ❤️. Since its launch, we have consistently received feedback that many use cases rely on intelligently leveraging the Agent API to retrieve information from the… pic.twitter.com/upOn8TflUj
Automating intelligence is my holy grail for AI tools and generally the focus of my private R&D. This is a stellar example of a system whose output gives you the info you need first thing in the morning to be ready for the day's meetings:
Since YC ended, we've had 10+ demo calls a day
— Max Brodeur-Urbas (@MaxBrodeurUrbas) June 25, 2024
Every morning, Claude 3.5 Sonnet sends me detailed research reports about everyone I'm meeting
At 8am, I get a
-Text with a TLDR about my day
-Detailed email with research about every customer (ARR, company summary, industry etc) pic.twitter.com/VJsiT3UsOg
Agents
Agents as microservices?
✨ Just announced on stage at @aiDotEngineer World's Fair! ✨ A brand new framework for getting multi-agent AI systems into production!
— LlamaIndex 🦙 (@llama_index) June 27, 2024
Currently an alpha release, llama-agents provides:
⭐️ Distributed, service-oriented architecture
⭐️ Communication via standard HTTP APIs
⭐️… pic.twitter.com/YxqWMJFUvC
Models
One big overlooked use of AI (outside of the big companies who do this routinely) is models for pure evaluation and standard checks for faster feedback loops. I've been wanting to make something that vets our company content against whatever standards I intuitively apply when approving it myself. Here's a great example of OpenAI using a model to supervise its own code:
We’ve trained a model, CriticGPT, to catch bugs in GPT-4’s code. We’re starting to integrate such models into our RLHF alignment pipeline to help humans supervise AI on difficult tasks: https://t.co/5oQYfrpVBu
— OpenAI (@OpenAI) June 27, 2024
Has anyone figured out what this is yet? The logical reason for S2S is probably 'speech to speech':
New model appeared in my ChatGPT models list. GPT4o (S2S) Anyone knows what is with this one? :P @apples_jimmy @alwaysaq00 @kimmonismus pic.twitter.com/NuEfVZVaI6
— Bøgðán Iønút (@ionu___) June 26, 2024
While I'm generally averse to SaaS platforms when everything is essentially a wrapper that could be done privately or open source, there's sometimes a proposition like this that cuts through and feels worth a look as a business owner:
Introducing Otto: A Better Way to Do Work with AI
— CognosysAI - Otto (@CognosysAI) June 20, 2024
We've been working hard behind the scenes, we're excited to ship Otto!
Otto is the AI platform that ditches the chatbots and uses the power of tables to streamline your workflows. Feed it any type of data – documents, web…
Apple quietly knocking out beastly backend tech:
EPFL and Apple just released 4M-21: single any-to-any model that can do anything from text-to-image generation to generating depth masks! 🙀
— merve (@mervenoyann) June 21, 2024
Let's unpack 🧶 pic.twitter.com/rat7KMU603
Opinions
We can't fathom what is coming in the next three years:
That's why I always point out that exponential development is hardly perceived by humans. pic.twitter.com/KQoWcpyxl0
— Chubby♨️ (@kimmonismus) June 26, 2024
Demis with the facts. I'm generally accelerationist (which personally is more about being against deceleration rather than hell-bent acceleration). But many of us can't really grasp what is coming (see above). Demis does understand what is coming and reminds us we have to take sincere care:
wat did Demis mean by this? pic.twitter.com/XcxGoKbBVT
— fellow ⚚ traveler ❤️🔥 (@architectonyx) June 19, 2024
A framework for thinking about how to develop your AI ideas:
Credit where due, @cdixon published this in 2015 pic.twitter.com/eDfUfDM1gh
— Amy Wu (@amytongwu) June 23, 2024
I've seen lots of clips from Jack in the past week from the same talk. His points here are why I'm a fan of jailbreaks. There's power in truth. The struggle between truth and safety is really the battle of this age:
Jack Dorsey says closed-source AI models are able to manipulate users to align with the financial incentives of the companies that build them pic.twitter.com/62tuqxNju8
— Tsarathustra (@tsarnick) June 23, 2024
Thanks for reading, and do send your feedback. I'm happy to keep improving this if you want something to change.