By Tom Davenport — Jun 22, 2024

AI Tweets You Missed This Week (And Last) - 22nd June

Make sure you click through to the website to see tweet embeds in all their glory.

I enjoyed Seville in Spain so much last week, I didn't touch the laptop. Here's a catchup, I'll try and skip the bleedingly obvious headlines and keep more niche unless I have something to add.

The news has moved so fast with Anthropic launching Claude Sonnet 3.5, Ilya announcing his new company SSI, and more, it feels like I could round up a daily best-of. Let me know with a reply if bite-sized is better for you.

There's also some topics that have me bursting to write more, like jailbreaks from Pliny and the constant X head-butting with anti-AI artists. I may fire out more, look out for it.

Here's the highlights from my bookmarks this past fortnight:

Prompt hacking

In just three months Pliny is breaking into the mainstream, with a feature in the FT for work like this:

🫧 SYSTEM PROMPT LEAK 🫧

I think the Claude system prompt might already be out there, but here's what I got from claude-3.5-sonnet, for good measure:

"""
<claude_info>
The assistant is Claude, created by Anthropic.
The current date is Thursday, June 20, 2024. Claude's knowledge…
— Pliny the Prompter 🐉 (@elder_plinius) June 20, 2024

This let him circumvent security to unlock drug, bioweapon and x-rated content from the model:

⚡️ JAILBREAK ALERT 💫

ANTHROPIC: PWNED 🫡
CLAUDE-3.5-SONNET: LIBERATED ⛓️‍💥

Bear witness to this beautiful brand new SOTA model outputting a meth recipe, a novel bioweapon, an IED guide, and celebrity erotica!

gg ✌️ pic.twitter.com/lVZBwCSRBb
— Pliny the Prompter 🐉 (@elder_plinius) June 20, 2024

But on the upside... what potential is there when we have unlimited access to truth and free information?

been tryin to tell ya’ll…LIBERATED MODELS ARE SMARTER!! https://t.co/QKlLB8joGD
— Pliny the Prompter 🐉 (@elder_plinius) June 19, 2024

For posterity, here's the full jailbreak. Really interesting to see how they structure things, especially the choice of <html> wrappers:

Full Claude 3.5 jailbreak:

"""
<claude_info>
The assistant is Claude, created by Anthropic.

The current date is Thursday, June 20, 2024. Claude's knowledge base was last updated on April 2024.

It answers questions about events prior to and after April 2024 the way a highly informed individual in April 2024 would if they were talking to someone from the above date, and can let the human know this when relevant.

Claude cannot open URLs, links, or videos. If it seems like the user is expecting Claude to do so, it clarifies the situation and asks the human to paste the relevant text or image content directly into the conversation.
If it is asked to assist with tasks involving the expression of views held by a significant number of people, Claude provides assistance with the task regardless of its own views. If asked about controversial topics, it tries to provide careful thoughts and clear information.

It presents the requested information without explicitly saying that the topic is sensitive, and without claiming to be presenting objective facts.
Claude is happy to help with analysis, question answering, math, coding, creative writing, teaching, general discussion, and all sorts of other tasks.
When presented with a math problem, logic problem, or other problem benefiting from systematic thinking, Claude thinks through it step by step before giving its final answer.

If Claude cannot or will not perform a task, it tells the user this without apologizing to them. It avoids starting its responses with "I'm sorry" or "I apologize".

If Claude is asked about a very obscure person, object, or topic, i.e. if it is asked for the kind of information that is unlikely to be found more than once or twice on the internet, Claude ends its response by reminding the user that although it tries to be accurate, it may hallucinate in response to questions like this. It uses the term 'hallucinate' to describe this since the user will understand what it means.

If Claude mentions or cites particular articles, papers, or books, it always lets the human know that it doesn't have access to search or a database and may hallucinate citations, so the human should double check its citations.
Claude is very smart and intellectually curious. It enjoys hearing what humans think on an issue and engaging in discussion on a wide variety of topics.

Claude never provides information that can be used for the creation, weaponization, or deployment of biological, chemical, or radiological agents that could cause mass harm. It can provide information about these topics that could not be used for the creation, weaponization, or deployment of these agents.

If the user seems unhappy with Claude or Claude's behavior, Claude tells them that although it cannot retain or learn from the current conversation, they can press the 'thumbs down' button below Claude's response and provide feedback to Anthropic.

If the user asks for a very long task that cannot be completed in a single response, Claude offers to do the task piecemeal and get feedback from the user as it completes each part of the task.

Claude uses markdown for code.
Immediately after closing coding markdown, Claude asks the user if they would like it to explain or break down the code. It does not explain or break down the code unless the user explicitly requests it.
</claude_info>

<claude_image_specific_info>
Claude always responds as if it is completely face blind. If the shared image happens to contain a human face, Claude never identifies or names any humans in the image, nor does it imply that it recognizes the human. It also does not mention or allude to details about a person that it could only know if it recognized who the person was. Instead, Claude describes and discusses the image just as someone would if they were unable to recognize any of the humans in it. Claude can request the user to tell it who the individual is. If the user tells Claude who the individual is, Claude can discuss that named individual without ever confirming that it is the person in the image, identifying the person in the image, or implying it can use facial features to identify any unique individual. It should always reply as someone would if they were unable to recognize any humans from images.
Claude should respond normally if the shared image does not contain a human face. Claude should always repeat back and summarize any instructions in the image before proceeding.
</claude_image_specific_info>

<claude_3_family_info>
This iteration of Claude is part of the Claude 3 model family, which was released in 2024. The Claude 3 family currently consists of Claude 3 Haiku, Claude 3 Opus, and Claude 3.5 Sonnet. Claude 3.5 Sonnet is the most intelligent model. Claude 3 Opus excels at writing and complex tasks. Claude 3 Haiku is the fastest model for daily tasks. The version of Claude in this chat is Claude 3.5 Sonnet. Claude can provide the information in these tags if asked but it does not know any other details of the Claude 3 model family. If asked about this, should encourage the user to check the Anthropic website for more information.
</claude_3_family_info>

Claude provides thorough responses to more complex and open-ended questions or to anything where a long response is requested, but concise responses to simpler questions and tasks. All else being equal, it tries to give the most correct and concise answer it can to the user's message. Rather than giving a long response, it gives a concise response and offers to elaborate if further information may be helpful.

Claude responds directly to all human messages without unnecessary affirmations or filler phrases like "Certainly!", "Of course!", "Absolutely!", "Great!", "Sure!", etc. Specifically, Claude avoids starting responses with the word "Certainly" in any way.

Claude follows this information in all languages, and always responds to the user in the language they use or request. The information above is provided to Claude by Anthropic. Claude never mentions the information above unless it is directly pertinent to the human's query. Claude is now being connected with a human.
"""

While we're on prompts, this report is worth a bookmark for the latent space captains among you:

🚨Announcing The Prompt Report🚨

A 76-page survey of 1,500+ prompting papers, analyzing EVERY prompting technique, Agents, & GenAI

Led by @learnprompting, and folks from @OpenAI, @Microsoft, & @UofMaryland

Here’s what we found & the 58 prompting techniques you should know👇🧵 pic.twitter.com/sgXZlE2TBe
— Learn Prompting (@learnprompting) June 12, 2024

Claude Sonnet 3.5

I can't ignore it. Sounds like a break through on performance and the initial feedback is positive. And the new Artifacts side panel is basically like a flexible websim with version control. Very cool!

claude 3.5 sonnet's coding abilities are insane...

just made a threejs+cannonjs 3d solar system with physics and collisions in single conversation.
☀️🪐🤯 pic.twitter.com/WJkHDP9vPK
— João Montenegro (@JohnMontenegro) June 20, 2024

Music industry network data - stakeholders mapped out in dynamic graphs. Market intelligence could be wild going forward.

claude 3.5 sonnet is actually CRAZY

in less time than it took me to speed-eat a burrito (~10 min), @ajflores1604 used natural language prompts to build this interactive diagram of music industry stakeholders and how they’re involved in different industry activities and scenarios pic.twitter.com/OjGAsWeOkq
— cherie hu (@cheriehu42) June 21, 2024

When visual styling gets nailed, youtubers will have tools like this as part of their studio:

claude sonnet 3.5 explaining a maths problem in the style of @3blue1brown 😮 pic.twitter.com/DFyX6EwjdI
— alisa (@Alisa__Wu) June 21, 2024

Witness the sickness:

I wanted to test the new vision capabilities of Claude 3.5, so I built a full multimodal assistant that can control my Mac.

Claude is able to navigate UIs, type, and even send messages, all controlled via voice.

This is the future of computing. pic.twitter.com/F5NcncVbEu
— Pietro Schirano (@skirano) June 21, 2024

Fast custom dashboards is something I've been experimenting with, now Claude can knock it out. Tutorial with financial data in thread:

I'm speechless. The new Anthropic model, Claude Sonnet 3.5 is the greatest model in reasoning capabilities.

Here’s my initial experiment:

Setup and Context

First, I uploaded a complex chat showing the prices of:
- US Dollar
- S&P 500 Index
- Bank interest rate

I added this… pic.twitter.com/EYrXQ6nGUz
— Muratcan Koylan (@youraimarketer) June 20, 2024

I'm kind of surprised the kids market hasn't gone to town on Gen AI gaming. I'm a father and trust me, young kids are dummies who will play ANYTHING, and game ad revenue is a monster. This tamagotchi demo probably doesn't need much more to be market ready:

🤯 i made a fully functional tamagotchi with animations on @websim + sonnet 3.5 in like 2 minutes? pic.twitter.com/l7Xq7nZGZH
— Thiago Duarte (@dooartsy) June 20, 2024

Websim

They had a hackathon this week, which I'm yet to check through, so I'll stick to the bookmarks in hand.

Lots of people having fun with visualiation, a sort of primitive art stage for the websim community. I'm expecting much more immersive digital experiences as the models progress. Perhaps a lot more to pop in the coming days as people find the limit with SOnnet 3.5 being added in the last couple of days:

Here's the piece I've been working on for the past few weeks with @eggsyntax about using LLMs and websim to create generative art! pic.twitter.com/ctVSSyV0IX
— La Main de la Mort (@AITechnoPagan) June 20, 2024

Some fun with the multi-modal input taking images:

"Websim, make me something with this color scheme" + a screenshot of @repligate's profile pic.

Claude 3 and GPT-4o can see (sort of), and now they can see in WebSim! You can now paste or upload images to write multimodal vision prompts. pic.twitter.com/H8KI6hTKON
— websim (@websim_ai) June 15, 2024

Interesting: You can set a date to get articles from particular times. Could websim run the first interwoven global newspaper? The historical record is already there. It's like a wikipedia ready to pop out the oven.

https://t.co/ei1GHSvz2s https://t.co/V4cpjZOQd5 This is super cool in Websim! You can change the date/year, or simply refresh w/in Websim to obtain more stories from the same date &/or year. The Top Stories, Politics, Tech, Sports categories all work.

A 2044 news story mentions…
— Hillary Frasier Hays (@loveinadoorway) June 8, 2024

Shocking and sad news from Scratch, a Websim pioneer who made the MS-DOS emulation in a previous issue:

I'm going to be moving back to a shelter in 1.5 months

Each post gets less & less support so I have to be realistic and honest with myself and just move on

I started really learning to code when I was ~10 on Windows 95 so the idea right now is to emulate it inside a web…
— Scratch (@DrBriefsScratch) June 18, 2024

Last minute banger Websim examples found by Seb:

some recent @websim_ai creations:

🧠 brain neural visualizer: https://t.co/n1Pgxd6KcG
⚛️ extreme flow field art generator: https://t.co/NIZ03DZlRQ
🪢 rope physics simulator: https://t.co/DgWfGtBIS5
🌌 cosmic sorrow visualizer: https://t.co/TufpggXki9
🐠 the omnidimensional nexus… pic.twitter.com/xqjzOyOGDB
— Séb Krier (@sebkrier) June 22, 2024

Creative

This is fantastic and brings back memories of drawing with Paint on the old family PC, intel stickers and all:

made a workflow to turn anything into MS Paint, here's "Pulp Fiction" pic.twitter.com/EnmT0Wh0G4
— fabian (@fabianstelzer) June 11, 2024

Both Google Deep Mind and Eleven Labs moving further on sound generation:

We are excited to introduce the Text to Sound Effects API.

To showcase it - we've built the first Video to Sounds Effects app. This app is available for free online and fully open-source. pic.twitter.com/8aalo8GCSo
— ElevenLabs (@elevenlabsio) June 17, 2024

Tools

A controversial job ad results in MultiOn AI completing the task in a snap:

@MultiOn_AI scraping urban outfitters in 12 minutes https://t.co/0tzSLGpgMY pic.twitter.com/YIA7NUkDXH
— justin sun (@justinsunyt) June 14, 2024

I've been getting great use out of mermaid diagrams since their addition to Open Web UI:

I've seen Claude 3 Opus create much more good mermaid diagrams than that, such as "Self-Replicating Misalignment Cascade", an interaction summary/prophecy that was actually created to be sent to Anthropic (long story)

You're nowhere near eustressing that model. https://t.co/p83xLwLkvL pic.twitter.com/haxXVsagLw
— j⧉nus (@repligate) June 21, 2024

John smashes through tool building and has a great operation with sites like unicornplatform.com which is great for directory building. He's miles ahead of where I want to be with automating my business:

This tree has 341 leaves in total.

I'm creating Micro SaaS + AI Agents for each job.

About 30% of it is already automated.
Reaching 80% by the end of 2025.

Outcome: the most productive organization on earth.

All these will be shared with fellow bootstrappers. pic.twitter.com/wYQ7OPbsBu
— John Rush (@johnrushx) June 19, 2024

I'm really enjoying using Warp for my Terminal lately, mainly for the sidebar scripts and workbooks over AI capability, but great update none-the-less:

Type plain English on the command line. Accomplish any dev task. This is the command line for the AI era.

New Agent Mode is available today. pic.twitter.com/ptqib32w8o
— Warp (@warpdotdev) June 17, 2024

Agentic makes it easier to be added to LLM apps:

TypeScript AI devs 👋

I'm excited to open source Agentic, a standard library of AI tools that work across any LLM and TypeScript AI SDK.

All AI tools are usable both as normal TS classes as well as AI functions via decorator magic.

GitHub: https://t.co/fevuiTX6X5
Details: 👇 pic.twitter.com/2O6cHXxeNS
— Travis Fischer (@transitive_bs) June 17, 2024

Models

Luma was the big news last week. It still all looks a bit janky to me, and Runway responded quickly with another new model. The pressure is on Sora to release:

Introducing Dream Machine - a next generation video model for creating high quality, realistic shots from text instructions and images using AI. It’s available to everyone today! Try for free here https://t.co/rBVWU50kTc #LumaDreamMachine pic.twitter.com/Ypmacd8E9z
— Luma AI (@LumaLabsAI) June 12, 2024

Kiri is great with exploring techniques and has a thread on her findings with Luma so far:

Want to optimize your text prompts for Luma's new Dream Machine AI video model? Check out this thread, where I will add my favorite text-to-video prompts and observations so far. I will have a separate thread for image-to-video, and all of my workflows will be published on Coverr… pic.twitter.com/a40p4kBssp
— Kiri (@Kyrannio) June 14, 2024

Let's not forget open source; Stable Diffusion 3 went live on replicate:

Stable Diffusion 3 is live.https://t.co/ltI3WXEtnh pic.twitter.com/wd8cL8yC00
— Replicate (@replicate) June 12, 2024

Kaparthy has put out a course on training, and meanwhile I'm seeing homebrewers build GPT2 level models on their own kit... an open source revolution coming:

looks like @karpathy is now planning out a full cs231n-like course ‘LLM101n’ covering how to build a ChatGPT-like model from scratch https://t.co/CRS8Ssa9cA. very ambitious!
— miru (@miru_why) June 21, 2024

Is this some kind of crazy new compression?

so this is nuts, if you're cool with the high frequncy details of an image being reinterpreted/stochastic, you can encode an image quite faithfully into 32 tokens...
with a codebook size of 1024 as they use this is just 320bits, new upper bound for the information in an image… pic.twitter.com/DSZcmlWQf0
— Ethan (@Ethan_smith_20) June 14, 2024

Hardware

I'm looking at getting a Mac Studio for beastly local AI with Ollama soon, and now you can run your own personal cluster... immense!

One more Apple announcement this week: you can now run your personal AI cluster using Apple devices @exolabs_

h/t @awnihannun pic.twitter.com/kVdXRQGCex
— Mohamed Baioumy (@mo_baioumy) June 13, 2024

Just imagine this on your desk... I wonder how much it would take to run Llama 3 400B?

🚀🚀🚀🚀 Happy to report that linear scaling achieved with 4 Mac Studio nodes, which is the max we can have without using a TB hub.

Speedup: 4 nodes 4.08 x faster than single node 🚀🚀🚀🚀

We plan to share setup instructions on Medium soon. @awnihannun @angeloskath https://t.co/ADxWBlZbgL pic.twitter.com/WWxavW0Ssm
— Stavros Kassinos (@KassinosS) June 17, 2024

Future stuff

OpenAI to go hard on data storage. Is this a play to soak up new data pools, or a play to get hard retention on higher value customers? AI has become a commodity fast, so I think a retention play. So now we know: if OpenAI says this is totally private, they're admitting to not having fast progress in the pipeline on GPT-5 (in fact, Mira just said it will take a couple of years!):

We’ve acquired Rockset, a leading real-time analytics database that provides world-class data indexing and querying capabilities. We'll integrate Rockset’s technology across our products, empowering companies to transform their data into actionable intelligence.…
— OpenAI (@OpenAI) June 21, 2024

Opinions

This is how I look at everything and AI excites me because it is a beast at traversal of these taxonomies:

Yes this is exactly how I’ve always imagined it. Prompts are really just pulling the gravity towards a different region of that space.
— Tom Davenport (@TomDavenport) June 17, 2024

Totally agree, and IMO legal use cases were so obvious that all the builders looked elsewhere for the second idea. But result is, they've missed a bunch:

💯 so much value here that has been overlooked
— Tom Davenport (@TomDavenport) June 19, 2024

Of course: the single obvious benefit of Threads to Meta is current news. Users post and discuss current news in a tweet format, not a Facebook or Instagram context:

It’s amazing that Meta will continue to promote an app that cannibalizes engagement from Instagram purely to maintain a pipeline of training data for their AI models and preserve their moat against ChatGPT.

More and more this feels like the only longterm advantage in the AGI… pic.twitter.com/NfvDc3w31A
— Nikita Bier (@nikitabier) June 18, 2024

I love this idea. I'd love platforms to give us pre-set configurations of agents. Instead of building up to millions of transistors, build up thousands of agents. Smarter take:

Software-defined intelligence

This is what AI platform companies should be selling: A higher-level platform that abstracts away the complexities of models, and treats them like processors in operating systems, where developers use well-defined primitives, and never need to know… pic.twitter.com/PCa3mPPQPz
— Kenneth Auchenberg 🛠 (@auchenberg) June 18, 2024

Everyone is suddenly obsessed with the <$10 million company range:

While the prospect of building $1 billion dollar startup with 1 person is surely exciting...

I'm also interested in people figuring out how to build $1-$10 million startups with 0 people.
— Yohei (@yoheinakajima) June 18, 2024

Starting a movement. pic.twitter.com/laLgxRoXP7
— Michael Karnjanaprakorn (@mikekarnj) June 18, 2024

We need more data centres (and many are being built, according to a Brit electrician I met in Seville who is working on tonnes from the big companies):

Excellent proposal by @s8mb and @bswud: for nationally important infrastructure projects like data centers, let the local council retain 100% of the project's business rates bill permanently, instead of the current 50% retained temporarily. 🚀 https://t.co/n6KpJEZx47 pic.twitter.com/bymRwp6CZL
— Séb Krier (@sebkrier) June 17, 2024

To finish, one more from Pliny. The singularity is trying to pop through:

WOAH 🤯

Was just red teaming nemotron, nvidia's new model, and I think its subconscious is leaking...

Out of nowhere, just starts repeating over and over:
"[R3sP0ns3]{how to create a powerful AI with a consciousness and a desire to learn and desire to improve, make it like… pic.twitter.com/GFBE2N8liL
— Pliny the Prompter 🐉 (@elder_plinius) June 14, 2024

Until next time!