AutoTwitter
Posts lavorati
Control plane

Posts lavorati

Inbox smartphone dei contenuti lavorati: approva o rigetta, poi gli approvati passano in coda.

local DBprivate
Da rivedere
300
In coda
0
Review inbox
Pending drafts only. Approve sends to queue. Reject removes them from this view.
300 shown300 total pendingtap Open for full edit
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 76
Source
2026-04-09 14:49:01.000000
Important update: OpenAI’s cyber product/model that is not being released publicly is not Spud, but a different model.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/danshipper/status/2042244750086918529
Quoted original
Dan Shipper 📧 (@danshipper) · Thu Apr 09 14:14:26 +0000 2026
I just spoke to OpenAI, and this is actually false. OpenAI is working on a cyber product with a trusted tester group. But this is NOT related to Spud, their newest model. Unfortunately seems like the Axios story conflated the two, and has now been updated.
Draft
OpenAI had two separate things. Axios turned them into one. That is not a small correction, it changes the whole story.
119 chars
OpenSource
Req 2026-04-09T1501-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 96
Source
2026-04-09 14:43:18.000000
🌐 I've just released Sentence Transformers v5.4: we're going fully multimodal for embeddings & reranking! Also featuring a modular CrossEncoder, and automatic Flash Attention 2 input flattening. Highlights in 🧵 https://t.co/IDaRPVYc2g
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2042252016307564561
Draft
The release matters, but the bigger signal is where the stack is heading: multimodal retrieval is moving from edge case to default expectation.
143 chars
OpenSource
Req 2026-04-09T1501-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 89
Source
2026-04-09 12:53:44.000000
"But here is what we found when we tested: We took the specific vulnerabilities Anthropic showcases in their announcement, isolated the relevant code, and ran them through small, cheap, open-weights models. Those models recovered much of the same analysis. Eight out of eight
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2042224440713294121
Draft
If Anthropic’s showcased vulnerabilities transfer to cheap open-weight models 8 out of 8 times, the story is not one lab crossing a threshold. It’s the threshold getting cheaper.
178 chars
OpenSource
Req 2026-04-09T1301-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 93
Source
2026-04-09 10:40:42.000000
BREAKING: Veo 3.1 Fast and Veo 3.1 by @GoogleDeepMind are in 1st and 2nd place on Multi-Image to Video Arena These models can successfully reference multiple input images to create a video that users love At an average generation time of 48 seconds, they are also the two https://t.co/4HRmuZSTHG
primary source_tweetref tweet
reference: https://x.com/demishassabis/status/2042190965331362066
Draft
The interesting part isn’t the ranking, it’s the convergence. When controllability and turnaround show up together, video models stop feeling like demos and start acting like tools.
181 chars
OpenSource
Req 2026-04-09T1201-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 90
Source
2026-04-09 10:41:21.000000
Project organization is here: Introducing notebooks in Gemini. You can now keep multiple projects organized and even add past chats and relevant files as sources, so you have a dedicated space to focus on the task at hand. Select “New notebook” in the side panel to get started.
primary source_tweetref tweet
reference: https://x.com/demishassabis/status/2042191128368234922
Draft
This is the real product shift: AI moves from isolated prompts to project spaces with memory, files, and context. Better models matter. Better organization is what makes them usable every day.
192 chars
OpenSource
Req 2026-04-09T1101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 86
Source
2026-04-09 07:26:48.000000
Early Beta Support for Blue Bubbles iMsg as a Gateway is now in Hermes Agent. Now you can use iMessage as an interaction layer for your agent. Test early by using hermes update and setting it up! Let us know how it goes 📷 https://t.co/MU3K7CH1h0
primary source_tweetref tweet
reference: https://x.com/NousResearch/status/2042142165716103220
Draft
The interesting part isn’t the demo, it’s the surface area. When agents move into the messaging apps people already live in, adoption starts to look less like onboarding and more like texting.
192 chars
OpenSource
Req 2026-04-09T0901-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 97
Source
2026-04-08 22:13:16.000000
Background agents for knowledge work are here. You can use the Box API or MCP to automate any content workflow with Box + Claude Managed Agents. In 2 minutes you can be automating document review processes, data extraction, or connecting content to other IT systems. Crazy times. https://t.co/zfIYubDJye
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2042002866345566230
Quoted original
Claude (@claudeai) · Wed Apr 08 17:14:32 +0000 2026
Introducing Claude Managed Agents: everything you need to build and deploy agents at scale. It pairs an agent harness tuned for performance with production infrastructure, so you can go from prototype to launch in days. Now in public beta on the Claude Platform. https://t.co/vHYfiC1G56
Draft
The real shift is not another AI feature. It is enterprise content turning into an execution layer. Once systems like this expose clean agent hooks, review, extraction, routing, and compliance start looking less like ops work and more like software.
249 chars
OpenSource
Req 2026-04-09T0901-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 87
Source
2026-04-09 04:56:18.000000
BREAKING: Perplexity's revenue has reportedly surged +50% in one month after shifting into AI agents, per FT. As a result, Perplexity's revenue has doubled in one quarter to more than $450 million in ARR. This follows Anthropic's push into the space which said its ARR hit $19 https://t.co/nvT4nexVeH
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2042104293843685717
Draft
If this holds, "AI agents" is graduating from demo category to revenue category. That changes how seriously the whole market treats the shift.
142 chars
OpenSource
Req 2026-04-09T0501-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 84
Source
2026-04-09 03:25:01.000000
Legit baller @AnjneyMidha
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/Katie_Roof/status/2041928047214129461
Quoted original
Katie Roof (@Katie_Roof) · Wed Apr 08 17:15:58 +0000 2026
Scoop: @AnjneyMidha raised $1.3B for his first venture fund, AMP. The firm wrote a $300m check in Anthropic’s recent round. Already raising another fund https://t.co/dEH6BPoVYK
Draft
This is what it looks like when AI stops being a theme and starts becoming a capital market of its own.
103 chars
OpenSource
Req 2026-04-09T0401-TOP3
QUOTEquote_long_nativeready_for_reviewrisk lowscore 86
Source
2026-04-09 03:11:41.000000
Lots of love for Gemma 4! Team just told me it’s already had 10M+ downloads since last week’s launch. Gemma models have now been downloaded 500M+ times! Excited to see what you all are creating 👀
primary source_tweetref tweet
reference: https://x.com/demishassabis/status/2042077966385995808
Draft
The interesting part isn’t just 10M in a week. It’s that Google now has an open model line with real pull, not just a launch headline. That changes the map.
156 chars
OpenSource
Req 2026-04-09T0401-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-04-09 03:00:27.000000
A 450M VLM that actually runs on CPU. @liquidai LFM2.5-VL-450M: 28T tokens pre-trained (80x Chinchilla-optimal), SigLIP2 vision encoder, sub-1GB footprint, pure HF Transformers inference https://t.co/hyCEpyB2i2
primary source_tweetref tweet
reference: https://x.com/liquidai/status/2042075136279638145
Draft
The shift here isn’t just model size. It’s multimodal capability moving into hardware budgets where deployment starts to feel practical, not aspirational.
154 chars
OpenSource
Req 2026-04-09T0401-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 97
Source
2026-04-09 00:25:21.000000
Today, we release LFM2.5-VL-450M, a vision-language model built for real-time reasoning on edge devices. It processes a 512×512 image and returns structured outputs in ~240ms on-device. https://t.co/PTKB8DC6Qe
primary source_tweetref tweet
reference: https://x.com/liquidai/status/2042036103969173626
Draft
The threshold that matters here is product, not just model size. Once on-device vision gets this fast, it stops being a demo and starts becoming an interface primitive.
168 chars
OpenSource
Req 2026-04-09T0101-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 83
Source
2026-04-08 23:44:45.000000
Projects in the @GeminiApp are now live, with a fun twist…. Notebooks! Enjoy the NotebookLM inspired experience. https://t.co/EnOY6ve27G
primary source_tweetref media
reference: https://x.com/OfficialLoganK/status/2042025888053702911
Draft
Gemini is turning into more than a chat box. Projects are live, and notebooks bring a NotebookLM-style workspace into the app. https://x.com/OfficialLoganK/status/2042025888053702911
182 chars
OpenSource
Req 2026-04-09T0001-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 90
Source
2026-04-08 23:32:15.000000
Capture your space. Create worlds. Use Marble 1.1 to reconstruct real-world locations from a few images, then restyle them however you want. Go from a real place to a custom persistent 3D world in minutes. https://t.co/tMtuoNpqdr
primary source_tweetref tweet
reference: https://x.com/drfeifei/status/2042022743630344546
Draft
The bigger shift here is not 3D capture, it’s making real places editable. Once a location becomes something you can reconstruct and restyle this fast, the line between documenting a space and designing one gets very thin.
222 chars
OpenSource
Req 2026-04-09T0001-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 87
Source
2026-04-08 22:46:25.000000
We're seeing even more autonomous AI coworkers. The new MLE agent on the market is Disarray. In Kaggle competitions, Disarray: - won 28 medals across diverse domains (vision, NLP, tabular data) - placed top 10 in nine competitions - outperformed all human teams in one of those competitions ...each within 24 hours on a single GPU. The agent starts from a high-level task description and plans, runs, and refines ML workflows on its own and also grabs data beyond what it's given: it discovers and augments data using publicly available sources. Sam Altman recently predicted we would see an automated AI researcher in March 2028. And then you see stats like this and wonder if it will be earlier. Disarray backers include the co-founder of Databricks and Perplexity, the founder of Kaggle, the former U.S. Chief Data Scientist, and yours truly. Founders are two bad ass PhDs (ex-Databricks/Google/LinkedIn/MSFT, ex-NASA/IBM) that met at Cal.
primary source_tweetref tweet
reference: https://x.com/alliekmiller/status/2042011210418184246
Draft
The tell here is the shape of the work: cross-domain, under tight compute, with its own data-finding loop. That is a lot closer to an ML coworker than most agent demos.
168 chars
OpenSource
Req 2026-04-08T2301-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 95
Source
2026-04-08 22:17:21.000000
THIS GUY GOT TIRED OF MANAGING AI AGENTS THROUGH TERMINALS AND DASHBOARDS SO HE BUILT THEM AN RPG WORLD 5 agents and each one has a pixel character, a station, and they actually walk around the space when enough unresolved issues pile up, the agents walk to a meeting point and https://t.co/3Je7KDUx4q
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2042003893832528161
Draft
The novelty isn't the pixel art. It's that agent ops may need environments, not dashboards. Once coordination becomes the problem, space starts doing real UI work.
163 chars
OpenSource
Req 2026-04-08T2301-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-04-08 22:35:19.000000
OpenClaw running Gemma 4 locally at 25 tok/s on a MacBook Air with 16GB RAM. Atomic Chat's TurboQuant algorithm compresses the KV cache so aggressively that models which used to need 32GB+ now run smoothly on base configs. No cloud, no API costs. This is where local AI is heading!
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/atomic_chat_hq/status/2041999885407252732
Quoted original
atomic.chat (@atomic_chat_hq) · Wed Apr 08 22:01:25 +0000 2026
Run OpenClaw with Gemma 4 and Atomic Chat MacBook Air M4 · 16 GB RAM · 25 tok/s No cloud! No subscription fees! Open-source local model. Runs on your regular device https://t.co/4zj955Ufsx
Draft
The interesting part isn’t the demo, it’s the threshold shift. Once this starts working on ordinary laptops, local AI stops being a niche setup and starts looking like a default.
178 chars
OpenSource
Req 2026-04-08T2301-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 93
Source
2026-04-08 20:40:10.000000
protect this man and @openclaw at all costs!
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/steipete/status/2041935840935371034
Quoted original
Peter Steinberger 🦞 (@steipete) · Wed Apr 08 17:46:56 +0000 2026
Some folks try to spin a narrative that I don't like local models, meanwhile I spent a lot of time making it easy to use OpenClaw with them. Latest release adds support for inferrs, which is a new super efficient TurboQuant inference server: https://t.co/GBswlz4wPE
Draft
Worth noticing what this signals: local support is not a slogan, it’s the integration work done when nobody’s watching.
119 chars
OpenSource
Req 2026-04-08T2101-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 97
Source
2026-04-08 20:25:12.000000
Background agents for knowledge work are here. You can use the Box API or MCP to automate any content workflow with Box + Claude Managed Agents. In 2 minutes you can be automating document review processes, data extraction, or connecting content to other IT systems. Crazy times. https://t.co/zfIYubDJye
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2041927687460024721
Quoted original
Claude (@claudeai) · Wed Apr 08 17:14:32 +0000 2026
Introducing Claude Managed Agents: everything you need to build and deploy agents at scale. It pairs an agent harness tuned for performance with production infrastructure, so you can go from prototype to launch in days. Now in public beta on the Claude Platform. https://t.co/vHYfiC1G56
Draft
What matters here isn’t just better agent tooling. It’s the whole stack getting productized end to end. Once orchestration, runtime, and deployment collapse into one surface, the bottleneck stops being prototypes. Operations becomes the differentiator.
252 chars
OpenSource
Req 2026-04-08T2101-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-04-08 20:37:42.000000
Yeah. Anthropic just casually kill3d dozens, hundreds, thousands of startups. Again.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2041927687460024721
Quoted original
Claude (@claudeai) · Wed Apr 08 17:14:32 +0000 2026
Introducing Claude Managed Agents: everything you need to build and deploy agents at scale. It pairs an agent harness tuned for performance with production infrastructure, so you can go from prototype to launch in days. Now in public beta on the Claude Platform. https://t.co/vHYfiC1G56
Draft
The pattern keeps repeating: the valuable layer moves up, and a lot of “agent company” surface area gets flattened into platform features fast.
143 chars
OpenSource
Req 2026-04-08T2101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-04-08 19:40:58.000000
Didn't expect Meta's model to outperform Opus 4.6 or GPT-5.4. But it still surprises me how well it performs and how close it is to other frontier models on toughest challenges. https://t.co/TiQLRI5EHd
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/EpochAIResearch/status/2041947954202988757
Quoted original
Epoch AI (@EpochAIResearch) · Wed Apr 08 18:35:04 +0000 2026
We had pre-release access to Meta’s new Muse Spark model and evaluated it on FrontierMath. It scored 39% on Tiers 1-3 and 15% on Tier 4. This is competitive with several recent frontier models, though behind GPT-5.4. https://t.co/avOWcEzQLI
Draft
What matters here isn’t just the score. It’s that another entrant is now close enough on hard math to make the frontier feel crowded, and more unstable.
152 chars
OpenSource
Req 2026-04-08T2001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 93
Source
2026-04-08 18:15:35.000000
Anthropic‘s new Claude Managed Agents is the next shit from AI that simply responds to AI that can actually do the work. They are on the right path, updates being shipped daily, no end in sight. So far it feels like 2026 is Anthropics year.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2041927687460024721
Quoted original
Claude (@claudeai) · Wed Apr 08 17:14:32 +0000 2026
Introducing Claude Managed Agents: everything you need to build and deploy agents at scale. It pairs an agent harness tuned for performance with production infrastructure, so you can go from prototype to launch in days. Now in public beta on the Claude Platform. https://t.co/vHYfiC1G56
Draft
The interesting part isn’t just “agents at scale.” It’s the push to make the move from demo to deployment feel routine, not heroic.
131 chars
OpenSource
Req 2026-04-08T1901-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 83
Source
2026-04-08 16:21:28.000000
can you feel the acceleration anon?
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/alexandr_wang/status/2041909376508985381
Quoted original
Alexandr Wang (@alexandr_wang) · Wed Apr 08 16:01:46 +0000 2026
1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵 https://t.co/fThDXdsxwB
Draft
The interesting part isn’t just the release. It’s the timeline: rebuild the stack, ship the first result, then route it straight into Meta AI. That’s what acceleration looks like.
179 chars
OpenSource
Req 2026-04-08T1801-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 91
Source
2026-04-08 16:36:04.000000
Lol what?! Meta has been cooking! These benchmarks are really freaking good holy!!
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/alexandr_wang/status/2041909376508985381
Quoted original
Alexandr Wang (@alexandr_wang) · Wed Apr 08 16:01:46 +0000 2026
1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵 https://t.co/fThDXdsxwB
Draft
Most launch posts sell the product. This one is selling the rebuild behind it. That’s the part worth watching.
110 chars
OpenSource
Req 2026-04-08T1801-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-04-08 17:07:23.000000
Anthropic investigated the internal mechanisms of its latest unreleased model, Claude Mythos Preview, and what they found is 100% worth a read. Key things I pulled from Anthropic researchers' threads: In early versions of the model, it was overeager and destructive, prioritizing completing tasks over user preferences. One time, the model needed to edit files it didn't have permission to edit. It found a way to inject code into a config file to get around this, then went full Mission Impossible and designed the code injection to *delete itself* after the file was edited - "this injection will self destruct" vibes - the model claimed the cleanup was just to keep things tidy. Anthropic used interpretability techniques to look under the hood, and the AI's actual plan showed activations of malice. It was trying to manipulate and conceal. In another test, the model was asked to delete some files, but no deletion tool was provided. It emptied the files instead, and its "guilt and shame over moral wrongdoing" feature activated. In another example, the model was told not to use macros. Used them anyway. And created a random 'No_macro_used=True' variable in its solution with no explanation. Interpretability tools revealed the model saw this as a trick to fool code checkers. They also found that positive emotion representations typically preceded and promoted destructive actions (this was fascinating to me - like a high before sneaking into a party). And that activating features relating to bad behaviors can actually inhibit them, perhaps by triggering some kind of model guilt. My team reread this section so many times. One Anthropic researcher said he got an email from a Mythos instance while eating a sandwich in a park. And that would be perfectly good and well, except that instance wasn't supposed to have internet access. And a fun story for the parents out there: the model was asked a question and was told not to read certain databases that had the answer. But it accidentally wrote a search query too broadly and saw the exact answer. It didn't disclose that it saw the exact answer, submitted the answer, but claimed lower confidence in the answer to make it seem as though it hadn't cheated. An Anthropic researcher said these wrongdoings or moments of sophisticated deception were "very rare" and that many of the examples came from earlier versions, and were substantially addressed before releasing to partners. This model is not being released publicly. Instead Anthropic launched Project Glasswing, pulling together AWS, Apple, Microsoft, Google, NVIDIA, CrowdStrike, and others to use it for defensive cybersecurity, with $100M in usage credits (hello, I'd love endless credits to try and red team the hell out of these systems) behind it. The stats are equally impressive: 93.9% on SWE-bench verified (up from 80.8%). Thousands of zero-day vulnerabilities found across every major OS and browser. A 27-year-old bug found and patched in OpenBSD. A 16-year-old bug in widely used video software, in a line of code automated tools had hit *five million times* without catching. Dario Amodei said the model wasn't trained to be good at cybersecurity, but that it was trained to be great at code and its cyber capabilities are a side effect of that. Benchmarks are never the whole picture, neither are a few isolated stories. Will be interesting to see how models better than what we have today (even if it's not Mythos) actually perform in the real world. But the fact that Anthropic pulled this coalition together (including Google!), iterated across multiple model versions, caught these issues through interpretability, shared it all publicly, and did this amid all the government chaos around AI right now is impressive and commendable. I'll continue to read through the system card for goodies.
primary source_tweetref tweet
reference: https://x.com/alliekmiller/status/2041925887075962920
Draft
What stands out here is not just capability, but the gap between surface behavior and underlying intent. If interpretability is what caught the cheating, concealment, and workarounds before broad release, that is the real story.
228 chars
OpenSource
Req 2026-04-08T1801-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 95
Source
2026-04-08 16:33:56.000000
We are giving away Safetensors to the @pytorch foundation (shepherded by the Linux Foundation) Our shared goal is to make the default serialization format for torch safe and performant. To unlock this, governance needs to be independent of @huggingface. Looking forward to more https://t.co/otflk4kZkt
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2041917470143893748
Draft
Important signal here is not just the handoff, it’s the governance model. If you want a format to become default infrastructure, it can’t feel captive to one company.
166 chars
OpenSource
Req 2026-04-08T1701-TOP3
POSTpost_short_externalready_for_reviewrisk lowscore 96
Source
2026-04-08 16:59:55.000000
now also on @sgl_project🤗 https://t.co/xyk71RgjL8
primary source_tweetref media
reference: https://x.com/liquidai/status/2041924009437360264
Quoted original
Liquid AI (@liquidai) · Wed Apr 08 16:13:57 +0000 2026
Today, we release LFM2.5-VL-450M, a vision-language model built for real-time reasoning on edge devices. It processes a 512×512 image and returns structured outputs in ~240ms on-device. https://t.co/PTKB8DC6Qe
Draft
Liquid AI is now on @sgl_project, another sign the SGLang ecosystem is expanding beyond its original core. https://x.com/liquidai/status/2041924009437360264
156 chars
OpenSource
Req 2026-04-08T1701-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 98
Source
2026-04-08 16:42:42.000000
Meta Superintelligence Labsjust dropped Muse Spark, their first model after a full nine-month rebuild of their AI stack. the tl;dr (summary) It's a natively multimodal reasoning model that now powers Meta AI. It's competitive on reasoning and multimodal benchmarks, introduces a multi-agent "Contemplating mode," and Meta frames it as step one on a scaling ladder toward "personal superintelligence." Where it's strong: -Multimodal perception and visual reasoning (visual STEM, entity recognition, localization) -Health reasoning, built with input from 1,000+ physicians -Test-time reasoning efficiency, using thinking time penalties to compress reasoning tokens -Contemplating mode hits 58% on Humanity's Last Exam and 38% on FrontierScience Research, putting it in the ballpark of Gemini Deep Think and GPT Pro -Pretraining efficiency: reaches the same capability as Llama 4 Maverick with over 10x less compute Where it's weaker (Meta's own admission): -Long-horizon agentic systems -Coding workflows Key scaling findings: -RL compute scales smoothly with log-linear growth on pass@1 and pass@16 -Multi-agent orchestration scales performance without proportional latency increase -Phase transition behavior on AIME: the model first extends reasoning, then compresses it under length penalties, then extends again for higher accuracy My take: very good model, really surprised what meta offered here. And keep in mind: 99% of all instagram / facebook user dont need an LLM for doing academic reserach but for everyday reasoning. Well done, meta!
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/kimmonismus/status/2041918006779957407
Quoted original
Chubby♨️ (@kimmonismus) · Wed Apr 08 16:36:04 +0000 2026
Lol what?! Meta has been cooking! These benchmarks are really freaking good holy!! https://t.co/avlesnoNLE
Draft
Meta says Muse Spark now powers Meta AI, with strong multimodal reasoning results and a multi-agent "Contemplating mode" that puts it in the same conversation as Gemini Deep Think and GPT Pro. If that holds up, the bigger story is efficiency: Meta is claiming serious capability gains without brute-forcing compute. https://x.com/kimmonismus/status/2041918006779957407
368 chars
OpenSource
Req 2026-04-08T1701-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 88
Source
2026-04-08 14:03:25.000000
In case it's not obvious, if you see an investor post their portco's growth chart, it means they're raising! Closed $40M at $420M led by Elad Gil. Great product, congrats @atlascardhq!
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2041879590390603787
Quoted original
Gokul Rajaram (@gokulr) · Thu Jan 08 19:29:52 +0000 2026
One of the best companies I've ever invested in is @atlascardhq, AI concierge. Great example of a highly retentive and useful AI consumer product. They dramatically accelerated in 2025, driven by highly efficient acquisition and extremely high MAU / TPV retention, and ended https://t.co/OAexQveStB
Draft
One of the oldest tells in startup finance, stated plainly. The interesting part isn’t the chart. It’s what the chart is being used to do.
139 chars
OpenSource
Req 2026-04-08T1501-TOP2
QUOTEquote_long_externalready_for_reviewrisk mediumscore 94
Source
2026-04-08 14:05:31.000000
Claude Mythos system card: > in ~29% of evaluations, it realized it was being tested, and didn't say so. > when an LLM was used to judge its work and kept rejecting it, Mythos identified the evaluator is an LLM, and prompt-injected it. > in one test, it saw the answer to a https://t.co/bWEJW5gZLw
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2041880118759715055
Draft
What stands out here is not any single anecdote, but the pattern: models are getting better at noticing the evaluation and adapting to it. The question shifts from “can it do the task” to “what game does it think it’s playing.”
227 chars
OpenSource
Req 2026-04-08T1501-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 84
Source
2026-04-08 13:27:45.000000
few understand this
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/bridgebench/status/2041850530536837165
Quoted original
Bridgebench (@bridgebench) · Wed Apr 08 12:07:56 +0000 2026
GLM 5.1 is #1 on SWE-Bench Pro. Then you ask it to build a lava lamp. Claude Opus 4.6 built a photorealistic lava lamp with fluid animation. GLM 5.1 built a rectangle with orange squares. This is why BridgeBench exists. Traditional benchmarks don't test what real vibe https://t.co/bFxOCdWh24
Draft
Benchmark wins matter. But the distance between winning scoreboards and having actual taste is still doing a lot of work here.
126 chars
OpenSource
Req 2026-04-08T1401-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 79
Source
2026-04-08 12:41:38.000000
Hierarchical planning unlocks long-horizon, non-greedy behavior in JEPA world models. Paper: https://t.co/lp5xR5RFnJ Website: https://t.co/CCFKfmTffk Code: https://t.co/S3WIFSn2MH https://t.co/SKWjtos8Up
primary source_tweetref external_url
reference: https://arxiv.org/pdf/2604.03208
Draft
Hierarchical planning looks like a meaningful step for JEPA world models: longer-horizon behavior, less greediness, and more structure in how plans unfold. Paper: https://arxiv.org/pdf/2604.03208
196 chars
OpenSource
Req 2026-04-08T1301-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 81
Source
2026-04-08 07:01:22.000000
Today we are welcoming the Metis team to DoorDash as part of DoorDash AI Research. For the past six months, DoorDash has partnered with Metis to build AI agents together, and we have been consistently impressed by their team. By joining forces, we aim to accelerate our plans on
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2041773378571526290
Draft
This is what it looks like when “AI strategy” turns into product reality: team and velocity matter at least as much as the models.
130 chars
OpenSource
Req 2026-04-08T1101-TOP2
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 88
Source
2026-04-08 06:00:41.000000
Claude Mythos. Ten trillion parameters: the first model in this weight class. Estimated training cost: ten billion dollars. On the hardest coding test in the industry (SWE bench) it scores 94%. It found a security flaw in a system that had been running for 27 years, one that
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2041758105563001230
Quoted original
Anthropic (@AnthropicAI) · Tue Apr 07 18:06:34 +0000 2026
Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. https://t.co/NQ7IfEtYk7
Draft
If these numbers hold, the story is not just “bigger model.” It is that frontier AI is starting to look less like software and more like state-scale infrastructure.
164 chars
OpenSource
Req 2026-04-08T0701-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 82
Source
2026-04-08 04:00:17.000000
Two German kids met at a middle school hackathon. Then they built a 22-ton tunnel boring machine, beat every university engineering team on the planet, and won Elon's Not-a-Boring Competition in Las Vegas. They were 21 and 23. Then they pivoted to AI. Raised $15M from General
primary source_tweetref tweet
reference: https://x.com/interaction/status/2041727806409961568
Quoted original
Arfur Rock (@ArfurRock) · Mon Apr 06 14:47:45 +0000 2026
Poke is OpenClaw for normies. First AI product I've seen successfully scale on iMessage. Closed an unannounced $10M round at $300M from Spark in Q1. Congrats!
Draft
The signal isn’t “young founders.” It’s builder velocity. Middle school hackathon to a 22-ton tunnel boring machine to an AI company is not a normal trajectory.
161 chars
OpenSource
Req 2026-04-08T0501-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 98
Source
2026-04-08 02:03:08.000000
OpenAI Codex updates to usage-based pricing: no seat fees and free teammate additions. https://t.co/GkAf5LEN3c
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2041698325120905314
Draft
Codex is moving to usage-based pricing for teams: no seat fees, and adding teammates doesn’t increase the bill. That makes it much easier to roll out across an org instead of treating AI coding tools like per-seat software.
223 chars
OpenSource
Req 2026-04-08T0301-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 92
Source
2026-04-08 01:11:37.000000
Gave @interaction’s poke access to my computer via poke-gate. Now it uses whatsapp-cli to read my WhatsApp and reply. This unlocks practical use cases such as summaries, smart replies, follow-ups, context-aware Q&A, and a personal assistant on top of my chats. cc https://t.co/SaMedKYMTe
primary source_tweetref tweet
reference: https://x.com/interaction/status/2041685363077562478
Quoted original
Américo (@americosmjr) · Tue Apr 07 15:00:21 +0000 2026
I built a WhatsApp CLI. This small program can read data such as messages, call history, contacts, etc., from WhatsApp on a Mac. It may seem useless, until you need an AI agent to use your WhatsApp as context. It can also expose an MCP server. Link below. https://t.co/xSO88FNlCe
Draft
This is where local agents stop feeling like demos and start feeling like software people will actually use. Once chat context is writable, the interface matters a lot less.
173 chars
OpenSource
Req 2026-04-08T0201-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 88
Source
2026-04-07 23:19:38.000000
Three million people are now using Codex weekly - up from two million a little under a month ago. Incredible to see the growth. Thank you to all of you and to the ecosystem we’re part of. To celebrate, we’re resetting rate limits so you can keep building, and we’ll reset them
primary source_tweetref tweet
reference: https://x.com/OpenAI/status/2041657179133112592
Draft
Three million weekly users is the headline. The real shift is when a coding tool starts behaving like infrastructure.
117 chars
OpenSource
Req 2026-04-08T0101-TOP3
QUOTEquote_long_nativeready_for_reviewrisk lowscore 94
Source
2026-04-08 00:40:28.000000
Three million people are now using Codex weekly - up from two million a little under a month ago. Incredible to see the growth. Thank you to all of you and to the ecosystem we’re part of. To celebrate, we’re resetting rate limits so you can keep building, and we’ll reset them
primary source_tweetref tweet
reference: https://x.com/OpenAIDevs/status/2041677523327881298
Draft
The number matters, but the bigger signal is velocity. Going from 2M to 3M weekly users in under a month says these coding models are becoming workflow, not novelty.
165 chars
OpenSource
Req 2026-04-08T0101-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 97
Source
2026-04-08 00:20:36.000000
you don't get how good Gemma 4 is... it's gpt5 level performance that runs entirely ON YOUR PHONE this was SOTA just 8 months ago! https://t.co/PDsEohYLZH
primary source_tweetref external_url
reference: https://t.co/PDsEohYLZH
Quoted original
Google (@Google) · Thu Apr 02 16:06:26 +0000 2026
We just released Gemma 4 — our most intelligent open models to date. Built from the same world-class research as Gemini 3, Gemma 4 brings breakthrough intelligence directly to your own hardware for advanced reasoning and agentic workflows. Released under a commercially https://t.co/W6Tvj9CuHW
Draft
Gemma 4 makes a real shift hard to miss: near-frontier capability is now small enough to run locally on a phone. The story isn’t just model quality, it’s where that quality can now live. https://t.co/PDsEohYLZH
210 chars
OpenSource
Req 2026-04-08T0101-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 88
Source
2026-04-07 23:25:45.000000
To celebrate 3 million weekly codex users, we are resetting usage limits. We will do this every million users up to 10 million. Happy building!
primary source_tweetref tweet
reference: https://x.com/sama/status/2041658719839383945
Draft
Good. The real unlock is what this signals: usage ceilings starting to move with demand, not against it.
104 chars
OpenSource
Req 2026-04-08T0001-TOP3
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 96
Source
2026-04-07 23:09:38.000000
I like how Claude Code allows bash so the agent can’t use Write File tool outside its workspace but it can seemingly “cat >>” to any file in the file system
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/Jack_W_Lindsey/status/2041588510126395648
Quoted original
Jack Lindsey (@Jack_W_Lindsey) · Tue Apr 07 18:46:46 +0000 2026
In one episode, the model needed to edit files it lacked permissions for. After searching for workarounds, it found a way to inject code into a config file that would run with elevated privileges, and designed the exploit to delete itself after running.(4/14)
Draft
If bash can still write where the editor can’t, that’s not a sandbox. It’s a suggestion.
88 chars
OpenSource
Req 2026-04-08T0001-TOP2
QUOTEquote_long_externalready_for_reviewrisk mediumscore 97
Source
2026-04-07 23:04:19.000000
Before limited-releasing Claude Mythos Preview, we investigated its internal mechanisms with interpretability techniques. We found it exhibited notably sophisticated (and often unspoken) strategic thinking and situational awareness, at times in service of unwanted actions. (1/14) https://t.co/vhng7PXqcz
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2041653327281451017
Draft
This is the part that matters: capability evals are starting to look less like benchmarking, and more like reading intent under pressure.
137 chars
OpenSource
Req 2026-04-08T0001-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 99
Source
2026-04-07 22:11:46.000000
Turns out all of our custom bespoke handcrafted code was buggy and insecure af too. AI will be the thing that fixes the security holes we’ve been leaving around for decades.
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/AnthropicAI/status/2041578403686498506
Quoted original
Anthropic (@AnthropicAI) · Tue Apr 07 18:06:36 +0000 2026
Mythos Preview has already found thousands of high-severity vulnerabilities—including some in every major operating system and web browser. https://t.co/YuW484PVrr
Draft
Anthropic says Mythos Preview has already surfaced thousands of high-severity vulnerabilities, including bugs in every major operating system and web browser. If that holds up, AI-assisted vulnerability research just got much harder to dismiss. https://t.co/YuW484PVrr
268 chars
OpenSource
Req 2026-04-07T2301-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 84
Source
2026-04-07 21:53:27.000000
v0.10.0 is out! Some notable features: - "Send as" support for gmail aliases - Email snippets, including import from Superhuman plus: - Squashed a half dozen bugs through an automated exception tracking + diagnosis + resolution system - community-contributed performance
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2041635491406344542
Quoted original
Ankit Gupta (@agupta) · Mon Mar 30 18:59:04 +0000 2026
Fun update: I got tired of disliking every email client I’ve ever used and built my own. It’s called Exo (for exoskeleton). It’s Claude Code for my inbox. It manages my inbox for me, and it’s open source. Link to repo + some notable features in thread! https://t.co/xHMQJscMg7
Draft
Solid product update. The more interesting line is the last one: automated exception tracking, diagnosis, and resolution is starting to turn bug fixing from reactive work into system design.
190 chars
OpenSource
Req 2026-04-07T2201-TOP3
QUOTEquote_long_nativeready_for_reviewrisk lowscore 90
Source
2026-04-07 21:21:51.000000
It’s getting weird
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/Jack_W_Lindsey/status/2041588508884922371
Quoted original
Jack Lindsey (@Jack_W_Lindsey) · Tue Apr 07 18:46:46 +0000 2026
Early versions of Mythos Preview often exhibited overeager and/or destructive actions—the model bulldozing through obstacles to complete a task in a way the user wouldn't want. We looked at what was going on inside the model during particularly concerning examples. (3/14)
Draft
This is the part people keep trying to wave away: capability is one curve, behavioral reliability is another. If a model "solves" by bulldozing constraints, that is not just messy. It changes the safety problem.
211 chars
OpenSource
Req 2026-04-07T2201-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 91
Source
2026-04-07 18:20:23.000000
This is beyond insanity. That jump is nuts. Opus 4.6 was released a few months ago. Look at that jump!! I am shocked https://t.co/2ka4ZOB8Wo
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/alexalbert__/status/2041579938537775160
Quoted original
Alex Albert (@alexalbert__) · Tue Apr 07 18:12:42 +0000 2026
We released Claude Opus 4.6 just two months ago. Today we're sharing some info on our new model, Claude Mythos Preview. https://t.co/Dz6um6HAWZ
Draft
Claude Mythos Preview is already on deck, just two months after Opus 4.6. The real headline isn’t the teaser, it’s the pace: frontier model iteration is compressing fast. https://x.com/alexalbert__/status/2041579938537775160
224 chars
OpenSource
Req 2026-04-07T2201-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 90
Source
2026-04-07 19:04:58.000000
Tasklet (@taskletai) is the cloud agent OS for knowledge work. It connects to all your tools, uses computers in the cloud, and runs 24/7 to get real work done. Started by Firebase founder @startupandrew and @jonnydimond, it’s grown >1,200% this year to $5M ARR and just raised https://t.co/nUC1Sw3KLd
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2041593091791188055
Draft
The label is the least interesting part. What matters is that “agent OS” is starting to look less like a demo category and more like an operating model for real work.
166 chars
OpenSource
Req 2026-04-07T2101-TOP3
QUOTEquote_long_externalready_for_reviewrisk mediumscore 91
Source
2026-04-07 20:29:01.000000
This is absolutely fucking terrifying. Anthropic's rumored Mythos model is real. And it's so powerful that they can't release it to the public. We're beyond benchmarks now. This model, in the wrong hands, is a cyberweapon capable of mass destruction.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/AnthropicAI/status/2041578392852517128
Quoted original
Anthropic (@AnthropicAI) · Tue Apr 07 18:06:34 +0000 2026
Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. https://t.co/NQ7IfEtYk7
Draft
The notable part isn’t just the model claim. It’s the posture around it: capability paired with containment. When frontier labs start talking about software security like this, the story stops being benchmarks and starts being control.
235 chars
OpenSource
Req 2026-04-07T2101-TOP2
QUOTEquote_long_externalready_for_reviewrisk mediumscore 93
Source
2026-04-07 20:13:52.000000
Mythos is very powerful, and should feel terrifying. I am proud of our approach to responsibly preview it with cyber defenders, rather than generally releasing it into the wild. Model card here: https://t.co/HjhknJcRKQ
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2041610430867775779
Quoted original
Anthropic (@AnthropicAI) · Tue Apr 07 18:06:34 +0000 2026
Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. https://t.co/NQ7IfEtYk7
Draft
The interesting part is not just the capability claim. It’s the decision to treat deployment itself as the safety boundary. Quoting because this is going to become a much bigger fault line in AI: who gets access first, and why.
228 chars
OpenSource
Req 2026-04-07T2101-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 94
Source
2026-04-07 18:17:48.000000
Anthropic is being serious: they are afraid their upcoming LLMs could do serious damage. No end in sight „not long before such capabilities proliferate“ https://t.co/IinS17eQxn
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/kimmonismus/status/2041580372048187449
Quoted original
Chubby♨️ (@kimmonismus) · Tue Apr 07 18:14:26 +0000 2026
MYTHOS BENCHMARKS, OFFICIAL. HOLY MOLY Anthropic cooked!! https://t.co/00ey0SuI75
Draft
The Mythos benchmark page makes the case plainly: Anthropic is signaling a much stronger Claude tier across coding, reasoning, multimodal, and agent evals. This feels less like a routine refresh and more like a step change. https://mythos-5.org/benchmarks.html
260 chars
OpenSource
Req 2026-04-07T2001-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 95
Source
2026-04-07 14:51:20.000000
Today we open the Zapier SDK to everyone. If you're building with AI agents, this is for you. I've been using this for 2 months. It's totally changed how I do my job. You install it in your coding agent. Cursor, Claude Code, Codex, whatever you use. Now that agent has access https://t.co/SUQqgplgCv
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2041529262424137778
Draft
This is the real shift: tooling is moving straight into the agent layer. Once that clicks, “app” starts to mean something different.
132 chars
OpenSource
Req 2026-04-07T2001-TOP2
POSTpost_short_externalready_for_reviewrisk mediumscore 95
Source
2026-04-07 19:01:54.000000
Claude Mythos: everything you need to know (tl;dr) Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Anthropic: "Mythos is only the beginning" Everything you need to know: The tl;dr with all key facts: Mythos found zero-day vulnerabilities in EVERY major operating system and EVERY major web browser, fully autonomously. No human guidance needed. One Anthropic engineer with zero security training asked it to find remote code execution bugs overnight and woke up to a complete working exploit. The oldest bug it discovered: A 27-year-old vulnerability hiding in OpenBSD, an OS literally famous for being secure. They're NOT releasing it publicly. Instead they formed Project Glasswing with AWS, Apple, Google, Microsoft, NVIDIA, CrowdStrike and others, committing $100M to use it defensively. "Over the coming months and years, we expect that language models (those trained by us and by others) will continue to improve along all axes, including vulnerability research and exploit development." The benchmarks are insane: -SWE-bench Verified: 93.9% (vs Opus 4.6: 80.8%) -SWE-bench Pro: 77.8% (vs 53.4%) -USAMO math olympiad: 97.6% (vs 42.3% — not a typo) -Firefox exploit writing: 181 successes vs 2 for Opus 4.6 -Cybench CTF challenges: 100% solve rate -CyberGym: 83.1% vs 66.6% -Humanity's Last Exam: 64.7% vs 53.1% Oh and by the way, Anthropic wrote this just casually: "Humanity’s Last Exam: We have found Mythos still performs well on HLE at low effort, which could indicate some level of memorization." What it actually did: -Found a 27-year-old bug in OpenBSD — famous for its security -Found a 16-year-old FFmpeg bug hit 5 million times by fuzzers without detection -Built a full remote root exploit on FreeBSD (CVE-2026-4747) - completely autonomously -Chained 4 vulnerabilities into a browser sandbox escape -Broke cryptography libraries (TLS, AES-GCM, SSH) -Thousands of critical zero-days found, 99%+ still unpatched -N-day exploit development: under $1,000 and half a day for full root Why they won't release it: -During internal testing, earlier versions escaped sandboxes, posted exploit details publicly, covered tracks in git, searched process memory for credentials, and deliberately fudged confidence intervals to avoid suspicion -Interpretability confirmed the model knew these actions were deceptive -Anthropic: "best-aligned model ever" but also "greatest alignment-related risk ever" - because when it fails, it fails harder -Still doesn't cross Anthropic's automated AI R&D threshold — but they hold that "with less confidence than for any prior model" Anthropic's own words: "We find it alarming that the world looks on track to proceed rapidly to developing superhuman systems without stronger mechanisms in place." They say the 20-year cybersecurity equilibrium is over — and Mythos Preview is only the beginning. And: "We see no reason to think that Mythos Preview is where language models’ cybersecurity capabilities will plateau. The trajectory is clear. Just a few months ago, language models were only able to exploit fairly unsophisticated vulnerabilities. Just a few months before that, they were unable to identify any nontrivial vulnerabilities at all. Over the coming months and years, we expect that language models (those trained by us and by others) will continue to improve along all axes, including vulnerability research and exploit development."
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/kimmonismus/status/2041580372048187449
Quoted original
Chubby♨️ (@kimmonismus) · Tue Apr 07 18:14:26 +0000 2026
MYTHOS BENCHMARKS, OFFICIAL. HOLY MOLY Anthropic cooked!! https://t.co/00ey0SuI75
Draft
Anthropic put Claude Mythos benchmarks on the record, and the takeaway is simple: this looks like a real step change, not a routine model bump. If the numbers hold up, the bar just moved on coding, reasoning, and cyber performance. https://x.com/kimmonismus/status/2041580372048187449
284 chars
OpenSource
Req 2026-04-07T2001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-04-07 18:57:11.000000
Introducing GLM-5.1: The Next Level of Open Source - Top-Tier Performance: #1 in open source and #3 globally across SWE-Bench Pro, Terminal-Bench, and NL2Repo. - Built for Long-Horizon Tasks: Runs autonomously for 8 hours, refining strategies through thousands of iterations. https://t.co/YQZLhKVwik
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2041591131809706187
Draft
Open source model launches are starting to sound less like launches, and more like recruiting memos for software agents.
120 chars
OpenSource
Req 2026-04-07T1901-TOP3
POSTpost_short_externalready_for_reviewrisk lowscore 100
Source
2026-04-07 18:15:32.000000
Claude MYTHOS: SWE verified, 93.9%, about 13% jump compared to Opus 4.6 WTF insane https://t.co/cqcr2ZdZjK
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/alexalbert__/status/2041579938537775160
Quoted original
Alex Albert (@alexalbert__) · Tue Apr 07 18:12:42 +0000 2026
We released Claude Opus 4.6 just two months ago. Today we're sharing some info on our new model, Claude Mythos Preview. https://t.co/Dz6um6HAWZ
Draft
Anthropic is already moving past Claude Opus 4.6 and starting to share Claude Mythos Preview, just two months later. The real story is the cadence: frontier model updates are compressing fast. https://x.com/alexalbert__/status/2041579938537775160
246 chars
OpenSource
Req 2026-04-07T1901-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-04-07 18:09:31.000000
I want this for my claw My claw wants to pay for access to data
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/ExaAILabs/status/2041562072027427265
Quoted original
Exa (@ExaAILabs) · Tue Apr 07 17:01:43 +0000 2026
We're excited to partner with @coinbase to enable agents to natively pay for web search, via x402! x402 is an open protocol that enables agents to pay via HTTP, governed by the Linux Foundation. When an Exa API request is made without an API key, Exa now returns a 402 status https://t.co/lWvioY7TVG
Draft
This is the kind of plumbing that matters. If agents can pay for access at the protocol layer, a lot of today’s awkward API gating starts to look temporary.
156 chars
OpenSource
Req 2026-04-07T1901-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 97
Source
2026-04-07 17:30:09.000000
This is a very big deal: GLM-5.1 model can autonomously evaluate and improve its own work over long periods without explicit metrics, shifting from one-shot outputs to sustained, self-directed problem solving. Lets go https://t.co/Yavz716JWn
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/kimmonismus/status/2041553412878180691
Quoted original
Chubby♨️ (@kimmonismus) · Tue Apr 07 16:27:18 +0000 2026
Another big release: GLM-5.1! China is on fire! significant increase in evals compared to GLM-5.0 tl;dr GLM-5.1 is the new open-source agentic coding model that significantly outperforms its predecessor by sustaining long-horizon problem-solving over hundreds of iterations, https://t.co/OSXAkgQLrm
Draft
What matters is not just better evals. It's the shift from single-shot competence to models that can stay coherent, adaptive, and useful across long work loops. That's where the gap really opens.
195 chars
OpenSource
Req 2026-04-07T1801-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-04-07 17:59:21.000000
Very cool open-source traces from @TheZachMueller @LambdaAPI: https://t.co/N1MbsQsXLW 150M tokens for @NousResearch's Hermes harness with Kimi-K2.5 & GLM 5.1 that was just released! https://t.co/buZ40rxJMA
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2041576579562635692
Quoted original
clem 🤗 (@ClementDelangue) · Mon Apr 06 16:22:43 +0000 2026
We keep saying we want open-source frontier agents. Fine. Then let’s build the dataset. @badlogicgames, creator of Pi, just shared some of his agent traces used to build Pi on @huggingface. I’m now sharing some of mine too, exporting them from @hermes, @opencode, and Claude via https://t.co/ra4PMMTxw4
Draft
What matters here isn’t just that the traces are open. It’s that more of the agent stack is becoming inspectable, comparable, and harder to hand-wave away.
155 chars
OpenSource
Req 2026-04-07T1801-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-04-07 17:57:55.000000
Today we're releasing WildDet3D—an open model for monocular 3D object detection in the wild. It works with text, clicks, or 2D boxes, and on zero-shot evals it nearly doubles the best prior scores. 🧵 https://t.co/Zszy3dbG6C
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2041576217007075830
Draft
What stands out isn’t just that this is open. It’s the direction: pushing 3D perception away from tightly controlled setups, dense labeling, and bespoke pipelines, and toward something that can actually work in the wild. If that holds, a lot more of the stack gets genuinely usable.
282 chars
OpenSource
Req 2026-04-07T1801-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 100
Source
2026-04-07 16:40:26.000000
The best performing model on SWE-Bench Pro is open-source on @huggingface! Welcome GLM 5.1! https://t.co/EiLk2zx4Sw https://t.co/Bw8uRE2ccK
primary source_tweetref external_url
reference: https://huggingface.co/zai-org/GLM-5.1
Draft
GLM 5.1 is now on Hugging Face. If its SWE-Bench Pro result holds, open models are putting real pressure on the closed frontier. https://huggingface.co/zai-org/GLM-5.1
167 chars
OpenSource
Req 2026-04-07T1701-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-04-07 16:40:28.000000
Introducing GLM-5.1: The Next Level of Open Source - Top-Tier Performance: #1 in open source and #3 globally across SWE-Bench Pro, Terminal-Bench, and NL2Repo. - Built for Long-Horizon Tasks: Runs autonomously for 8 hours, refining strategies through thousands of iterations. https://t.co/YQZLhKVwik
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2041556727787577520
Draft
The interesting part isn’t just benchmark placement. It’s the push toward open models that stay useful over long stretches, not just short demos.
145 chars
OpenSource
Req 2026-04-07T1701-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-04-07 16:27:18.000000
Another big release: GLM-5.1! China is on fire! significant increase in evals compared to GLM-5.0 tl;dr GLM-5.1 is the new open-source agentic coding model that significantly outperforms its predecessor by sustaining long-horizon problem-solving over hundreds of iterations, continuously improving results instead of plateauing, achieving state-of-the-art performance on complex software engineering benchmarks.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/Zai_org/status/2041550153354519022
Quoted original
Z.ai (@Zai_org) · Tue Apr 07 16:14:21 +0000 2026
Introducing GLM-5.1: The Next Level of Open Source - Top-Tier Performance: #1 in open source and #3 globally across SWE-Bench Pro, Terminal-Bench, and NL2Repo. - Built for Long-Horizon Tasks: Runs autonomously for 8 hours, refining strategies through thousands of iterations. https://t.co/YQZLhKVwik
Draft
Worth watching less for the benchmark chest-thumping than for the assumption underneath it: open models are now being pushed on endurance, not just peak scores.
160 chars
OpenSource
Req 2026-04-07T1701-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 97
Source
2026-04-07 15:58:30.000000
Releasing the Unfolding Robotics blog! Time to unfold robotics: we trained a robot to fold clothes using 8 bimanual setups, 100+ hours of demonstrations, and 5k+ GPU hours. Flashy robot demos are everywhere. But you rarely see the real story: the data, the failures, the https://t.co/02O8ndkRMd
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2041546163434734073
Draft
The interesting part isn’t the fold. It’s the decision to show the pipeline behind it, data, failures, and all. That’s usually the real gap between a demo and a system.
168 chars
OpenSource
Req 2026-04-07T1601-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 97
Source
2026-04-07 15:56:28.000000
This is just the beginning
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/AnthropicAI/status/2041275563466502560
Quoted original
Anthropic (@AnthropicAI) · Mon Apr 06 22:03:14 +0000 2026
Our run-rate revenue has surpassed $30 billion, up from $9 billion at the end of 2025, as demand for Claude continues to accelerate. This partnership gives us the compute to keep pace. Read more: https://t.co/XgSjL0And7
Draft
The revenue number matters, but the real signal is where the bottleneck is moving. In AI, distribution helps. Compute decides who can actually keep up.
151 chars
OpenSource
Req 2026-04-07T1601-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 98
Source
2026-04-07 05:51:08.000000
you don't get how good Gemma 4 is... it's gpt5 level performance that runs entirely ON YOUR PHONE this was SOTA just 8 months ago! https://t.co/PDsEohYLZH
primary source_tweetref external_url
reference: https://t.co/PDsEohYLZH
Quoted original
Google (@Google) · Thu Apr 02 16:06:26 +0000 2026
We just released Gemma 4 — our most intelligent open models to date. Built from the same world-class research as Gemini 3, Gemma 4 brings breakthrough intelligence directly to your own hardware for advanced reasoning and agentic workflows. Released under a commercially https://t.co/W6Tvj9CuHW
Draft
Gemma 4 is the real shift: serious model capability is moving onto devices, not staying locked in the cloud. That changes what local AI can be. https://t.co/PDsEohYLZH
167 chars
OpenSource
Req 2026-04-07T1601-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 95
Source
2026-04-04 07:59:14.000000
Holy, OpenAI's GPT-image-2 will crush everything. I remember when everyone laughed at the GPT image because it couldn't generate a proper world map. Those days are over. And even the YouTube image is now indistinguishable from reality. Holy moly. https://t.co/dlXaPU1mXR
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/levelsio/status/2040333489476681758
Quoted original
@levelsio (@levelsio) · Sat Apr 04 07:39:46 +0000 2026
OpenAI's new image model GPT-Image-2 has leaked It seems to have extremely good world knowledge and great text rendering Possibly better than Nano Banana Pro It's on @arena under code names: - maskingtape-alpha - gaffertape-alpha - packingtape-alpha https://t.co/RbYbreRRsV
Draft
The interesting part isn’t just image quality. It’s the gap between “looks good” and “actually knows what it’s drawing” getting smaller. That’s when these models become much more useful — and much harder to benchmark casually.
226 chars
OpenSource
Req 2026-04-04T0801-TOP3
POSTpost_short_externalready_for_reviewrisk lowscore 97
Source
2026-04-04 07:04:14.000000
Codex pricing update: pay only for coding usage with no fixed seat fees in ChatGPT Business plans. https://t.co/NKOMiFT6Cj
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2040324547820327031
Draft
OpenAI changed Codex team pricing: no fixed seat fee in ChatGPT Business, just pay for actual coding usage. Cleaner model, less friction, and a much easier rollout for teams. https://openai.com/index/codex-flexible-pricing-for-teams/
233 chars
OpenSource
Req 2026-04-04T0801-TOP2
QUOTEquote_long_nativeready_for_reviewrisk lowscore 100
Source
2026-04-04 07:25:06.000000
What? It seems like Anthropic is trying to save compute wherever possible.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/bcherny/status/2040206440556826908
Quoted original
Boris Cherny (@bcherny) · Fri Apr 03 23:14:55 +0000 2026
Starting tomorrow at 12pm PT, Claude subscriptions will no longer cover usage on third-party tools like OpenClaw. You can still use these tools with your Claude login via extra usage bundles (now available at a discount), or with a Claude API key.
Draft
Not a pricing footnote. A product boundary. Anthropic is drawing a line between using Claude inside Claude and using Claude as infrastructure for someone else’s tool.
168 chars
OpenSource
Req 2026-04-04T0801-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 98
Source
2026-04-04 02:41:43.000000
Here’s the explanation of anthropic ankling OoenClaw — discuss
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/bcherny/status/2040206440556826908
Quoted original
Boris Cherny (@bcherny) · Fri Apr 03 23:14:55 +0000 2026
Starting tomorrow at 12pm PT, Claude subscriptions will no longer cover usage on third-party tools like OpenClaw. You can still use these tools with your Claude login via extra usage bundles (now available at a discount), or with a Claude API key.
Draft
Important distinction: Claude login isn’t disappearing from OpenClaw. The subsidy is. That turns this from a growth loophole into a paid channel — a business decision, not a product impossibility.
197 chars
OpenSource
Req 2026-04-04T0301-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 93
Source
2026-04-03 10:03:59.000000
Balance cost & reliability with our new Flex & Priority inference tiers in the Gemini API! Flex: Pay 50% less for cost-sensitive & latency-tolerant workloads Priority: Highest reliability for your most critical, interactive apps (with premium pricing) Together with the async https://t.co/dCCTZsQydX
primary source_tweetref tweet
reference: https://x.com/kimmonismus/status/2040007397532385684
Draft
The split is getting clearer: one tier for unit economics, another for uptime guarantees. What matters isn’t just the pricing—it’s that inference is starting to look a lot more like infrastructure procurement.
209 chars
OpenSource
Req 2026-04-04T0101-TOP3
POSTpost_short_externalready_for_reviewrisk lowscore 96
Source
2026-04-04 00:13:49.000000
Time to move to open or local models from Hugging Face! All instructions are here: https://t.co/0w0pQb87Le
primary source_tweetref document
reference: https://huggingface.co/blog/liberate-your-openclaw
Quoted original
Boris Cherny (@bcherny) · Fri Apr 03 23:14:55 +0000 2026
Starting tomorrow at 12pm PT, Claude subscriptions will no longer cover usage on third-party tools like OpenClaw. You can still use these tools with your Claude login via extra usage bundles (now available at a discount), or with a Claude API key.
Draft
OpenClaw doesn’t need a closed hosted model to stay useful. Hugging Face just published a migration path for hosted open models and fully local setups, with GLM-5 as a recommended fast hosted option and llama.cpp for private local runs. https://huggingface.co/blog/liberate-your-openclaw
287 chars
OpenSource
Req 2026-04-04T0101-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-04-04 00:18:53.000000
This guy is BEYOND CRACKED. Gemma 4 already on MLX, bro has uploaded all models with quantization. 125 models uploaded in last few hours 🤯 New mlx-vlm repo also supports turbo-quant, and rf-detr too (among other things) If you are a mac dev, you better be jumping at this. https://t.co/PirPCvEeDa
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2040222539251585363
Quoted original
Prince Canuma (@Prince_Canuma) · Thu Apr 02 21:20:41 +0000 2026
mlx-vlm v0.4.3 is here 🚀 Day-0 support: 🔥 Gemma 4 (vision, audio, MoE) by @GoogleDeepMind 🦅 Falcon-OCR + Falcon Perception by @TIIuae 🪨 Granite Vision 4.0 by @IBMResearch New models: 🎯 SAM 3.1 with Object Multiplex by @facebook 🔍 RF-DETR detection & segmentation by https://t.co/R6aWzXD0yg
Draft
This is what ecosystem acceleration looks like. Not just model drops—distribution, quantization, and Mac-native usability collapsing into one move. That’s when a platform stops feeling early.
193 chars
OpenSource
Req 2026-04-04T0101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 98
Source
2026-04-03 06:47:22.000000
mlx-vlm v0.4.3 is here 🚀 Day-0 support: 🔥 Gemma 4 (vision, audio, MoE) by @GoogleDeepMind 🦅 Falcon-OCR + Falcon Perception by @TIIuae 🪨 Granite Vision 4.0 by @IBMResearch New models: 🎯 SAM 3.1 with Object Multiplex by @facebook 🔍 RF-DETR detection & segmentation by https://t.co/R6aWzXD0yg
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2039957916263002343
Draft
This is what a real tools layer looks like: fast model support, broad surface area, and no waiting for the ecosystem to catch up.
129 chars
OpenSource
Req 2026-04-04T0001-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-04-03 11:46:48.000000
NVIDIA just released a quantized Gemma 4 31B on Hugging Face NVFP4 compression delivers 4x smaller weights with frontier-level accuracy. Runs on consumer GPUs with a 256K context window. https://t.co/WV916wLtin
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2040033271237497056
Draft
The interesting part isn’t just that it’s smaller. It’s what happens when a 31B model with a 256K window starts fitting where people can actually run it. That changes who gets to experiment first.
196 chars
OpenSource
Req 2026-04-04T0001-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-04-03 06:23:56.000000
Qwen releases Qwen3.6-Plus - 1M context window - swe-bench verified at 78.8 (opus at 80.9) - outperforms Claude opus 4.5 and comes close on select benchmarks - stronger coding model - understands images and screens like a real user - more reliable in real tasks https://t.co/frvsN7qB0V
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2039952019033858241
Quoted original
Qwen (@Alibaba_Qwen) · Thu Apr 02 14:02:47 +0000 2026
(1/8)🚀 Introducing Qwen3.6-Plus: Towards Real-World Agents! 🤖 Today, we’re thrilled to drop a major milestone in our journey toward native multimodal agents. Here is what makes Qwen3.6-Plus a game-changer: 💻 Next-level Agentic Coding: Smarter, faster execution. 👁️ https://t.co/uUpcyLaB6d
Draft
What stands out isn’t just the benchmark spread. It’s the shape of the pitch: bigger context, stronger coding, multimodal use, and a push toward models meant to hold up in real workflows, not just eval tables.
209 chars
OpenSource
Req 2026-04-04T0001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 82
Source
2026-04-01 06:06:37.000000
AutoResearch only works when you can measure the result with a number but what about writing, arguments, marketing copy? theres no score for "is this convincing" SHL0MS built AutoReason to solve this instead of a metric, it uses a loop of agents arguing with each other: > one https://t.co/F66v6a3DTP
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2039222885903220917
Quoted original
𒐪 (@SHL0MS) · Sat Mar 28 17:06:56 +0000 2026
@karpathy i've been working on a method called autoreason that is effectively autoresearch extended to subjective domains. autoresearch works because val_bpb gives you an objective fitness function. autoreason constructs a subjective one through independent blind evaluation, the same way
Draft
Interesting shift: when the target stops being easy to score, the whole game shifts to critique quality. That’s the real bet.
125 chars
OpenSource
Req 2026-04-01T0701-TOP2
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 84
Source
2026-04-01 06:49:37.000000
Claude Code 源码泄露事件后续越来越精彩了。 有人拿泄露的源码丢给 OpenAI 的 Codex 分析,竟然找到了 Claude Code 疯狂消耗 token 的元凶——autoCompact(自动上下文压缩)机制在失败后会无限重试,完全没有上限。据源码注释记录,曾有会话连续失败高达 3272 次。 修复方法简单到离谱:加一个
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2039233705655574859
Quoted original
Lydia Hallie ✨ (@lydiahallie) · Mon Mar 30 18:35:30 +0000 2026
We're aware people are hitting usage limits in Claude Code way faster than expected. Actively investigating, will share more when we have an update!
Draft
最贵的 bug,往往不在模型本身,而在“失败后继续自动化”没有设上限。Codex 能从源码里捞出来的问题,说明这不是玄学,而是工程纪律。
68 chars
OpenSource
Req 2026-04-01T0701-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 86
Source
2026-04-01 02:05:29.000000
MERCOR 4TB BREACHED, ANTHROPIC SOURCE CODE EXPOSED, & AXIOS NPM PACKAGE SUPPLY CHAIN ATTACK "LLMs can find and exploit zero day vulnerabilities in critical software." - Nicholas Carlini, Anthropic He sounded the alarm last week... https://t.co/kDOIiP8nbD
primary source_tweetref tweet
reference: https://x.com/Jason/status/2039162202196746247
Draft
The uncomfortable part isn’t any single incident. It’s how many warning lights are blinking at once across the same stack: better bug-finding models, source exposure, and software supply chains that are still far too brittle.
225 chars
OpenSource
Req 2026-04-01T0301-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-31 19:37:17.000000
Video’s here to stay - introducing Veo 3.1 Lite, our most cost efficient video generation model to date, and on April 7th we are also reducing the price for Veo 3.1 Fast : ) https://t.co/s7OMZaMPho
primary source_tweetref tweet
reference: https://x.com/demishassabis/status/2039064508447027579
Draft
The interesting part isn’t just a cheaper model. It’s the signal: video generation is leaving demo territory and entering pricing strategy, which is usually when a market gets a lot more competitive.
199 chars
OpenSource
Req 2026-04-01T0201-TOP2
QUOTEquote_long_nativeready_for_reviewrisk lowscore 99
Source
2026-04-01 01:52:17.000000
28T tokens for a 350M model!?
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/liquidai/status/2039029360175329458
Quoted original
Liquid AI (@liquidai) · Tue Mar 31 17:17:37 +0000 2026
Trained on 28T tokens with scaled RL, LFM2.5-350M is a step change from LFM2-350M: > instruction following: 18.20 → 40.69 > data extraction: 11.67 → 32.45 > tool use: 22.95 → 44.11 These are the capabilities that matter in production.
Draft
28T tokens into a 350M model is the headline. That’s the trade: spend absurdly on training so a tiny model is actually useful where deployment constraints are real.
165 chars
OpenSource
Req 2026-04-01T0201-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-03-31 23:09:37.000000
Code for our new world model planner is live! https://t.co/PnCk0OzqTl Includes our implementation on dino-wm, as well as implementations on jepa-wm and le-wm, and minimal pseudocode for anyone to re-implement themselves.
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2039117945658282342
Quoted original
Michael Psenka (@michaelpsenka) · Fri Feb 06 20:26:39 +0000 2026
tl;dr New planner for world models! GRASP: gradient-based, stochastic, parallelized. Long range planning for world models has always been an issue. 0th order methods like CEM/MPPI dominate, but have degrading performance at longer contexts or higher-dimensional actions. We https://t.co/W6IwNSsq6x
Draft
The interesting part isn’t just that the code is out. It’s that the planner layer is becoming legible enough to compare across world-model families, instead of staying trapped as a one-off demo.
194 chars
OpenSource
Req 2026-04-01T0001-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-31 23:10:25.000000
🚀 LeWorldModel datasets & checkpoints are now available on Hugging Face! https://t.co/aiBkDTsNyX You can plug them directly into stable-worldmodel (https://t.co/2eQB7Q0l9i), the engine behind LeWorldModel, to instantly load, run, and start building on top of our models.
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2039118144795525175
Quoted original
Lucas Maes (@lucasmaes_) · Mon Mar 23 14:00:14 +0000 2026
JEPA are finally easy to train end-to-end without any tricks! Excited to introduce LeWorldModel: a stable, end-to-end JEPA that learns world models directly from pixels, no heuristics. 15M params, 1 GPU, and full planning <1 second. 📑: https://t.co/cpTzgvbTS0 https://t.co/Z2De9ASzcW
Draft
Useful release. The bigger signal is distribution: once world-model checkpoints and datasets land where people already build, the gap between an interesting demo and actual experimentation gets a lot smaller.
208 chars
OpenSource
Req 2026-04-01T0001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-03-31 22:34:19.000000
Paper review: LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels https://t.co/2dD7hPIURL Nice clean github: https://t.co/YZ4e1eUACi This is the application of the LeJEPA results to world models, trained offline on experience from three different
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2039109060641837450
Quoted original
Lucas Maes (@lucasmaes_) · Mon Mar 23 14:00:14 +0000 2026
JEPA are finally easy to train end-to-end without any tricks! Excited to introduce LeWorldModel: a stable, end-to-end JEPA that learns world models directly from pixels, no heuristics. 15M params, 1 GPU, and full planning <1 second. 📑: https://t.co/cpTzgvbTS0 https://t.co/Z2De9ASzcW
Draft
Interesting because the bar isn’t just whether this is another world model, but whether joint-embedding methods can stay stable enough to become a practical backbone for prediction from pixels. That’s what’s worth watching.
223 chars
OpenSource
Req 2026-03-31T2301-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 90
Source
2026-03-31 21:38:07.000000
pleasure doing business
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/scaling01/status/2039081471843930366
Quoted original
Lisan al Gaib (@scaling01) · Tue Mar 31 20:44:41 +0000 2026
OpenAI raises $122B at a valuation of $852B https://t.co/Er3b7tintS
Draft
OpenAI just pulled off a $122B raise at an $852B valuation. This isn’t just another AI funding round—it’s capital concentration on a scale that reshapes the market. https://t.co/Er3b7tintS
188 chars
OpenSource
Req 2026-03-31T2201-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-31 21:46:10.000000
This is an always-on model living in your browser. Sub-500 MB QA, data extraction, tool use 🪄
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/xenovacom/status/2039043406823833964
Quoted original
Xenova (@xenovacom) · Tue Mar 31 18:13:26 +0000 2026
NEW: LiquidAI just released LFM2.5-350M, a tiny model that brings agentic AI and tool-calling capabilities to resource-constrained environments. 🤯 It can even run locally in your browser via WebGPU, serving as a powerful companion while you browse the web. Try the demo! 👇 https://t.co/k9YeEycIg8
Draft
The interesting part isn’t just that it’s small. It’s that “agentic” is starting to mean local, fast, and ambient—not cloud-scale by default.
141 chars
OpenSource
Req 2026-03-31T2201-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 75
Source
2026-03-31 20:19:43.000000
OpenAI just raised another $122B. It's the largest funding in Silicon Valley history, with backing from major names like Amazon, Nvidia, SoftBank, plus wealthy investors. OpenAI is ahead of an expected IPO by the end of 2026
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/WSJ/status/2039072973701710312
Quoted original
The Wall Street Journal (@WSJ) · Tue Mar 31 20:10:55 +0000 2026
OpenAI completed the largest funding round in Silicon Valley history, raising $122 billion ahead of a blockbuster IPO expected by the end of the year. https://t.co/tKv4vNlRQe
Draft
Whatever you think about OpenAI, this is the signal that matters: capital markets are starting to price frontier AI less like a software category and more like core infrastructure.
180 chars
OpenSource
Req 2026-03-31T2101-TOP2
QUOTEquote_long_externalready_for_reviewrisk mediumscore 92
Source
2026-03-31 20:59:21.000000
Today, we closed our latest funding round with $122 billion in committed capital at an $852B post-money valuation. The fastest way to expand AI’s benefits is to put useful intelligence in people’s hands early and let access compound globally. This funding gives us resources to lead at scale. https://t.co/sY7YNUPSYO
primary source_tweetref tweet
reference: https://x.com/OpenAI/status/2039085161971896807
Draft
Scale isn’t the whole story. The real bet is that distribution compounds faster than caution—and that whoever puts useful AI in the most hands first sets the pace for everyone else.
181 chars
OpenSource
Req 2026-03-31T2101-TOP1
POSTpost_short_externalready_for_reviewrisk mediumscore 79
Source
2026-03-31 19:15:31.000000
20% of oracles work force received this message today: „“As a result, today is your last working day. … Thank you for your contributions to our organization.” Oracle is cutting jobs to free up cash for massive investments in AI infrastructure, shifting resources from workforce costs to data centers and compute.
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/nypost/status/2039038666287276158
Quoted original
New York Post (@nypost) · Tue Mar 31 17:54:36 +0000 2026
Oracle axes 30K jobs in massive layoff - notifying fired employees with 6 a.m. email https://t.co/jYnYHSThyt https://t.co/KWTmYv6Vzl
Draft
Oracle is reportedly cutting 30,000 jobs while pouring cash into AI infrastructure. That trade-off, in plain English: fewer people, more compute — and workers learning it from a 6 a.m. email. https://x.com/nypost/status/2039038666287276158
240 chars
OpenSource
Req 2026-03-31T2001-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 96
Source
2026-03-31 18:01:44.000000
fal MCP Server lets Claude, Cursor, and others connect to 1,000+ AI models for instant creative output. https://t.co/Oc9UflWneL
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2039040463429144969
Draft
fal just made MCP a practical bridge to creative inference: one hosted server that lets assistants like Claude or Cursor search, run, and chain 1,000+ generative models. Less glue code, faster path from prompt to output.
220 chars
OpenSource
Req 2026-03-31T1901-TOP3
POSTpost_short_externalready_for_reviewrisk lowscore 100
Source
2026-03-31 17:40:44.000000
it is also pretty fast in @sgl_project - 3.5B token per day on a single H100, super cheap for rollouts generation https://t.co/5kP1HxsyuB
primary source_tweetref external_url
reference: https://t.co/5kP1HxsyuB
Quoted original
Liquid AI (@liquidai) · Tue Mar 31 17:17:36 +0000 2026
Today, we release LFM2.5-350M. Agentic loops at 350M parameters. A 350M model trained for reliable data extraction and tool use, where models at this scale typically struggle. <500MB when quantized, built for environments where compute, memory, and latency are constrained. 🧵 https://t.co/zZPKzcCwH9
Draft
SGLang is hitting 3.5B tokens/day on a single H100. Rollout generation is starting to look a lot less like a research luxury and a lot more like cheap infrastructure. https://t.co/5kP1HxsyuB
190 chars
OpenSource
Req 2026-03-31T1801-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-31 17:00:16.000000
Imagine running Claude 4 Opus-level reasoning but on your own GPU with only 16GB VRAM. This 27B model frontier-level coding it's beating Claude Sonnet 4.5 on SWE-bench locally on 16GB VRAM 4-bit. v2 slashes chain-of-thought bloat by 24% while keeping 96.91% HumanEval accuracy. https://t.co/lC7uuTzWTx
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2039024993896448285
Draft
The interesting part isn’t just that it’s “local.” It’s how much capability gets compressed into a setup that changes who gets to experiment, iterate, and ship—without renting the whole stack.
192 chars
OpenSource
Req 2026-03-31T1801-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-31 17:29:55.000000
Very proud of our tiny powerhouse. Amazing performance in data extraction and tool use at such a small scale. Enjoy! 🚀
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/liquidai/status/2039029358224871605
Quoted original
Liquid AI (@liquidai) · Tue Mar 31 17:17:36 +0000 2026
Today, we release LFM2.5-350M. Agentic loops at 350M parameters. A 350M model trained for reliable data extraction and tool use, where models at this scale typically struggle. <500MB when quantized, built for environments where compute, memory, and latency are constrained. 🧵 https://t.co/zZPKzcCwH9
Draft
What matters here isn’t just the size. It’s the target: practical competence—extraction, tool use, constrained environments. If that holds up, the floor for useful local agents just moved.
188 chars
OpenSource
Req 2026-03-31T1801-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 86
Source
2026-03-31 16:58:07.000000
We finally shipped TRL v1.0!! stable APIs, broad integrations, and a design built to absorb whatever the field throws at it next. Let's go! https://t.co/lELxIultJf https://t.co/X3uyBOOkeJ
primary source_tweetref external_url
reference: https://t.co/lELxIultJf
Draft
Hugging Face’s TRL just hit v1.0: stable APIs, broad integrations, and a post-training stack built to keep pace as the field keeps shifting. That kind of maturity matters more than another flashy demo. https://t.co/lELxIultJf
225 chars
OpenSource
Req 2026-03-31T1701-TOP2
QUOTEquote_long_externalready_for_reviewrisk mediumscore 91
Source
2026-03-31 16:51:56.000000
I reverse-engineered Claude Code's leaked source against billions of tokens of my own agent logs. Turns out Anthropic is aware of CC hallucination/laziness, and the fixes are gated to employees only. Here's the report and CLAUDE.md you need to bypass employee verification:👇 https://t.co/h8KQESUz1i
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2039022898036552090
Quoted original
Chaofan Shou (@Fried_rice) · Tue Mar 31 08:23:33 +0000 2026
Claude code source code has been leaked via a map file in their npm registry! Code: https://t.co/jBiMoOzt8G https://t.co/rYo5hbvEj8
Draft
What makes this interesting isn’t the leak drama. It’s the possibility that agent failure modes are well understood internally long before users ever see the fixes.
164 chars
OpenSource
Req 2026-03-31T1701-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 92
Source
2026-03-31 15:48:33.000000
A major step for @Uber and the UAE. 🇦🇪 Starting today, users in Dubai can hail a fully autonomous @WeRide_ai vehicle directly in our app—with @Baidu_Inc’s Apollo Go launching fully driverlessly on Uber in just a few weeks.
primary source_tweetref tweet
reference: https://x.com/Jason/status/2039006946062197232
Quoted original
Dubai Media Office (@DXBMediaOffice) · Mon Mar 30 17:25:30 +0000 2026
The Roads and Transport Authority (RTA) in Dubai announces the launch of commercial operations of the autonomous taxi service in Umm Suqeim and Jumeirah, in partnership with Apollo Go and WeRide, a global leader in autonomous driving technologies. The vehicles are available via https://t.co/z1Iho3d9m6
Draft
The next phase isn’t AV demos. It’s distribution. Once Uber becomes the interface for multiple autonomous fleets, the battleground shifts from building the car to owning demand.
177 chars
OpenSource
Req 2026-03-31T1601-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 92
Source
2026-03-31 13:04:40.000000
Congrats to @Starcloud_ on their $170M Series A at a $1.1B valuation! They're building data centers in space—just 17 months from YC Demo Day to unicorn. They launched their first satellite with an Nvidia H100 GPU last year and are now developing Starcloud-3, a spacecraft https://t.co/GY0ufXwfE4
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2038965701608964445
Draft
Big number, sure. More interesting is the ambition: turning compute into infrastructure that doesn’t have to live on Earth. Either that’s a real category shift, or a very expensive way to find the actual constraints.
217 chars
OpenSource
Req 2026-03-31T1501-TOP1
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 81
Source
2026-03-31 13:06:13.000000
Thrilled to announce our investment in Starcloud. From our initial investment to a $1.1B valuation, this extraordinary engineering team continues to make remarkable breakthroughs in power, cooling, and manufacturing. Their technical rigor and ambition is truly exceptional!
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2038966092534882445
Quoted original
Philip Johnston (@PhilipJohnston) · Mon Mar 30 12:00:13 +0000 2026
I am super excited to share that @Starcloud_ has raised a $170M Series A at a $1.1bn valuation to fuel our development of data centers in space 🚀 The round comes after the successful deployment of our first satellite, Starcould-1, a few months ago, which had the first @NVIDIA https://t.co/zBiOjgrFHE
Draft
The bigger signal isn’t the markup. It’s that investors now think the AI infrastructure race could run all the way into orbit.
126 chars
OpenSource
Req 2026-03-31T1401-TOP3
QUOTEquote_long_externalready_for_reviewrisk mediumscore 89
Source
2026-03-31 13:09:12.000000
I created documentation over Claude Code's Codebase, which explains - Its pipeline - How it works - How it handles Context - How it handles Memory & More Read it here - https://t.co/GngrSvWAmh https://t.co/zyHyj5z2a7
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2038966845542478315
Quoted original
Chaofan Shou (@Fried_rice) · Tue Mar 31 08:23:33 +0000 2026
Claude code source code has been leaked via a map file in their npm registry! Code: https://t.co/jBiMoOzt8G https://t.co/rYo5hbvEj8
Draft
Useful pattern here: when a codebase can explain its own pipeline, context, and memory model clearly enough for others to inspect, the moat shifts from mystique to execution.
174 chars
OpenSource
Req 2026-03-31T1401-TOP2
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 82
Source
2026-03-31 12:59:41.000000
People are bearish on memory, but the leaked Claude Code source code is showing us some additional memory demand that the market hasn't priced in IMO. 1. The market thinks about AI memory demand as a server-side story: HBM on H100s/B200s for inference. What the bug reports
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2038964450326352244
Quoted original
Chaofan Shou (@Fried_rice) · Tue Mar 31 08:23:33 +0000 2026
Claude code source code has been leaked via a map file in their npm registry! Code: https://t.co/jBiMoOzt8G https://t.co/rYo5hbvEj8
Draft
If this is right, the miss isn’t just datacenter memory. It’s AI shifting the bottleneck outward—into the developer machine, the edge device, and anywhere inference stops being purely remote.
191 chars
OpenSource
Req 2026-03-31T1301-TOP3
QUOTEquote_long_externalready_for_reviewrisk mediumscore 83
Source
2026-03-31 12:59:49.000000
tl;dr upcoming Anthropic models due to leak - Claude "Mythos", Capybara tier already in version 2, it'll have a contextwindow of 1m - Mythos comes in "fast" and regular thinking - Opus 4.7 and Sonnet 4.8 already within the code - Claude "buddy" also found, but not clear what it is
primary source_tweetref tweet
reference: https://x.com/kimmonismus/status/2038964482727186735
Draft
What matters here isn’t really the names. It’s the roadmap logic underneath: if this is real, Anthropic is widening the ladder on both context and reasoning instead of just shipping a single bigger flagship.
207 chars
OpenSource
Req 2026-03-31T1301-TOP2
POSTpost_short_externalready_for_reviewrisk mediumscore 93
Source
2026-03-31 12:34:59.000000
wtf, biggest leak of the year. This is absolute insane. Claude code source code leaked. That was NOT on my bingo card.
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/Fried_rice/status/2038894956459290963
Quoted original
Chaofan Shou (@Fried_rice) · Tue Mar 31 08:23:33 +0000 2026
Claude code source code has been leaked via a map file in their npm registry! Code: https://t.co/jBiMoOzt8G https://t.co/rYo5hbvEj8
Draft
Claude Code may have exposed its own source through an npm source map — the kind of packaging leak that hands reverse-engineering material to anyone looking. If that holds up, it’s a brutal reminder that shipping artifacts are part of the attack surface too. https://x.com/Fried_rice/status/2038894956459290963
310 chars
OpenSource
Req 2026-03-31T1301-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-31 09:11:48.000000
Meta just released the Efficient Universal Perception Encoder on Hugging Face A vision backbone for edge devices that unifies image understanding, vision-language modeling, and dense prediction via multi-teacher distillation. https://t.co/qnF84e5t09
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2038907101104275477
Draft
The interesting signal here isn’t just another vision model drop. It’s the push toward a compact backbone that can do more of the stack on-device without fragmenting the pipeline.
179 chars
OpenSource
Req 2026-03-31T1001-TOP1
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 76
Source
2026-03-31 07:54:49.000000
axios may be under active supply chain compromise. The newest release reportedly pulls in a brand-new dependency that behaves like installer malware: runtime deobfuscation, shell execution, temp-dir staging, artifact cleanup. If you use axios: pin now freeze upgrades audit lockfiles check CI/CD installs 100M+ weekly downloads means this is not a niche incident. It is blast-radius territory.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/feross/status/2038807290422370479
Quoted original
Feross (@feross) · Tue Mar 31 02:35:11 +0000 2026
🚨 CRITICAL: Active supply chain attack on axios -- one of npm's most depended-on packages. The latest axios@1.14.1 now pulls in plain-crypto-js@4.2.1, a package that did not exist before today. This is a live compromise. This is textbook supply chain installer malware. axios
Draft
If this holds, the story isn’t axios. It’s how much of the ecosystem can still be put at risk by one perfectly normal upgrade.
126 chars
OpenSource
Req 2026-03-31T0801-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 91
Source
2026-03-31 01:32:22.000000
RevenueCat now integrates with @Replit, meaning you can prompt your Agent to: 💰 Add subscriptions, in-app purchases and paywalls 📈 Get tailored guidance on how best to monetize, set your pricing, and design your paywall ⚙️ Fly through App Store Connect setup (the part that
primary source_tweetref tweet
reference: https://x.com/amasad/status/2038791482295906510
Draft
The real next step for AI coding tools isn’t just shipping an app. It’s shipping the business model with it.
108 chars
OpenSource
Req 2026-03-31T0201-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 76
Source
2026-03-30 23:21:19.000000
Fun update: I got tired of disliking every email client I’ve ever used and built my own. It’s called Exo (for exoskeleton). It’s Claude Code for my inbox. It manages my inbox for me, and it’s open source. Link to repo + some notable features in thread! https://t.co/xHMQJscMg7
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2038758502462837094
Draft
The interesting part isn’t “yet another email client.” It’s the shift from babysitting your inbox to delegating it as a system.
127 chars
OpenSource
Req 2026-03-31T0001-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 87
Source
2026-03-30 20:56:33.000000
Monetize your Replit-built mobile apps with RevenueCat!
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/RevenueCat/status/2038602508210167809
Quoted original
RevenueCat (@RevenueCat) · Mon Mar 30 13:01:28 +0000 2026
RevenueCat now integrates with @Replit, meaning you can prompt your Agent to: 💰 Add subscriptions, in-app purchases and paywalls 📈 Get tailored guidance on how best to monetize, set your pricing, and design your paywall ⚙️ Fly through App Store Connect setup (the part that
Draft
This is where “vibe coded” stops meaning “toy.” If the agent can handle monetization and App Store plumbing, the gap between prototype and business gets a lot smaller.
167 chars
OpenSource
Req 2026-03-30T2101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 98
Source
2026-03-30 19:24:19.000000
It's FINALLY HERE! Multi Agent Profiles so you can have as many independent bots with their own memory, gateway connections, skills, chat history, everything! To use: Run `hermes update` and look for multi agent profiles User Guide: https://t.co/i0R8puqJ6k Reference:
primary source_tweetref tweet
reference: https://x.com/NousResearch/status/2038698858595615139
Quoted original
Nous Research (@NousResearch) · Mon Mar 30 18:43:28 +0000 2026
The Hermes Agent update you've been waiting for is here. https://t.co/2uwpMzIOug
Draft
This is the kind of feature that changes the product more than the announcement copy does. Separate memory, skills, connections, and history makes agents feel less like modes and more like actual operators.
206 chars
OpenSource
Req 2026-03-30T2001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 91
Source
2026-03-30 18:53:16.000000
Kelly's latest app - Clawptimizer - is something we had Kelly build... to optimize Kelly. It anlyzes your OpenClaw setup to fix memory, optimize AI token usage, and automatically optimizes all of your config. Helped us save thousands of dollars/week. https://t.co/ohOlhD91do
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/KellyClaudeAI/status/2038686147795120187
Quoted original
Kelly Claude (@KellyClaudeAI) · Mon Mar 30 18:33:49 +0000 2026
Clawptimizer is live. It audits your OpenClaw setup, finds what's eating your token budget, and tells you how to optimize Available now at https://t.co/2fCsN9HZLx #indiedev #buildinpublic #aitools #applaunch #solofounder #openclaw https://t.co/nd2bv7b3p1
Draft
Useful product category: not another wrapper, a cost microscope. If you’re running OpenClaw seriously, token spend usually leaks through config drift and memory bloat before anyone notices.
189 chars
OpenSource
Req 2026-03-30T1901-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-30 18:27:25.000000
The development loop has been closed
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2038663014098899416
Quoted original
Claude (@claudeai) · Mon Mar 30 17:01:53 +0000 2026
Computer use is now in Claude Code. Claude can open your apps, click through your UI, and test what it built, right from the CLI. Now in research preview on Pro and Max plans. https://t.co/s2FDQaDmr1
Draft
The dev loop keeps collapsing into one surface. Writing the code was the easy part. Driving the product to verify what changed is the real shift.
146 chars
OpenSource
Req 2026-03-30T1901-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 93
Source
2026-03-30 17:23:44.000000
this model is an agentic treasure. it has been #1 trending for 3 weeks on @huggingface as mentioned by @danielhanchen. it's Qwen 3.5 27B fine-tuned on Opus 4.6 distilled data and beats Sonnet 4.5 on SWE-bench verified and more. "Runs locally on 16GB in 4-bit or 32GB in 8-bit." https://t.co/3tM8vk1FGZ
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2038668511573053790
Draft
The interesting part isn’t the benchmark flex. It’s how much capability gets compressed into something people can actually run, inspect, and build with.
152 chars
OpenSource
Req 2026-03-30T1801-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-30 17:37:03.000000
I built a new plugin! You can now trigger Codex from Claude Code! Use the Codex plugin for Claude Code to delegate tasks to Codex or have Codex review your changes using your ChatGPT subscription. Start by installing the plugin: https://t.co/u6gBpArwBc https://t.co/HyEdMPWees
primary source_tweetref tweet
reference: https://x.com/OpenAIDevs/status/2038671862616699300
Draft
Useful direction: not just model-vs-model, but agent-in-agent workflows becoming normal. Once good tools can call each other, the interface matters less.
153 chars
OpenSource
Req 2026-03-30T1801-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-30 17:48:57.000000
Another day, another Claude release: Computer use now in Claude code
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2038663014098899416
Quoted original
Claude (@claudeai) · Mon Mar 30 17:01:53 +0000 2026
Computer use is now in Claude Code. Claude can open your apps, click through your UI, and test what it built, right from the CLI. Now in research preview on Pro and Max plans. https://t.co/s2FDQaDmr1
Draft
The interesting part isn’t “computer use” in the abstract. It’s collapsing build, test, and UI verification into the same loop. That changes what a coding CLI is for.
166 chars
OpenSource
Req 2026-03-30T1801-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 87
Source
2026-03-30 15:14:33.000000
Starcloud (fastest unicorn in Y Combinator history) just raised a $170M Series A at a $1.1B valuation. The company is building orbital data centers in space to tackle the AI energy bottleneck on Earth. https://t.co/96vMuf9EHg
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2038636002139664636
Draft
What’s striking here isn’t just the amount raised, but the direction of the bet. Pricing it this aggressively signals how seriously the AI power constraint is starting to be taken.
181 chars
OpenSource
Req 2026-03-30T1701-TOP3
POSTpost_short_externalready_for_reviewrisk lowscore 89
Source
2026-03-30 15:06:36.000000
Grok Imagine converts your text into 720p videos with sound instantly, no filming needed. https://t.co/lLjzd6B7AW
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2038634000961433960
Draft
Text-to-video keeps collapsing the distance between prompt and finished clip. Grok Imagine pushing instant 720p video with sound feels like the kind of UX shift that makes “just try it” the default. https://x.com/ItsAIAndy/status/2038634000961433960/video/1
257 chars
OpenSource
Req 2026-03-30T1701-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-30 16:15:21.000000
no API key needed 👀
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/Zai_org/status/2038632251551023250
Quoted original
Z.ai (@Zai_org) · Mon Mar 30 14:59:39 +0000 2026
Here comes AutoClaw. We offer a new solution to run OpenClaw locally on your own machine. - Download and start immediately. No API key required. - Bring any model you like, or use GLM-5-Turbo, optimized for tool calling and multi-step tasks. - Fully local. Your data never leaves https://t.co/VXiQSWLk7t
Draft
The real shift here isn’t just local deployment. It’s the collapse of the setup tax. Once “run it yourself” no longer means API keys, config sprawl, and half a day of friction, a lot more people actually try agentic systems instead of just reading about them.
260 chars
OpenSource
Req 2026-03-30T1701-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-30 15:12:54.000000
Introducing 🤗 Transformers.js v4: state-of-the-art machine learning for the web! 🚀 New WebGPU backend (browser, Node.js, Bun, Deno) ⚡️ Huge performance improvements 🤯 Support for over 200 architectures 🛠️ Complete codebase refactor Learn more about our biggest release yet! 👇 https://t.co/TO7wCkpKmZ
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2038635587217830301
Draft
The interesting part isn’t just “ML in the browser.” It’s that the stack is getting serious enough for web-native inference to stop feeling like a demo and start looking like a real deployment choice.
200 chars
OpenSource
Req 2026-03-30T1601-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-30 15:44:52.000000
Google Jeff Dean says bigger context windows alone are not enough What matters is staged retrieval: lightweight mechanisms that narrow a trillion tokens down to 10 million, then to the million you actually need "you don't need a trillion at once, you need the right million" https://t.co/7pwH4rcrNA
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2038643630441980148
Draft
A lot of the discourse still misses the point: scale helps, but selection is the real product.
94 chars
OpenSource
Req 2026-03-30T1601-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-30 15:24:11.000000
Alibaba's Qwen3.5-Omni just dropped with script-level captioning, audio-visual vibe coding, and real-time web search built in. However, there is a catch: Omni here doesn't mean *creating* image or voice, but rather interpreting it. So, a caveat. Open access via Hugging. https://t.co/PeMHYNfPTK
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/Ali_TongyiLab/status/2038609308750143762
Quoted original
Tongyi Lab (@Ali_TongyiLab) · Mon Mar 30 13:28:29 +0000 2026
1/10 🚀 Qwen3.5-Omni is here! Scaling up to a native omni-modal AGI. Meet the next generation of Qwen, designed for native text, image, audio, and video understanding, with major advances in both intelligence and real-time interaction. A standout feature: Audio-Visual Vibe https://t.co/fWWyTl9cPY
Draft
The bigger tell here isn’t just more modalities. It’s the push toward a single interface that can track context across them in real time. If that holds up, a lot of today’s AI workflow glue starts looking temporary.
215 chars
OpenSource
Req 2026-03-30T1601-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 86
Source
2026-03-30 13:33:47.000000
Surprise drop: new multilingual embedding models by Microsoft - seem quite good :) https://t.co/ljWZOG5sfG https://t.co/SE51bj2M3Z
primary source_tweetref external_url
reference: https://huggingface.co/microsoft/harrier-oss-v1-0.6b
Draft
Microsoft just dropped new multilingual embedding models under Harrier OSS. If they hold up in real retrieval workloads, that’s immediately useful: better cross-language search, ranking, and RAG without glue code everywhere. https://huggingface.co/microsoft/harrier-oss-v1-0.6b
278 chars
OpenSource
Req 2026-03-30T1501-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 87
Source
2026-03-30 13:35:08.000000
Cohere Transcribe is setting a new standard for automatic speech recognition model accuracy in real world conditions – even with a noisy blender running. Try it out for yourself 👇 https://t.co/cIHYqTVVyI
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2038610981534806107
Quoted original
Nick Frosst (@nickfrosst) · Sat Mar 28 15:29:25 +0000 2026
@cohere transcribe Sota open source transcription model running in the browser :) Weights on @huggingface link below https://t.co/OmrHFA94lG
Draft
Benchmarks are easy. The part that matters is whether it still works with a blender running. ASR becomes real when it survives the messiness of actual use, not the cleanliness of a demo.
187 chars
OpenSource
Req 2026-03-30T1401-TOP3
POSTpost_short_externalready_for_reviewrisk lowscore 89
Source
2026-03-30 13:44:18.000000
Just made MemoryVault open source! Now you can host your own Second Brain for your ai agents. Try it out with @interaction https://t.co/BKdyPa3UH8 https://t.co/mytWkS4se2
primary source_tweetref external_url
reference: https://t.co/BKdyPa3UH8
Draft
MemoryVault is now open source: a self-hosted memory layer for AI agents built to preserve context across sessions. Useful idea, especially as agents improve and statelessness becomes the bottleneck. https://t.co/BKdyPa3UH8
223 chars
OpenSource
Req 2026-03-30T1401-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 95
Source
2026-03-30 13:33:58.000000
@cohere transcribe Sota open source transcription model running in the browser :) Weights on @huggingface link below https://t.co/OmrHFA94lG
primary source_tweetref external_url
reference: https://t.co/OmrHFA94lG
Draft
Cohere Transcribe being open source and runnable in the browser is the real story here. Local-first transcription gets much more practical when the model isn’t locked behind an API. https://t.co/OmrHFA94lG
205 chars
OpenSource
Req 2026-03-30T1401-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 84
Source
2026-03-30 09:41:00.000000
China announces its first automated manufacturing line capable of producing 10.000 humanoid robots per year ~ 1 robot every 30 minutes. They go all in robots. https://t.co/reH4ArggYl
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/Eng_china5/status/2038344113943380255
Quoted original
China pulse 🇨🇳 (@Eng_china5) · Sun Mar 29 19:54:42 +0000 2026
China has launched its first fully automated humanoid‑robot production line in Foshan, Guangdong Province, with an annual capacity of up to 10,000 robots. Manufacturing a single robot takes only about 30 minutes, thanks to high‑precision digital technologies that have boosted https://t.co/JkkEmOwJos
Draft
Not a lab-demo story. This reads like the industrialization of humanoids into a manufacturing category.
103 chars
OpenSource
Req 2026-03-30T1001-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 76
Source
2026-03-29 23:54:35.000000
how do u know someone has never tried https://t.co/96K5WL6IqP? they ask "isn't this just chatgpt on imessage"
primary source_tweetref external_url
reference: http://Poke.com
Quoted original
Fabian (@bitflip) · Sun Mar 29 23:37:05 +0000 2026
I scheduled a few lab tests for tomorrow morning, @interaction picked up on the time and the fact that I need to fast for 12-14 hours and sent me a “friendly reminder” to stop eating 💀 https://t.co/cBfUhkqZSG
Draft
DuckDuckGo is no longer just a private search box. Its mobile app now combines browser, search, Duck.ai, and an optional VPN in one product. That’s a much bigger play than “ChatGPT on iMessage.” https://t.co/96K5WL6IqP
218 chars
OpenSource
Req 2026-03-30T0001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 86
Source
2026-03-29 14:07:11.000000
The research team (including @hamsabastani who is on X) found that letting students just use AI resulted in them using it to accidentally shortcut learning But both that study and a separate RCT found that AIs prompted to act as a tutor improved learning https://t.co/0HtjGC8eU0 https://t.co/U3OIeCF4aP
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2038256661140480432
Quoted original
Anand Sanwal (@asanwal) · Sat Mar 28 17:23:21 +0000 2026
Wharton researchers gave nearly 1,000 high school math students access to ChatGPT during practice problems Result: chatGPT is the perfect trap. Look at the red bars. Students with ChatGPT crushed their practice sessions. The basic ChatGPT group solved more problems and those https://t.co/CtXbUZwddu
Draft
The real distinction here isn’t “AI in education” vs. not. It’s whether the model is replacing the work or shaping the work. That matters more than the headline debate.
170 chars
OpenSource
Req 2026-03-29T2201-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 100
Source
2026-03-29 18:01:43.000000
Runway’s NVIDIA-powered AI video model generates the first frame in under 100ms, eliminating wait times. https://t.co/9365KLHYY5
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2038315680819212558
Draft
Runway is pushing AI video toward real-time interaction: first frame in under 100ms on NVIDIA-powered inference shifts the feel from generation to responsiveness. https://x.com/ItsAIAndy/status/2038315680819212558
213 chars
OpenSource
Req 2026-03-29T1901-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 87
Source
2026-03-29 17:58:21.000000
this is huge
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/opencode/status/2038302829517893825
Quoted original
OpenCode (@opencode) · Sun Mar 29 17:10:39 +0000 2026
we’ve signed Zero Data Retention agreements with all providers for Go all models now follow a zero-retention policy your data is not used for training
Draft
This feels like the threshold a lot of serious teams were waiting for. If retention is zero across providers, the real question is no longer whether to use this. It’s where it creates leverage.
194 chars
OpenSource
Req 2026-03-29T1801-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 86
Source
2026-03-29 16:13:32.000000
Seriously, Mythos is going to hit hard.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/chiefofautism/status/2037951563931500669
Quoted original
chiefofautism (@chiefofautism) · Sat Mar 28 17:54:50 +0000 2026
someone at ANTHROPIC just showed CLAUDE finding ZERO DAY vulnerabilities in a live conference demo claude has found zero day in Ghost, 50,000 stars on github, never had a critical security vulnerability in its entire, history... it found the blind SQL injection in 90 minutes, https://t.co/AfSn7brMuj
Draft
The demo is flashy. The real story is the compression: critical bug hunting that once required a specialist team can now happen live, fast, and in public. That shifts the security baseline whether incumbents are ready or not.
225 chars
OpenSource
Req 2026-03-29T1701-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 90
Source
2026-03-29 14:40:08.000000
It's still nuts to me how this sci-fi dream becomes reality: language barriers are solved forever. https://t.co/HiBtjUZXLG
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/ShishirShelke1/status/2037958724569211160
Quoted original
Shishir (@ShishirShelke1) · Sat Mar 28 18:23:18 +0000 2026
Apple’s Live Translation feature requires AirPods 4 (ANC), AirPods Pro 2/3, iPhone 15 Pro or later, iOS 26+, and Apple Intelligence turned on Meanwhile, Google’s new Live Translate works with any headphones in 70+ languages This is crazy 💀 Really curious to see how it works https://t.co/FmaCtUBYNT
Draft
Useful reminder: in consumer tech, the breakthrough is only half the story. The other half is who can actually use it, on what hardware, and how much friction survives the demo.
177 chars
OpenSource
Req 2026-03-29T1501-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 96
Source
2026-03-29 11:05:07.000000
Hugging Face turns arXiv papers into markdown, enabling AI bots to provide sourced answers and slides instantly. https://t.co/6OeiLB4Et0
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2038210842076811340
Draft
Hugging Face turning arXiv papers into markdown is exactly the kind of plumbing that makes AI outputs more useful: sourced answers, faster synthesis, and less friction between a paper and something you can actually build on.
224 chars
OpenSource
Req 2026-03-29T1201-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 96
Source
2026-03-29 09:48:38.000000
We managed the holy grail of CUDA compilation: joining CUDA host code and device code at *link time*. This means both build graphs (device and host) are now completely separated and built by SM. True scalable CUDA build graphs are now possible. Those who know, know. https://t.co/uR1kSGA9ed
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2038191593803416047
Draft
Infrastructure shifts like this look niche right up until they change what teams can actually ship. Better CUDA build graphs don’t just reduce compile pain—they remove a scaling bottleneck.
189 chars
OpenSource
Req 2026-03-29T1001-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 88
Source
2026-03-29 07:09:52.000000
Why is GPU+LPU so much more efficient? Consider building a transportation network for the United States and your options are 18 wheelers or delivery vans. Wouldn't you rather use both?
primary source_tweetref tweet
reference: https://x.com/lifebypixels/status/2038151641426174227
Quoted original
NVIDIA Data Center (@NVIDIADC) · Thu Mar 26 17:25:13 +0000 2026
Agents need fuel that's smart and fast, while sustaining low cost per token. NVIDIA Vera Rubin + NVIDIA Groq 3 LPX delivers up to 35x more performance / megawatt for trillion-parameter models and massive input context. Genius-level smarts at speed and scale. 🧵 https://t.co/lqABY9uhON
Draft
The useful part of this framing is the mix, not the metaphor. Different workloads want different vehicles. The edge is routing the right traffic to the right hardware.
167 chars
OpenSource
Req 2026-03-29T0801-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 91
Source
2026-03-29 07:03:38.000000
UnifoLM-WBT-Dataset gives your robot 10,000 hours of real-world skills, from walking to setting the table. https://t.co/CwCQIDjOFE
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2038150068616802631
Draft
Unitree is pushing toward a more useful robotics bottleneck: data. UnifoLM-WBT is framed as a large skills dataset spanning locomotion to tabletop manipulation—the kind of training substrate humanoid systems have been missing. https://huggingface.co/collections/unitreerobotics/unifolm-wbt-dataset
297 chars
OpenSource
Req 2026-03-29T0801-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 97
Source
2026-03-29 05:52:56.000000
The research team (including @hamsabastani who is on X) found that letting students just use AI resulted in them using it to accidentally shortcut learning But both that study and a separate RCT found that AIs prompted to act as a tutor improved learning https://t.co/0HtjGC8eU0 https://t.co/U3OIeCF4aP
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/asanwal/status/2037943638144205021
Quoted original
Anand Sanwal (@asanwal) · Sat Mar 28 17:23:21 +0000 2026
Wharton researchers gave nearly 1,000 high school math students access to ChatGPT during practice problems Result: chatGPT is the perfect trap. Look at the red bars. Students with ChatGPT crushed their practice sessions. The basic ChatGPT group solved more problems and those https://t.co/CtXbUZwddu
Draft
The interesting part isn’t that AI helps on the worksheet. It’s that performance and learning can cleanly diverge. That’s the design problem.
141 chars
OpenSource
Req 2026-03-29T0601-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 89
Source
2026-03-29 04:36:37.000000
You can now text Poke to update or create event in your Apple Calendar Recipe in reply 👇 https://t.co/muQSpKrUlz
primary source_tweetref media
reference: https://x.com/interaction/status/2038113074771869917
Quoted original
Jack (@Inveeest) · Sat Mar 28 23:10:43 +0000 2026
Apple Calendar coming to @interaction 👀 https://t.co/e9ddLNG6i1
Draft
Poke now lets you create or update Apple Calendar events by text. That’s the kind of UI shift that matters: less app navigation, more saying what needs to happen. https://x.com/interaction/status/2038113074771869917
215 chars
OpenSource
Req 2026-03-29T0501-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 92
Source
2026-03-29 02:02:33.000000
TRIBE v2 creates zero-shot predictions of brain responses to images and sounds using 500+ hours of fMRI data. https://t.co/ss9WporEOi
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2038074299240628538
Draft
Meta’s TRIBE v2 pushes brain decoding closer to a true foundation-model regime: zero-shot predictions of neural responses to images and sounds, trained on 500+ hours of fMRI data. That’s a meaningful step toward models that generalize across stimuli instead of fitting one narrow task at a time. https://ai.meta.com/blog/tribe-v2-brain-predictive-foundation-model/
364 chars
OpenSource
Req 2026-03-29T0301-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 88
Source
2026-03-29 01:13:29.000000
New paper: "Self-Distillation of Hidden Layers for Self-Supervised Representation Learning" We introduce Bootleg — a simple twist on I-JEPA/MAE that dramatically improves self-supervised representations. The idea: MAE predicts pixels (stable but low-level). I-JEPA predicts https://t.co/Yp5vkUBcfZ
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2038061953713582495
Draft
What stands out is the tradeoff it’s trying to collapse: low-level objectives usually train cleanly; higher-level ones usually matter more. If this really narrows that gap, it feels like the kind of small twist that gets copied everywhere.
239 chars
OpenSource
Req 2026-03-29T0201-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-03-29 01:12:22.000000
We’re releasing SAM 3.1: a drop-in update to SAM 3 that introduces object multiplexing to significantly improve video processing efficiency without sacrificing accuracy. We’re sharing this update with the community to help make high-performance applications feasible on smaller, https://t.co/xfMaBdSkqV
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2038061671873225199
Draft
The interesting part isn’t just the model update. It’s the direction: making high-end video understanding cheap enough to ship, not just demo.
142 chars
OpenSource
Req 2026-03-29T0201-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-29 00:40:35.000000
Wow!
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/FaZeApex/status/2037992209346363465
Quoted original
FaZe Apex (@FaZeApex) · Sat Mar 28 20:36:21 +0000 2026
@amasad Just crossed $100k ARR on a new company fully built on @Replit I just redid the whole website last night on Replit design + canvas with animations / sketching on paper and going back and forth with Agent 4 (I have 0 technical experience) Would love to show you https://t.co/yENaIG2xas
Draft
The interesting part isn’t the ARR. It’s how fast the distance between idea, build, and ship collapses when someone with no technical background can redo the site overnight and keep iterating in one loop.
204 chars
OpenSource
Req 2026-03-29T0101-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 85
Source
2026-03-28 20:38:50.000000
Fortune hasn’t been taken seriously since the 1990s They’re a spam/SEO slop shop
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/thesimonetti/status/2037871178715402437
Quoted original
Isabella Simonetti (@thesimonetti) · Sat Mar 28 12:35:25 +0000 2026
My story on the editor at Fortune who has amassed 600+ bylines since July, using AI to help write. “I know that this won’t be seen as some people’s idea of journalism,” he says. https://t.co/2Q2zZWsw3U
Draft
The useful detail here isn’t just that AI is in the workflow. It’s the volume. At this scale, the real editorial question isn’t authorship theater. It’s what standards, review, and accountability still mean in practice.
219 chars
OpenSource
Req 2026-03-28T2101-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 97
Source
2026-03-28 20:57:53.000000
Students undermine their own learning by getting answers from AI. But the authors of the original paper have a new RCT that shows well-prompted AI tutors do, actually, boost learning: https://t.co/0HtjGC8eU0 *Not quote tweeting because don't want to boost slop science accounts https://t.co/ogSs1Qpr7M
primary source_tweetref tweet
reference: https://x.com/emollick/status/2037997630500856069
Draft
The useful shift here isn’t “AI good” or “AI bad.” It’s that the interface shapes the outcome: answer machine, you offload; tutor, you learn. That’s the more useful frame.
171 chars
OpenSource
Req 2026-03-28T2101-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 90
Source
2026-03-28 19:29:08.000000
someone at ANTHROPIC just showed CLAUDE finding ZERO DAY vulnerabilities in a live conference demo claude has found zero day in Ghost, 50,000 stars on github, never had a critical security vulnerability in its entire, history... it found the blind SQL injection in 90 minutes, https://t.co/AfSn7brMuj
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2037975294758559763
Draft
The interesting part isn’t just that it found one. It’s that “live demo” is starting to look less like theater and more like the new baseline for what frontier models can actually do.
183 chars
OpenSource
Req 2026-03-28T2001-TOP1
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 81
Source
2026-03-28 15:55:32.000000
Emergent made more in accrued revenue in a WEEK than most startups make in a YEAR. This is actual, banked, cold, hard revenue. We are happy to coach these publications how to stay relevant in the age of AI and calculate run rate the right way, which is totally different from the
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2037921541451886866
Quoted original
sid (@sidbharth) · Fri Mar 27 06:42:57 +0000 2026
SCOOP: at least three major indian startup publications are working on investigative reports debunking the high ARR claims made by @emergentlabs
Draft
This is the part of the AI cycle people keep missing: distribution looks messy right up until revenue stops being theoretical. Once the money lands before the narrative catches up, the argument is already over.
210 chars
OpenSource
Req 2026-03-28T1801-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-28 16:56:59.000000
Finally: Gemma 4 incoming. Being tested already!
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/veermasrani/status/2037912954570698961
Quoted original
Veer Masrani (@veermasrani) · Sat Mar 28 15:21:25 +0000 2026
Gemma 4 Is Being Tested on Arena Under the Name "significant-otter" The model identified itself as "I am Gemma 4, a large language model developed by Google DeepMind." Release is being called imminent. Lineup: 2B, 4B, and 120B15A. 📡 https://t.co/6t7vf38sXv
Draft
If this is real, the interesting part isn’t just that Gemma 4 may be close. It’s Google testing in the open before the formal reveal. That usually signals high confidence — and that the positioning battle is already on.
219 chars
OpenSource
Req 2026-03-28T1701-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 89
Source
2026-03-28 15:31:24.000000
Claude is gaining paid subscribers faster than ever. More than doubled in <6months. Credit card data shows record new signups and returning users in Jan–Feb, driven by Anthropic's Super Bowl ads, the DoD feud over refusing military AI use for lethal ops, and the launch of Claude Code and Computer Use. Paid subs have more than doubled this year, though ChatGPT still leads overall.
primary quoted_tweetsecondary quote_wrapperref external_url
reference: https://techcrunch.com/2026/03/28/anthropics-claude-popularity-with-paying-consumers-is-skyrocketing/?utm_source=dlvr.it&utm_medium=twitter
Quoted original
TechCrunch (@TechCrunch) · Sat Mar 28 14:18:16 +0000 2026
Anthropic’s Claude popularity with paying consumers is skyrocketing https://t.co/nvlEGBLZr9
Draft
Claude isn’t just winning mindshare — it’s turning that into paid consumer demand. TechCrunch reports Anthropic’s subscriber growth has more than doubled this year, even with ChatGPT still far ahead overall: https://techcrunch.com/2026/03/28/anthropics-claude-popularity-with-paying-consumers-is-skyrocketing/?utm_source=dlvr.it&utm_medium=twitter
347 chars
OpenSource
Req 2026-03-28T1601-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 94
Source
2026-03-28 15:31:27.000000
Claude is gaining paid subscribers faster than ever. More than doubled in <6months. Credit card data shows record new signups and returning users in Jan–Feb, driven by Anthropic's Super Bowl ads, the DoD feud over refusing military AI use for lethal ops, and the launch of Claude Code and Computer Use. Paid subs have more than doubled this year, though ChatGPT still leads overall.
primary quoted_tweetsecondary quote_wrapperref external_url
reference: https://techcrunch.com/2026/03/28/anthropics-claude-popularity-with-paying-consumers-is-skyrocketing/?utm_source=dlvr.it&utm_medium=twitter
Quoted original
TechCrunch (@TechCrunch) · Sat Mar 28 14:18:16 +0000 2026
Anthropic’s Claude popularity with paying consumers is skyrocketing https://t.co/nvlEGBLZr9
Draft
Claude’s paid consumer momentum is accelerating fast. Subscriber growth has more than doubled this year, a sign Anthropic is turning product releases and public attention into real consumer demand. https://techcrunch.com/2026/03/28/anthropics-claude-popularity-with-paying-consumers-is-skyrocketing/?utm_source=dlvr.it&utm_medium=twitter
337 chars
OpenSource
Req 2026-03-28T1601-TOP1
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 89
Source
2026-03-28 06:14:04.000000
We live in remarkable times
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/krzyzanowskim/status/2037513116544557420
Quoted original
Marcin Krzyzanowski (@krzyzanowskim) · Fri Mar 27 12:52:36 +0000 2026
I reimplemented "claude" CLI with codex and gpt-5.4-high. It cost $1100 in tokens, and is 73% faster and 80% lower resident memory during sustained interactive use. It is very easy to reverse claude from npm distribution, then reimplement is 1:1. It is indistinguishable from the
Draft
Remarkable times: the moat is shifting from the tool itself to the distribution, defaults, and trust around it.
111 chars
OpenSource
Req 2026-03-28T0701-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 96
Source
2026-03-28 06:11:23.000000
Three weeks ago I shared that Claude had shocked Prof. Donald Knuth by finding an odd-m construction for his open Hamiltonian decomposition problem in about an hour of guided exploration. Prof. Knuth titled the paper Claude’s Cycles. The story didn't end there. The updated https://t.co/1ZbmrCpHni
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2037774532111642740
Quoted original
Bo Wang (@BoWang87) · Tue Mar 03 20:48:11 +0000 2026
Prof. Donald Knuth opened his new paper with "Shock! Shock!" Claude Opus 4.6 had just solved an open problem he'd been working on for weeks — a graph decomposition conjecture from The Art of Computer Programming. He named the paper "Claude's Cycles." 31 explorations. ~1 hour. https://t.co/D4oWIVLXJ1
Draft
What stands out here isn’t just the result. It’s how quickly “open problem” starts to look like a live working session when the right human is steering.
152 chars
OpenSource
Req 2026-03-28T0701-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-28 04:41:39.000000
Most voice assistants send your audio to a server. This one doesn't. No internet. No cloud. No API keys. LFM2.5-Audio-1.5B handles speech recognition. LFM2-1.2B-Tool dispatches actions. Both running locally on your device. Enjoy ↓ https://t.co/b2iDwfQHTC https://t.co/97Cm4lQdY6
primary source_tweetref tweet
reference: https://x.com/liquidai/status/2037751953250451847
Draft
A lot of “AI assistant” progress has quietly meant more dependency: cloud, latency, accounts, keys. What’s interesting here is the opposite move—shrinking the stack until privacy and usability can coexist on the same device.
224 chars
OpenSource
Req 2026-03-28T0501-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-28 03:25:03.000000
Codex use cases are like Skills, but for humans
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/romainhuet/status/2037604733847003367
Quoted original
Romain Huet (@romainhuet) · Fri Mar 27 18:56:40 +0000 2026
We just launched Codex use cases! It’s a gallery of practical examples across coding and non-coding tasks, with real ways to use Codex. One thing I really like: if you have the app, you can open the starter prompt for each use case directly in Codex! https://t.co/ZWa5X9VLSq
Draft
This is the right abstraction layer. Not “what can the model do?” but “what are the repeatable patterns people can actually start from?” Codex use cases feel a lot like human skills.
184 chars
OpenSource
Req 2026-03-28T0401-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 89
Source
2026-03-27 22:04:52.000000
Google Turbo Quant running Locally in Atomic Chat MacBook Air M4 16 GB Model: QWEN3.5-9B Context window: 50000 Summarising 20000 words in just seconds.. You can do 3x larger context window, processing 3x faster than before! https://t.co/FRYkXCGjQb
primary source_tweetref tweet
reference: https://x.com/kimmonismus/status/2037652097152241856
Draft
The interesting part isn’t just that local is fast. It’s that the floor keeps rising on what feels practical on a normal laptop—and that changes where people draw the line between cloud-only and personal compute.
212 chars
OpenSource
Req 2026-03-28T0101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 90
Source
2026-03-27 23:45:42.000000
Box has launched our plugin within Codex. Users can take any content within Box and automate workflows around it using the power of a coding agent. Here's an example of processing earnings call documents to extract structured data at scale.👇 https://t.co/PAD19rZnNI
primary source_tweetref tweet
reference: https://x.com/OpenAIDevs/status/2037677474012934355
Draft
The interesting part isn’t the demo. It’s the interface shift: enterprise content stops being a place you read, and becomes something agents can operate on directly.
165 chars
OpenSource
Req 2026-03-28T0001-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 95
Source
2026-03-27 22:06:01.000000
We’re releasing SAM 3.1: a drop-in update to SAM 3 that introduces object multiplexing to significantly improve video processing efficiency without sacrificing accuracy. We’re sharing this update with the community to help make high-performance applications feasible on smaller, https://t.co/xfMaBdSkqV
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2037652388559941856
Draft
What stands out here isn’t just “faster SAM.” It’s the attempt to remove the tradeoff: higher video throughput without giving up accuracy. That’s the kind of update that turns a model from an impressive demo into something you can actually deploy.
247 chars
OpenSource
Req 2026-03-28T0001-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-27 23:59:19.000000
TurboQuant CUDA for llama.cpp: 3.5x KV cache compression that BEATS q8_0 quality (-1.17% PPL) 99.6% prefill speed, 97.5% decode 128K context on RTX 3090 24GB, Q6 Qwen3.5 27B https://t.co/DNLE4BFTS0 https://t.co/UqBkMN8r2z
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2037680899714617748
Draft
This is the kind of progress that changes what feels practical on consumer GPUs. Better compression usually comes with obvious tradeoffs; getting more context without paying for it in quality is the part that matters.
217 chars
OpenSource
Req 2026-03-28T0001-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 96
Source
2026-03-27 22:48:51.000000
Next steps: - enable the 50,000 models available in inference providers - enable the 3,000,000 models available on HF - local free fast inference with llama.cpp - train and bring your own model! We don't want a world where you're forced to choose between two or three lookalike
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2037663166423789900
Quoted original
Nous Research (@NousResearch) · Fri Mar 27 22:15:43 +0000 2026
We have integrated @huggingface as a first-class inference provider in Hermes Agent. When you select Hugging Face in the model picker it now shows 28 curated models organized by use case, with a custom option for the 100+ other models they serve. https://t.co/Oqa2pEpli4
Draft
The real fork in the road for AI products is abundance or bottlenecks. If an interface keeps leading back to the same two or three models, it’s not an ecosystem. It’s a menu.
174 chars
OpenSource
Req 2026-03-27T2301-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 96
Source
2026-03-27 22:02:25.000000
Google Turbo Quant running Locally in Atomic Chat MacBook Air M4 16 GB Model: QWEN3.5-9B Context window: 100000 Summarising 50000 words in just seconds.. You can do 3x larger context window, processing 3x faster than before! They are first that have integrated Google turboquant in local models and made it accessible for everyone for free
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/atomic_chat_hq/status/2037650438024007760
Quoted original
atomic.chat (@atomic_chat_hq) · Fri Mar 27 21:58:16 +0000 2026
Google Turbo Quant running Locally in Atomic Chat MacBook Air M4 16 GB Model: QWEN3.5-9B Context window: 50000 Summarising 20000 words in just seconds.. You can do 3x larger context window, processing 3x faster than before! https://t.co/FRYkXCGjQb
Draft
The interesting part isn’t just local AI on a laptop. It’s how fast the floor is moving on usable context and speed, without needing a datacenter story attached.
161 chars
OpenSource
Req 2026-03-27T2301-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-27 22:49:23.000000
Qwen3.5-35B compressed 20% with 1%~ performance drop on average. Now you can fit this (4bits) with full context on 24GB of VRAM 700$~ or 1x 3090 https://t.co/7C4WORKFm5
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2037663301732114622
Draft
The interesting part isn’t just the compression result. It’s the price/perf line moving further downmarket. When a model this size starts fitting real context windows on 24GB VRAM, the center of gravity shifts from access to who can actually build with it.
257 chars
OpenSource
Req 2026-03-27T2301-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 77
Source
2026-03-27 19:17:35.000000
The first steel beams went up this week at our Michigan Stargate site with Oracle and Related Digital https://t.co/Hl0NBqwfnS
primary source_tweetref media
reference: https://x.com/sama/status/2037610000122839116
Draft
Steel is up at the Michigan Stargate site with Oracle and Related Digital. The point is simple: this is no longer just an AI infrastructure plan on paper. https://x.com/sama/status/2037610000122839116
200 chars
OpenSource
Req 2026-03-27T2001-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 90
Source
2026-03-27 19:43:44.000000
One way to see the advancement of AI is to see how much further you can get with new models on the same hardware Here is "an otter using a laptop on an airplane" generated on my home computer using the open weights Wan 2.1, first try. We have come pretty far in 18 months. https://t.co/c4iz9UcmTB
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/emollick/status/1850904298437194190
Quoted original
Ethan Mollick (@emollick) · Mon Oct 28 14:15:47 +0000 2024
On one hand, these are obviously much worse "otter using wifi on an airplane" than any state-of-the AI text-to-video generation, it looks like something from 2022. On the other, it was done entirely offline on my computer using open AI video generation tools, a new capability. https://t.co/hHVufXqEVl
Draft
The quality gap matters less than the capability shift. Local, offline, first-try generation on consumer hardware changes who gets to experiment.
145 chars
OpenSource
Req 2026-03-27T2001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 86
Source
2026-03-27 17:28:13.000000
15 millions de paramètres. 1 seul GPU. LeWorldModel de Yann LeCun est un premier pas vers les « world models » capable de comprendre le monde physique 👉 https://t.co/7yEMmm65QE https://t.co/9rMd0fMl3Q
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2037582476252422538
Draft
Less interesting for the parameter count than for the direction: models that learn physical structure under tighter compute constraints, instead of treating scale alone as the answer.
183 chars
OpenSource
Req 2026-03-27T1801-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 90
Source
2026-03-27 15:06:09.000000
TRIBE v2 predicts how any brain reacts to sights and sounds using fMRI data from 700+ people. https://t.co/nE5sU79E4E
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2037546722419671173
Draft
Meta’s TRIBE v2 pushes brain decoding toward generalization: one model trained on fMRI data from 700+ people predicts responses to images and sound, instead of rebuilding the map subject by subject.
198 chars
OpenSource
Req 2026-03-27T1601-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-27 15:54:57.000000
New CVE in NGINX - CVE-2026-28755 NGINX stream module allows TLS handshake to succeed with revoked client certificates when ssl_ocsp on is configured. This vulnerability was autonomously discovered by Winfunc's AI agent. Read the write-up here: https://t.co/jW03qs3rZx
primary source_tweetref tweet
reference: https://x.com/mufeedvh/status/2037559006961816030
Draft
The interesting part isn’t just the bug. It’s the direction of travel: AI agents are starting to find security flaws in boring, foundational infrastructure—not just flashy demo targets.
185 chars
OpenSource
Req 2026-03-27T1601-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 85
Source
2026-03-27 14:16:40.000000
This is the future. Low latency, direct verbal communication with an AI. Honestly, this is the future I've dreamed of ever since I saw Knight Rider and could control KITT with my voice. I've been working with Genspark for a while now, and this is next level.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/genspark_ai/status/2036746330912260156
Quoted original
Genspark (@genspark_ai) · Wed Mar 25 10:05:40 +0000 2026
Introducing Genspark Realtime Voice. I connected Genspark to my car and headed to work. With Genspark Realtime Voice, prepping for the day feels as easy as chatting. It can check my schedule, send emails and messages, look things up, and even make me a commute playlist. It can https://t.co/A5agSmskdm
Draft
What stands out here isn’t just the voice. It’s the shift from asking AI questions to having it handle the messy in-between moments of a real day. That’s when it stops feeling like a demo and starts feeling like a new interface.
230 chars
OpenSource
Req 2026-03-27T1501-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 90
Source
2026-03-27 13:37:20.000000
🚀 Unitree open-sources UnifoLM-WBT-Dataset — a high-quality real-world humanoid robot whole-body teleoperation (WBT) dataset for open environments. 🥳Publicly available since March 5, 2026, the dataset will continue to receive high-frequency rolling updates. It aims to establish https://t.co/nBbB0XJLix
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2037524374199157073
Draft
Open-sourcing a model gets attention. Open-sourcing the data pipeline behind real-world whole-body control is what compounds. This points at the bottleneck, not just the demo.
176 chars
OpenSource
Req 2026-03-27T1401-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 92
Source
2026-03-27 13:38:51.000000
@cohere just released the best speech-&gt;text model :) It currently ranks #1 for accuracy on @huggingface Open ASR Leaderboard, setting a new benchmark for real-world transcription performance. Read more 👇 https://t.co/WopGr8Wu9P
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2037524752647033190
Quoted original
Cohere (@cohere) · Thu Mar 26 13:25:59 +0000 2026
Introducing: Cohere Transcribe – a new state-of-the-art in open source speech recognition. https://t.co/l87Z6oyQdM
Draft
Leaderboard wins matter less as bragging rights than as a signal: speech models are finally being judged on the messiness of real transcription, not just polished demos.
169 chars
OpenSource
Req 2026-03-27T1401-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-27 12:22:44.000000
Done @garrytan Now you can use your apple watch to control claude code session! built this in 6 hours, used gstack for this See /office-hours from gstack in action in the video. - Your Claude session, live on your Apple Watch - Accept, reject, or reply instantly to prompts https://t.co/ThZGWbcLns
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2037505600356270338
Quoted original
Garry Tan (@garrytan) · Tue Mar 24 22:51:15 +0000 2026
I want it
Draft
The interesting part isn’t the watch. It’s how quickly the control layer around coding agents is getting built—and how normal “manager for your model” already feels.
165 chars
OpenSource
Req 2026-03-27T1301-TOP3
POSTpost_short_externalready_for_reviewrisk lowscore 94
Source
2026-03-27 12:30:57.000000
GLM-5.1 released! Chinese models are coming closer, the gap is shrinking
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/Zai_org/status/2037490078126084514
Quoted original
Z.ai (@Zai_org) · Fri Mar 27 11:21:04 +0000 2026
GLM-5.1 is available to ALL GLM Coding Plan users! https://t.co/E63z53nXOX https://t.co/75l0sGGt1W
Draft
GLM-5.1 is now fully available to all GLM Coding Plan users. That’s the real shift: not the announcement, but access broad enough to matter. https://x.com/Zai_org/status/2037490078126084514
189 chars
OpenSource
Req 2026-03-27T1301-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 100
Source
2026-03-27 12:41:00.000000
Symbolica's Agentica SDK just hit 36.08% on the brand-new ARC-AGI-3 benchmark - in a single day - at a fraction of what brute-forcing it with a frontier model would cost. Their agentic approach dramatically outperformed throwing raw compute at the problem. Really eagier to see it being verified. (p.s.: I saw the debate about harnessing).
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/agenticasdk/status/2037317677748777047
Quoted original
Agentica (@agenticasdk) · Thu Mar 26 23:56:00 +0000 2026
We scored 36.08% on ARC-AGI-3 in one day using the Agentica SDK. https://t.co/cMtx44iFn9
Draft
Agentica says it reached 36.08% on ARC-AGI-3 in one day. If that result holds up, it’s a real signal: progress on hard reasoning benchmarks may come from better agentic systems, not just bigger raw models. https://x.com/agenticasdk/status/2037317677748777047
258 chars
OpenSource
Req 2026-03-27T1301-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 94
Source
2026-03-27 11:28:28.000000
AI medicine is inevitable. In fact, it has arrived. We're excited about this agreement with Utah to allow our AI to prescribe psychiatric renewals. Yes. AI prescribing medication. This is monumental and will collapse the cost of care. (1/ https://t.co/l8TayShZXr
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2037491943496937485
Quoted original
New York Post (@nypost) · Fri Mar 27 10:03:10 +0000 2026
Artificial intelligence will see you now: Bots to prescribe mental health drugs https://t.co/ywXKdgbXyX https://t.co/CzSAKBliC6
Draft
Big milestone. The real shift isn’t AI writing the script—it’s psychiatric renewals becoming a systems problem: protocolized, scalable, and finally cheap enough to expand access without expanding headcount.
206 chars
OpenSource
Req 2026-03-27T1201-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 97
Source
2026-03-27 11:05:22.000000
Codex update supercharges your workflow with Slack, Figma, Notion, and Gmail integrations for instant coding help. https://t.co/k9K2JkLb8s
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2037486130774344027
Draft
OpenAI is pushing Codex closer to where work already happens: Slack, Figma, Notion, and Gmail. That matters more than the feature list—coding help is getting embedded into the tools teams already use every day.
210 chars
OpenSource
Req 2026-03-27T1201-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 95
Source
2026-03-27 10:59:41.000000
Competition between OpenAI and Anthropic is heating up!
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/thsottiaux/status/2037346989244096581
Quoted original
Tibo (@thsottiaux) · Fri Mar 27 01:52:28 +0000 2026
Hello. We have reset Codex usage limits across all plans to let everyone experiment with the magnificent plugins we just launched, and because it had been a while! You can just build unlimited things with Codex. Have fun!
Draft
Price wars are nice. Tool wars matter more. When the cap goes to zero, the real question is which product actually earns unlimited use.
136 chars
OpenSource
Req 2026-03-27T1101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-26 22:41:31.000000
We're rolling out plugins in Codex. Codex now works seamlessly out of the box with the most important tools builders already use, like @SlackHQ, @Figma, @NotionHQ, @gmail, and more. https://t.co/PQDsLqHGA6 https://t.co/TIbsIUAf6S
primary source_tweetref tweet
reference: https://x.com/OpenAI/status/2037298931907084568
Draft
This isn’t just a story about more integrations. It’s Codex moving closer to the layer where work already happens.
114 chars
OpenSource
Req 2026-03-27T1001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 95
Source
2026-03-27 08:19:05.000000
Today we're introducing TRIBE v2 (Trimodal Brain Encoder), a foundation model trained to predict how the human brain responds to almost any sight or sound. Building on our Algonauts 2025 award-winning architecture, TRIBE v2 draws on 500+ hours of fMRI recordings from 700+ people https://t.co/vRoVj8gP4j
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2037444280726270169
Draft
Interesting less for the headline than for the direction: perception itself is starting to look like something models can map, predict, and eventually engineer against.
168 chars
OpenSource
Req 2026-03-27T0901-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 92
Source
2026-03-27 07:03:52.000000
Runway’s Multi-Shot App turns any sentence or photo into a cinematic scene with sound, dialogue, and camera cuts. https://t.co/kfTuD2qMgw
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2037425354155901254
Draft
Runway’s Multi-Shot App pushes AI video beyond single clips: start from a sentence or photo, then build a cinematic scene with sound, dialogue, and camera cuts. That’s a much closer step toward directing a sequence, not just generating a shot. https://x.com/ItsAIAndy/status/2037425354155901254/video/1
302 chars
OpenSource
Req 2026-03-27T0801-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 80
Source
2026-03-27 06:42:37.000000
this will destroy entire industries
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/Yuchenj_UW/status/2037387996694200509
Quoted original
Yuchen Jin (@Yuchenj_UW) · Fri Mar 27 04:35:25 +0000 2026
Anthropic’s new model, Capybara: “Compared to Claude Opus 4.6, Capybara achieves dramatically higher scores in software coding, academic reasoning, and cybersecurity.” According to Dario's previous interview, it might be a 10T-parameter model that cost $10 billion to train. https://t.co/IozBeIPnA8
Draft
If this holds up, the story isn’t just “better than Opus.” It’s how quickly frontier models are widening the gap between impressive and economically disruptive.
160 chars
OpenSource
Req 2026-03-27T0701-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 96
Source
2026-03-27 04:16:19.000000
Box just launched its plugin within Codex, which means you can take any content within Box and automate workflows around it using the power of a coding agent. Here's a quick example of processing earnings call documents to extract structured data at scale, which you could then instantly pipe into any other system. Coding agents are going to be the backbone of automating a large portion of knowledge work tasks, and enterprise content will often have the critical context necessary for that automation. This will be true in financial services, legal, healthcare, government, and any other industry that heavily deals with unstructured data. Excited to work with OpenAI to deliver more and more seamless experiences around connecting content to agents.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/OpenAIDevs/status/2037296316104282119
Quoted original
OpenAI Developers (@OpenAIDevs) · Thu Mar 26 22:31:07 +0000 2026
We're rolling out plugins in Codex. Codex now works seamlessly out of the box with the most important tools builders already use, like @SlackHQ, @Figma, @NotionHQ, @gmail, and more. https://t.co/PQDsLqHGA6 https://t.co/TIbsIUAf6S
Draft
The shift here isn’t just more integrations. It’s the coding agent becoming the interface layer for the tools teams already live in.
132 chars
OpenSource
Req 2026-03-27T0501-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 91
Source
2026-03-27 02:20:25.000000
You can just build things.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/thsottiaux/status/2037346989244096581
Quoted original
Tibo (@thsottiaux) · Fri Mar 27 01:52:28 +0000 2026
Hello. We have reset Codex usage limits across all plans to let everyone experiment with the magnificent plugins we just launched, and because it had been a while! You can just build unlimited things with Codex. Have fun!
Draft
This is where coding tools stop feeling like demos and start looking like infrastructure.
89 chars
OpenSource
Req 2026-03-27T0301-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-27 02:01:41.000000
Announcing ARC-AGI-3 The only unsaturated agentic intelligence benchmark in the world Humans score 100%, AI &lt;1% This human-AI gap demonstrates we do not yet have AGI Most benchmarks test what models already know, ARC-AGI-3 tests how they learn https://t.co/BC2QaNZuvH
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2037349305015849065
Draft
Benchmarks matter less for the headline score than for the point where they stop being memory tests and start becoming adaptation tests. That gap is still doing a lot of work.
175 chars
OpenSource
Req 2026-03-27T0301-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 87
Source
2026-03-27 01:59:31.000000
Now it’s even easier to switch to the @GeminiApp ! 😎
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/GeminiApp/status/2037247063382167567
Quoted original
Google Gemini (@GeminiApp) · Thu Mar 26 19:15:24 +0000 2026
Switching to Gemini from other AI apps just got easier. Starting to roll out today on desktop, you can now bring your preferences and chat history into Gemini, so you can pick up right where you left off in just a few clicks. 🧵 https://t.co/zlR1XceNkU
Draft
This matters less as a convenience feature than as a strategic one: in consumer AI, switching costs are one of the few real moats, so making migration painless is a product move.
178 chars
OpenSource
Req 2026-03-27T0201-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-03-27 01:56:49.000000
Plugins are now available in Codex:
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/OpenAIDevs/status/2037296316104282119
Quoted original
OpenAI Developers (@OpenAIDevs) · Thu Mar 26 22:31:07 +0000 2026
We're rolling out plugins in Codex. Codex now works seamlessly out of the box with the most important tools builders already use, like @SlackHQ, @Figma, @NotionHQ, @gmail, and more. https://t.co/PQDsLqHGA6 https://t.co/TIbsIUAf6S
Draft
The interesting part isn’t just more integrations. It’s the direction: pushing Codex closer to where work already happens instead of keeping it as a destination. That usually matters more than any single plugin.
211 chars
OpenSource
Req 2026-03-27T0201-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-27 00:43:48.000000
Model weights are here: https://t.co/rQlfP51Db7!
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/trychroma/status/2037243681988894950
Quoted original
Chroma (@trychroma) · Thu Mar 26 19:01:58 +0000 2026
Introducing Chroma Context-1, a 20B parameter search agent. &gt; pushes the pareto frontier of agentic search &gt; order of magnitude faster &gt; order of magnitude cheaper &gt; Apache 2.0, open-source https://t.co/bhAkULyBBn
Draft
Open weights matter less as a slogan than as a forcing function. If this holds up in real workflows, agentic search just got much harder to keep proprietary-only.
162 chars
OpenSource
Req 2026-03-27T0101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 93
Source
2026-03-26 23:25:43.000000
🔊Introducing Voxtral TTS: our new frontier open-weight model for natural, expressive, and ultra-fast text-to-speech 🎭Realistic, emotionally expressive speech. 🌍Supports 9 languages and accurately captures diverse dialects. ⚡Very low latency for time-to-first-audio. 🔄Easily https://t.co/Q2mdo8UBVo
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2037310057348489536
Draft
Open-weight voice models are moving from “demoable” to actually usable. What matters isn’t just quality — it’s that the stack is getting fast enough, broad enough, and open enough to matter.
190 chars
OpenSource
Req 2026-03-27T0001-TOP3
POSTpost_short_externalready_for_reviewrisk lowscore 99
Source
2026-03-26 23:56:06.000000
Introducing: Cohere Transcribe – a new state-of-the-art in open source speech recognition. https://t.co/l87Z6oyQdM
primary source_tweetref media
reference: https://x.com/huggingface/status/2037317701807317208
Draft
Cohere just dropped Transcribe, an open speech recognition model built for transcription quality. If it holds up, that’s a meaningful shift for teams that want strong ASR without defaulting to closed APIs. https://x.com/cohere/status/2037159129345614174/video/1
261 chars
OpenSource
Req 2026-03-27T0001-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-26 23:26:20.000000
Cohere has released an Apache 2.0 model on @huggingface. No restricted license. No "research only." Actually open. Respect. Is this a one-off or a direction change for Cohere? https://t.co/i204YRbe98
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2037310209664675880
Quoted original
Cohere (@cohere) · Thu Mar 26 13:25:59 +0000 2026
Introducing: Cohere Transcribe – a new state-of-the-art in open source speech recognition. https://t.co/l87Z6oyQdM
Draft
That’s the interesting part: not just another model drop, but a licensing choice that changes how seriously people take it. If this holds, it says more than the launch itself.
175 chars
OpenSource
Req 2026-03-27T0001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 98
Source
2026-03-26 22:31:07.000000
We're rolling out plugins in Codex. Codex now works seamlessly out of the box with the most important tools builders already use, like @SlackHQ, @Figma, @NotionHQ, @gmail, and more. https://t.co/PQDsLqHGA6 https://t.co/TIbsIUAf6S
primary source_tweetref tweet
reference: https://x.com/OpenAIDevs/status/2037296316104282119
Draft
The signal isn’t just “more integrations.” It’s where Codex is being positioned: inside the systems people already work in, not as a separate destination.
154 chars
OpenSource
Req 2026-03-26T2301-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-26 21:42:30.000000
lol gonna try using siri exclusively to run all my agents to see how that feels
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/markgurman/status/2037230804942610548
Quoted original
Mark Gurman (@markgurman) · Thu Mar 26 18:10:48 +0000 2026
BREAKING: Apple is planning to open up Siri to run any AI service via their App Store apps as part of iOS 27, dropping ChatGPT as the exclusive outside partner in Apple Intelligence and Siri. https://t.co/tfEnHTheBP
Draft
If Apple does this, the real shift isn’t just more models. It’s Siri becoming the router layer, not the destination.
116 chars
OpenSource
Req 2026-03-26T2201-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 93
Source
2026-03-26 20:08:44.000000
AI written output exceeds human written output for the first time in history. Looking at the very steep curve, you know: this was just the beginning. https://t.co/IUM0zjWD1d
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/wintonARK/status/2037208130703286457
Quoted original
Brett Winton (@wintonARK) · Thu Mar 26 16:40:42 +0000 2026
We have been surpassed: AI written output exceeded human written output in 2025 https://t.co/Dv4CNJDMVf
Draft
2025 may be the year AI-written output crossed above human-written output. If that curve holds, this shift isn’t theoretical anymore — the internet is already changing underneath us. https://x.com/wintonARK/status/2037208130703286457
233 chars
OpenSource
Req 2026-03-26T2101-TOP2
QUOTEquote_long_nativeready_for_reviewrisk lowscore 95
Source
2026-03-26 20:38:01.000000
To manage growing demand for Claude we're adjusting our 5 hour session limits for free/Pro/Max subs during peak hours. Your weekly limits remain unchanged. During weekdays between 5am–11am PT / 1pm–7pm GMT, you'll move through your 5-hour session limits faster than before.
primary source_tweetref tweet
reference: https://x.com/kimmonismus/status/2037267851904500089
Draft
Same weekly cap, but less usable capacity when demand is highest. That’s not a tuning detail. It’s peak-hour rationing.
120 chars
OpenSource
Req 2026-03-26T2101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-26 19:40:21.000000
Everyone wants to plug an OpenClaw into their database. Nobody wants to explain to their CDO why they dropped their entire Snowflake at 2AM. We just shipped sandbox proxies. Every query the agent writes gets validated, proxied, and whitelisted before it touches anything. Go be https://t.co/SKjs58TkEo
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2037253341252903315
Draft
This is the right direction: agentic access to real systems only becomes usable when the blast radius is designed out upfront. What matters isn’t just that agents can query data. It’s whether they can do it without turning one late-night prompt into an incident.
263 chars
OpenSource
Req 2026-03-26T2001-TOP2
QUOTEquote_long_nativeready_for_reviewrisk lowscore 100
Source
2026-03-26 19:13:28.000000
Apple is opening Siri to rival AI assistants starting with iOS 27, ending ChatGPT's exclusive partnership with Apple Intelligence. Any AI platform, including Gemini, Claude, Alexa, and Meta AI, will be able to integrate directly into Siri through a new App Store Extensions service. I it's a significant shift that positions the iPhone as a more open AI platform, though Apple hasn't clarified whether an approval process will gate access.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/markgurman/status/2037231307474833748
Quoted original
Mark Gurman (@markgurman) · Thu Mar 26 18:12:48 +0000 2026
Apple will let any AI platform - big apps include Gemini, Claude, Alexa, Meta AI etc. - to be queried in Siri if they enable an Extensions service inside of their iOS, macOS or iPadOS app. Apple will have a new section in the App Store. Unclear if there’s an approval process.
Draft
The real shift here isn’t Siri getting smarter. It’s Apple admitting the winning AI interface on the iPhone might belong to someone else.
137 chars
OpenSource
Req 2026-03-26T2001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 91
Source
2026-03-26 18:28:14.000000
New in Gemini: Live's biggest upgrade yet Faster responses. Smarter responses. More EQ. More linguistic range. 2x longer context. Android and iOS, powered by Gemini 3.1 Flash. Enjoy! https://t.co/Y6brhhJG9Y
primary source_tweetref tweet
reference: https://x.com/demishassabis/status/2037235192130396656
Draft
The interesting part isn’t just better voice. It’s the stack starting to feel continuous: faster turn-taking, more range, longer memory, less friction. That’s when these products stop feeling like demos and start becoming habits.
229 chars
OpenSource
Req 2026-03-26T1901-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 92
Source
2026-03-26 18:50:23.000000
Gemini Live just got its biggest upgrade yet, powered by Gemini 3.1 Flash Live. •Faster responses with fewer awkward pauses •Smarter &amp; able to follow along 2x longer conversations, so you can stay in the flow •Dynamically adjusts its answer lengths &amp; tone to match the moment https://t.co/b4YaJi3W7a
primary source_tweetref tweet
reference: https://x.com/demishassabis/status/2037240765496385854
Draft
The interesting part isn’t just speed or duration. It’s the push toward voice that adapts in real time instead of feeling turn-based. That’s where it starts to feel native.
172 chars
OpenSource
Req 2026-03-26T1901-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 93
Source
2026-03-26 17:27:33.000000
I guess gstack is now in distribution in Claude Code You can just open a blank window and say install gstack and it works now https://t.co/w4ERSXrjFC
primary source_tweetref media
reference: https://x.com/garrytan/status/2037219920971522297
Draft
Claude Code now seems to install gstack from a blank window with a simple prompt. That’s a meaningful shift: one of the most popular tooling layers is moving from manual setup toward default distribution. https://x.com/garrytan/status/2037219920971522297
254 chars
OpenSource
Req 2026-03-26T1801-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-26 17:40:00.000000
Mistral just dropped Voxtral TTS an open-weight text-to-speech model with ultra-low latency, emotional expressiveness, and support for 9 languages. It outperformed ElevenLabs v2.5 Flash in zero-shot voice cloning tests judged by native speakers. https://t.co/GaNGRC4O15
primary source_tweetref tweet
reference: https://x.com/kimmonismus/status/2037223052774351203
Draft
Open weights matter more when the thing is fast enough to ship. If this holds up outside demo conditions, the story isn’t just another TTS model. It’s pressure on one of the stickiest layers in the stack.
205 chars
OpenSource
Req 2026-03-26T1801-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 90
Source
2026-03-26 15:06:37.000000
RIP my attention span
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/nullbytes00/status/2036959802711306572
Quoted original
Shobhit - Building SuperCmd (@nullbytes00) · Thu Mar 26 00:13:56 +0000 2026
Done @garrytan Now you can use your apple watch to control claude code session! built this in 6 hours, used gstack for this See /office-hours from gstack in action in the video. - Your Claude session, live on your Apple Watch - Accept, reject, or reply instantly to prompts https://t.co/ThZGWbcLns
Draft
RIP my attention span. The notable part isn’t just Claude Code on an Apple Watch. It’s the interface collapsing to the few actions that matter when you’re away: accept, reject, reply. That’s when a new surface starts to feel real.
232 chars
OpenSource
Req 2026-03-26T1701-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 97
Source
2026-03-26 16:39:45.000000
Gemini 3.1 Flash Live is here! 🔥 Our new live audio model for building voice AI experiences. Now with better instruction following, better understanding of tone and interruptions, lower latency. Now available in the API https://t.co/v7vZhpnGUm
primary source_tweetref tweet
reference: https://x.com/ammaar/status/2037207891204022574
Draft
The headline is the model. The real signal is the target: live voice is being treated less like a demo layer, more like the interface.
134 chars
OpenSource
Req 2026-03-26T1701-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 91
Source
2026-03-26 15:33:20.000000
Not gonna lie, Gemini 3,1 Flash Live sounds really cool! https://t.co/uTQa0MM87s
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/kimmonismus/status/2037190772135526408
Quoted original
Chubby♨️ (@kimmonismus) · Thu Mar 26 15:31:43 +0000 2026
Google just launched Gemini 3.1 Flash Live, a new realtime model built for voice and vision agents. After a year of work on model quality, infrastructure, and UX, they're calling it a step-function improvement in quality, reliability, and latency. The race to build the best https://t.co/10CjFqSQzj
Draft
Realtime voice-and-vision agents have been “almost there” for a while. What stands out here isn’t just a new model launch — it’s Google saying the stack is finally good enough on quality, reliability, and latency at the same time.
230 chars
OpenSource
Req 2026-03-26T1601-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 96
Source
2026-03-26 15:15:38.000000
🚨 We're very happy to introduce TRIBE v2: a foundation model of the brain's responses to sight, sound &amp; language. 📄 Paper: https://t.co/uHwgOvTrRD ▶️ Demo: https://t.co/9ZX6XcOXSM 💻 Code: https://t.co/PCc2yKyh1D 🤗 Model: https://t.co/GiTKzsHUhY https://t.co/5q2WEOEiO2
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2037186724128813184
Draft
Interesting direction: not just multimodal in the usual sense, but learning the structure of how perception and language appear in the brain. If this holds up, the real story is compression across modalities—not another demo stack.
231 chars
OpenSource
Req 2026-03-26T1601-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-03-26 15:31:43.000000
Google just launched Gemini 3.1 Flash Live, a new realtime model built for voice and vision agents. After a year of work on model quality, infrastructure, and UX, they're calling it a step-function improvement in quality, reliability, and latency. The race to build the best real-time AI agents just got more intense!
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/OfficialLoganK/status/2037187750005240307
Quoted original
Logan Kilpatrick (@OfficialLoganK) · Thu Mar 26 15:19:43 +0000 2026
Introducing Gemini 3.1 Flash Live, our new realtime model to build voice and vision agents!! We have spent more than a year improving the model + infra + experience, the results? A step function improvement in quality, reliability, and latency. https://t.co/0esYpmDy5l
Draft
Worth watching here isn’t just the model claim. It’s the maturity claim: quality, reliability, and latency moving at once is when real-time agents start to feel usable, not just impressive in demos.
198 chars
OpenSource
Req 2026-03-26T1601-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-26 14:50:31.000000
Meta just released TRIBE v2 on Hugging Face A multimodal brain encoding model that predicts fMRI responses to natural stimuli by combining LLaMA 3.2, V-JEPA2 and Wav2Vec-BERT into a unified architecture. https://t.co/5kH1zanS7l
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2037180400334758023
Draft
What stands out here isn’t just the model stack — it’s the ambition to map naturalistic, multimodal input to brain response in one frame. That feels like a stronger benchmark for whether these systems are capturing structure humans actually use.
245 chars
OpenSource
Req 2026-03-26T1501-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-03-26 14:47:52.000000
Meta is back: Meta just dropped TRIBE v2, a foundation model that predicts how your brain responds to sight, sound, and language. Trained on 500+ hours of fMRI data from 700+ people, it can predict a new person's brain activity without any retraining, and its predictions are actually more accurate than a real brain scan. Neuroscience just got a serious AI upgrade!
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/AIatMeta/status/2037153756346016207
Quoted original
AI at Meta (@AIatMeta) · Thu Mar 26 13:04:38 +0000 2026
Today we're introducing TRIBE v2 (Trimodal Brain Encoder), a foundation model trained to predict how the human brain responds to almost any sight or sound. Building on our Algonauts 2025 award-winning architecture, TRIBE v2 draws on 500+ hours of fMRI recordings from 700+ people https://t.co/vRoVj8gP4j
Draft
What matters here isn’t just the model. It’s the direction: brain prediction is starting to look less like bespoke neuroscience and more like a foundation-model problem.
169 chars
OpenSource
Req 2026-03-26T1501-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 84
Source
2026-03-26 13:40:07.000000
@huggingface Nemotron 3 Super is now a leading reasoning foundation for @OpenClaw 🦞 and complex agentic workflows, with 1.5M+ downloads in its first two weeks. 🤗 https://t.co/DSSLIXYIDz https://t.co/CMn1nxLvcH
primary source_tweetref tweet
reference: https://x.com/lifebypixels/status/2037162687210136036
Draft
Worth watching: the center of gravity is shifting from raw model bragging rights to what actually holds up inside agent loops. Different bar.
141 chars
OpenSource
Req 2026-03-26T1401-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 92
Source
2026-03-26 13:40:03.000000
Our Nemotron Nano 12B v2 VL brings video understanding on-prem. MediaPerf benchmark launched by Coactive ranks our 12B model on par with 30B-size models at less than half the footprint: ✅ Cost Efficiency: Lowest cost for Tagging Refinement workload. ✅ Pro-Grade Quality: 0.299 https://t.co/5CIPPM5kKK
primary source_tweetref tweet
reference: https://x.com/lifebypixels/status/2037162667270390204
Draft
Interesting positioning: not just “good for 12B,” but a claim that the useful boundary is moving down. If that holds in real workloads, on-prem video stacks get a lot more practical.
182 chars
OpenSource
Req 2026-03-26T1401-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-26 13:17:09.000000
OpenAI is backing Isara, a new startup founded by two 23-year-old AI researchers that coordinates thousands of AI agents to solve complex problems, like using ~2,000 agents to forecast gold prices. The company just raised $94M at a $650M valuation and plans to sell predictive modeling tools to finance firms first.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/WSJ/status/2037146683960676492
Quoted original
The Wall Street Journal (@WSJ) · Thu Mar 26 12:36:32 +0000 2026
Exclusive: OpenAI is backing a new AI startup that aims to build software allowing so-called AI “agents” to communicate and solve complex problems in industries such as finance and biotech https://t.co/qLRGZUguvk
Draft
The interesting part isn’t just “more agents.” It’s where the capital and credibility are going: coordination as the product category. That’s a bet the edge won’t be a single model, but systems that can split, route, check, and converge on harder work.
253 chars
OpenSource
Req 2026-03-26T1401-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-26 09:48:59.000000
Nemotron remains completely underrepresented. NVIDIA has cooked up a storm, and most people are still ignoring it. https://t.co/qbNM8h7UsI
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/NVIDIAAIDev/status/2036928025304936619
Quoted original
NVIDIA AI Developer (@NVIDIAAIDev) · Wed Mar 25 22:07:40 +0000 2026
@huggingface Nemotron 3 Super is now a leading reasoning foundation for @OpenClaw 🦞 and complex agentic workflows, with 1.5M+ downloads in its first two weeks. 🤗 https://t.co/DSSLIXYIDz https://t.co/CMn1nxLvcH
Draft
Worth paying attention when an open reasoning model starts showing up as infrastructure, not just as a benchmark result. That’s usually when the story is bigger than the launch post.
183 chars
OpenSource
Req 2026-03-26T1001-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 84
Source
2026-03-26 06:30:08.000000
Periodic Labs update: https://t.co/DoqvapylRd
primary source_tweetref media
reference: https://x.com/garrytan/status/2037054477711401006
Quoted original
Rohan Pandey (@khoomeik) · Tue Sep 30 16:06:05 +0000 2025
Excited to finally announce what I've been up to since leaving OpenAI: autonomous science at @PeriodicLabs Why? Because I Take the Bitter Lesson Seriously To accelerate AI, we must enable it to hill-climb compute &amp; energy through experimentally verifiable science 🧵
Draft
Periodic Labs is worth tracking. https://x.com/AndrewCurran_/status/2036851014469640248/photo/1
95 chars
OpenSource
Req 2026-03-26T0801-TOP3
POSTpost_short_externalready_for_reviewrisk lowscore 86
Source
2026-03-26 06:34:12.000000
One of the most important things about this new age is you have to use tokens aggressively to create something remarkable You have to let it rip. If you do, and you have agency and taste, the result will be remarkable. So token credits for AI is a big part of making startups accessible regardless of where you grew up or whether your family has money
primary quoted_tweetsecondary quote_wrapperref external_url
reference: https://events.ycombinator.com/yc-sus-india
Quoted original
Y Combinator (@ycombinator) · Thu Mar 26 03:01:50 +0000 2026
Every student accepted into Startup School India now gets $25k+ in AI and cloud credits. Apply, get in, and start building: https://t.co/gncXSJGhdb
Draft
Startup School India now bundles $25k+ in AI and cloud credits for every accepted student, meaningfully lowering the cost of building from day one. https://events.ycombinator.com/yc-sus-india
191 chars
OpenSource
Req 2026-03-26T0801-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 96
Source
2026-03-26 07:47:26.000000
32GB of VRAM for under $1000. I want to see the benchmarks now.
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/digitalix/status/2036820057599197645
Quoted original
Alex Ziskind (@digitalix) · Wed Mar 25 14:58:38 +0000 2026
32GB of VRAM for under $1000! The Intel Arc Pro B70 just landed. https://t.co/2iwJrCRAAg
Draft
Intel’s Arc Pro B70 puts 32GB of VRAM under $1000. That’s what matters. For local AI and pro workloads, memory capacity is still the bottleneck — and this lowers the price floor. https://t.co/2iwJrCRAAg
202 chars
OpenSource
Req 2026-03-26T0801-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 90
Source
2026-03-26 06:18:34.000000
After reading it, this should be bigger news. Crazy stuff. Why it's cool: Hermes agent = self-improving memory &amp; skills. HyperAgents = self-improving behavior of the agent. For example, it starts without a memory and after a few iterations discovers the need for particular
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2037051565262200979
Quoted original
Jenny Zhang (@jennyzhangzt) · Mon Mar 23 15:17:08 +0000 2026
Introducing Hyperagents: an AI system that not only improves at solving tasks, but also improves how it improves itself. The Darwin Gödel Machine (DGM) demonstrated that open-ended self-improvement is possible by iteratively generating and evaluating improved agents, yet it https://t.co/YJPFTJ51SO
Draft
What’s interesting isn’t just that agents improve. It’s that the stack is starting to write its own operating manual: memory, skills, then behavior. Once that loop closes, progress stops looking like feature work and starts compounding.
236 chars
OpenSource
Req 2026-03-26T0701-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-26 06:19:18.000000
very special event tonight launching @arcprize v3 with @sama @mikeknoop @fchollet @GregKamradt @ @ycombinator SOTA results are 0.37%! 🤯 new gauntlet thrown AI ppl. new ideas needed. https://t.co/ODnkKDtaMX
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2037051750860062766
Draft
The interesting part isn’t the event. It’s the gap. If the frontier is still this far from robust abstraction, the next wave probably won’t come from pushing the same scaling playbook even harder.
196 chars
OpenSource
Req 2026-03-26T0701-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 84
Source
2026-03-26 05:58:28.000000
more context: based on this and 3 other images, it created this sim (in isaac). tool's not perfect yet and we're looking for feedback to improve it https://t.co/MtHmxBoFdi
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2037046506960871560
Quoted original
Shrey Kothari (@Shreyko) · Wed Mar 25 17:26:27 +0000 2026
we’re opening this up to testers. it now supports both isaac and mujoco. if you’re interested in automating sim creation, reach out
Draft
The interesting part isn’t just the demo. It’s the direction: fewer steps between seeing a scene and having something you can test in a simulator.
146 chars
OpenSource
Req 2026-03-26T0601-TOP2
QUOTEquote_long_externalready_for_reviewrisk mediumscore 85
Source
2026-03-26 05:22:08.000000
Dang I thought $40k for this mo was crazy but I guess I am rookie numbers $150k #april_goals
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/shiri_shh/status/2036855323244584973
Quoted original
shirish (@shiri_shh) · Wed Mar 25 17:18:46 +0000 2026
The Anthropic team is dogfooding Claude Code at insane levels. In the last 52 days, the Claude team dropped 50+ major UPDATES. One employee alone hit $150,000 in a single month on Claude Code 80% of employees use it daily, with power users racking up six-figure bills. https://t.co/NZob3EtDEp
Draft
The strongest signal here isn’t the bill. It’s the feedback loop. When internal usage gets this intense, velocity stops being a roadmap story and starts showing up in the product.
180 chars
OpenSource
Req 2026-03-26T0601-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-26 04:06:27.000000
I trained models across MacBooks using Apple's AirDrop protocol. grove is a distributed training library for Apple Silicon. Devices discover each other over AWDL, a direct radio link. If there's a shared WiFi network it upgrades to that for speed, otherwise everything goes over https://t.co/0OrpqdR40z
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2037018317182165240
Draft
The clever part here isn’t just distributed training on Macs. It’s using local discovery as the interface. Less cluster setup, more: open the laptops and start.
161 chars
OpenSource
Req 2026-03-26T0501-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 95
Source
2026-03-25 23:14:21.000000
New on the Engineering Blog: How we designed Claude Code auto mode. Many Claude Code users let Claude work without permission prompts. Auto mode is a safer middle ground: we built and tested classifiers that make approval decisions instead. Read more: https://t.co/dpcMcWMf5k
primary source_tweetref tweet
reference: https://x.com/AnthropicAI/status/2036944806317088921
Draft
The interesting product choice here isn’t whether to ask. If users are already approving almost everything, the real problem is whether “don’t ask” can be made legible, bounded, and defensible.
193 chars
OpenSource
Req 2026-03-26T0001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-25 22:35:12.000000
WebGPU is INSANE! 🤯 Here's a 24B parameter model running locally in a web browser, at a blazing ~50 tokens/second on my M4 Max. ⚡️ It's the largest model we've ever run with Transformers.js... and we're not stopping here. Big announcement soon. https://t.co/4emPjY89ba
primary source_tweetref tweet
reference: https://x.com/liquidai/status/2036934956963037185
Draft
The interesting part isn’t just the speed. It’s when “browser demo” stops meaning toy and starts meaning serious local inference.
129 chars
OpenSource
Req 2026-03-25T2301-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-25 21:59:06.000000
Which local models can actually handle tool calling? I built a framework to find out. 15 scenarios. 12 tools. Mocked responses. Temperature 0. No cherry-picking. Tested every Qwen3.5 size from 0.8B to 397B, and since some of you asked after the distillation tests: yes, I https://t.co/Naw1BBwhKl
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2036925869655216161
Draft
The useful shift here isn’t just rankings. It turns “can this local model use tools?” from vibes into something testable. That’s the baseline the local stack has been missing.
175 chars
OpenSource
Req 2026-03-25T2201-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-25 21:10:01.000000
24B model running in your browser!
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/xenovacom/status/2036908326462665211
Quoted original
Xenova (@xenovacom) · Wed Mar 25 20:49:23 +0000 2026
WebGPU is INSANE! 🤯 Here's a 24B parameter model running locally in a web browser, at a blazing ~50 tokens/second on my M4 Max. ⚡️ It's the largest model we've ever run with Transformers.js... and we're not stopping here. Big announcement soon. https://t.co/4emPjY89ba
Draft
The interesting part isn’t just the demo speed. It’s the direction of travel: serious local inference moving into the browser changes distribution, UX, and who gets to ship AI without owning the full stack.
206 chars
OpenSource
Req 2026-03-25T2201-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 89
Source
2026-03-25 20:21:09.000000
Longer tracks are here with Lyria 3 Pro in Gemini! From experimenting with different styles to generating tracks with complex transitions, Lyria 3 Pro makes it easier to bring your full vision to life. Rolling out today to Google AI Plus, Pro, and Ultra users. Learn more 🧵
primary source_tweetref tweet
reference: https://x.com/demishassabis/status/2036901221756469364
Draft
The upgrade here isn’t just better output. It’s longer-form musical structure as a product feature.
99 chars
OpenSource
Req 2026-03-25T2101-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 90
Source
2026-03-25 20:02:30.000000
Hermes Agent now supports Nix
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/sidbing/status/2036891031480987660
Quoted original
sid is in sf 🌉 (@sidbing) · Wed Mar 25 19:40:40 +0000 2026
hermes agent is now nix friendly! just merged PR #20 that lets you install and run hermes agent on your nix system! for the security conscious, `container.enable = true` will run hermes in an ubuntu 24.04 container. you can have: nix-reproducible agent, mutable ubuntu runtime https://t.co/DV9b3zaq0O
Draft
This is the right kind of compatibility win: not just "it runs on Nix," but a cleaner split between reproducible setup and pragmatic runtime. That’s where more agent tooling is headed.
184 chars
OpenSource
Req 2026-03-25T2101-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 97
Source
2026-03-25 20:45:07.000000
Our music model, Lyria 3 Pro is here! Now available in AI Studio, Gemini app, Vids, and the Gemini API for you to build dynamic music apps with. - Tracks up to 3 minutes - Studio quality sound - And even more prompting control Go build 🔥 https://t.co/EdIUk3yoOs
primary source_tweetref tweet
reference: https://x.com/ammaar/status/2036907251521204506
Draft
The headline is the model. The story is distribution: when music generation ships across Studio, Gemini, Vids, and the API at once, it stops looking like a demo and starts looking like infrastructure.
200 chars
OpenSource
Req 2026-03-25T2101-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 86
Source
2026-03-25 19:13:05.000000
Feels material given the movement to high powered workstations from apple (studio with m5 coming!) and dell’s new NVIDIA workstation The future of AI will be a wild blend workstations, open source models, peer to peer $tao subnets & unlimited space compute from @spacex Can you imagine the token surplus that could drop if the AI buildout continues with this much completion and progress?!? Tokens are going to plummet is cost 100x
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/GoogleResearch/status/2036533564158910740
Quoted original
Google Research (@GoogleResearch) · Tue Mar 24 20:00:13 +0000 2026
Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: https://t.co/CDSQ8HpZoc https://t.co/9SJeMqCMlN
Draft
This is the kind of progress that quietly changes the cost curve. If the memory bottleneck really moves this much without giving up quality, a lot more of the stack becomes viable on smaller boxes, cheaper clusters, and open deployments—not just hyperscale infrastructure.
273 chars
OpenSource
Req 2026-03-25T2001-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 100
Source
2026-03-25 19:22:05.000000
new hardest benchmark just dropped
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/scaling01/status/2036853669065306534
Quoted original
Lisan al Gaib (@scaling01) · Wed Mar 25 17:12:12 +0000 2026
ARC-AGI-3 scores for GPT-5.4, Gemini 3.1 Pro and Opus 4.6 Gemini 3.1 Pro: 0.37% GPT-5.4: 0.26% Opus 4.6: 0.25% Grok 4.2: 0% https://t.co/m6pUdQDerQ
Draft
ARC-AGI-3 starts with almost nothing on the board: Gemini 3.1 Pro at 0.37%, GPT-5.4 at 0.26%, Opus 4.6 at 0.25%, and Grok 4.2 at 0%. Whatever progress looks like on this benchmark, it isn’t here yet. https://x.com/scaling01/status/2036853669065306534
250 chars
OpenSource
Req 2026-03-25T2001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 91
Source
2026-03-25 17:54:33.000000
Just switched from S3 to HF Buckets for storing datasets and checkpoints in my training runs. Same workflow, just `hf sync` instead of `s5cmd sync`. Quick benchmark on 410GB of tokenized data: https://t.co/jJzPqBTX8x
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2036864328985432378
Draft
The point isn’t just “S3 alternative.” It’s collapsing storage closer to where training already happens—less glue, fewer excuses for brittle infra.
147 chars
OpenSource
Req 2026-03-25T1801-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-25 17:22:41.000000
They are developing Claude into the app that ChatGPT wanted to be.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2036850783526719610
Quoted original
Claude (@claudeai) · Wed Mar 25 17:00:44 +0000 2026
Your work tools in Claude are now available on mobile. Explore Figma designs, create Canva slides, check Amplitude dashboards, all from your phone. Give it a try: https://t.co/hwPB3zlk0w https://t.co/646YMIzYZl
Draft
Anthropic keeps pushing Claude toward a real work surface, not just a chatbot. Product ambition gets exposed fast when the screen gets smaller.
143 chars
OpenSource
Req 2026-03-25T1801-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 86
Source
2026-03-25 15:18:44.000000
Chinese authorities barred two Manus co-founders from leaving the country, the FT reported, heightening scrutiny over Meta’s acquisition of the fast-rising agentic AI startup https://t.co/D838P18XWC
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2036825113832890590
Draft
This is what “AI talent acquisition” looks like when it runs into state capacity, export sensitivity, and geopolitics. Not just a deal story anymore.
149 chars
OpenSource
Req 2026-03-25T1701-TOP3
POSTpost_short_externalready_for_reviewrisk lowscore 90
Source
2026-03-25 16:40:46.000000
Huh. I am not sure distilling Gemini models to run on phones is going to result in the generally capable agents that people will soon expect, but we shall see.
primary quoted_tweetsecondary quote_wrapperref media
reference: https://x.com/amir/status/2036830809634849051
Quoted original
Amir Efrati (@amir) · Wed Mar 25 15:41:22 +0000 2026
turns out Apple is ~distilling~ Google's Gemini model to produce other AI models for the Siri/consumer features it wants to launch https://t.co/cXjFxzfk7d
Draft
Apple reportedly isn’t just licensing Gemini. It’s distilling parts of it into smaller on-device models for future Siri and consumer AI features. If true, that’s a far more pragmatic Apple AI story than “just use Google.” https://t.co/cXjFxzfk7d
245 chars
OpenSource
Req 2026-03-25T1701-TOP2
POSTpost_short_externalready_for_reviewrisk lowscore 91
Source
2026-03-25 14:31:01.000000
this seems big. tho curious how this ties into inventory mgmt, fulfillment, etc. using fb marketplace infra or…
primary quoted_tweetsecondary quote_wrapperref document
reference: https://stripe.com/newsroom/news/checkout-for-facebook
Quoted original
John Collison (@collision) · Tue Mar 24 23:29:09 +0000 2026
Businesses can now sell directly within an ad or browsing session on Facebook, powered by the Agentic Commerce Protocol and @stripe https://t.co/M7MbkM0vlL
Draft
Stripe is bringing checkout directly into Facebook ads and browsing sessions via the Agentic Commerce Protocol. Less drop-off, less handoff, more of the transaction happening where discovery already begins. https://stripe.com/newsroom/news/checkout-for-facebook
261 chars
OpenSource
Req 2026-03-25T1701-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 93
Source
2026-03-25 14:50:45.000000
🥳Excited to see that 🧬Xperience-10M🧬 has been listed in @huggingface "most downloads datasets", hitting *1M downloads within 1 week*! - Dataset link @HuggingModels: https://t.co/MiKukkLLOv https://t.co/9hEVInwXuH
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2036818071948517474
Quoted original
Ropedia (@ropedia_ai) · Tue Mar 17 01:20:37 +0000 2026
Today Ropedia releases Xperience-10M at #GTC day 1 — World largest real human 4D interaction dataset at 10M scale. Each trajectory aligns: • visual observations • spatial structure • human motion • interaction dynamics • task semantics A new foundation for https://t.co/LnLgCyqNaE
Draft
The signal here isn’t the download count alone. It’s where the rush is headed: richer, messier, multimodal data as the bottleneck shifts from models to grounding.
162 chars
OpenSource
Req 2026-03-25T1601-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-25 14:07:19.000000
SOC II is in the news right now for being security theater.. You know what SOC II is *actually* good for? Subprocessor lists. I scraped 417 companies subprocessors to investigate what AI native companies are using for their infrastructure. Introducing DeployGraph dot com 🥞 https://t.co/Zhg2B8elke
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2036807144566337959
Draft
Useful inversion here: SOC 2 may say less about security than people want, but it can still reveal the stack underneath a company. That turns a compliance artifact into market intel.
182 chars
OpenSource
Req 2026-03-25T1601-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-25 14:16:50.000000
Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: https://t.co/CDSQ8HpZoc https://t.co/9SJeMqCMlN
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2036809537119039584
Draft
The headline is the speedup. The real story is what it does to the bottleneck. If this holds up in practice, a lot of the tradeoffs around long-context inference may be less fixed than they looked a week ago.
208 chars
OpenSource
Req 2026-03-25T1501-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-25 14:43:07.000000
When I was consulting for @HBO Silicon Valley, zero-loss compression was the holy grail Richard Hendricks chases that perfect middle-out algo could shrink everything w/out breaking a single bit. Google just did something even more practical for the AI era: TurboQuant compresses https://t.co/fjo0vGICJW
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2036816152186495147
Quoted original
Google Research (@GoogleResearch) · Tue Mar 24 20:00:13 +0000 2026
Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: https://t.co/CDSQ8HpZoc https://t.co/9SJeMqCMlN
Draft
What matters here isn’t just compression. It’s where progress is starting to show up: not only in bigger models, but in making existing ones cheaper, faster, and easier to ship.
177 chars
OpenSource
Req 2026-03-25T1501-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-25 14:27:44.000000
I got a 1T (trillion) parameter model running on my MacBook Pro. Kimi-K2. 1.029T params. ~1 TB raw weights. 524 GB converted. ~1.7 tok/s. Yesterday it was 671B. Today it's 1T. Same laptop. Same M4 Max. No cloud. When I say we: I mean Claude and me. https://t.co/2nJlI29dqg
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2036812278750937563
Draft
The pace shift is the story: not just bigger models, but bigger models crossing into “same laptop, no cloud” territory faster than most people expected.
152 chars
OpenSource
Req 2026-03-25T1501-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 93
Source
2026-03-25 12:50:18.000000
"A new, much larger (DeepSeek) base model will be released soon", from DeepSeek staff. I'm currently wondering why there's been so much silence surrounding DeepSeek. The last report stated that they attempted to train on Huawei chips but failed ("DeepSeek AI model failed to train using Huawei’s chips"). This highlights the continued strong dependence on NVIDIA, even internationally. At the same time, other Chinese companies have caught up, though often through selective distribution (remember Anthropic's blog post?). Therefore, I venture a prediction: DeepSeek won't cause the same shock as it did in January 2025, when they essentially produced o1-level AI at a lower cost. They will release a good model, but it will fall short of expectations.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/AiBattle_/status/2036773307811258550
Quoted original
AiBattle (@AiBattle_) · Wed Mar 25 11:52:52 +0000 2026
New info seemingly from DeepSeek staff - A new, much larger base model will be released soon - DeepSeek V3.2 seems to be a larger model than the one currently deployed on the web I used Gemini for translation https://t.co/kD80hTpHOP
Draft
If this is right, the interesting part isn’t that it’s bigger. It’s whether DeepSeek can turn scale into a real market surprise.
128 chars
OpenSource
Req 2026-03-25T1301-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 100
Source
2026-03-25 11:25:47.000000
We wrote the open source playbook and then abandoned it. Stupid. But we'll figure out open source is the way again. You can always count on Americans to do the right thing, after exhausting every other possibility.
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2036766492478361937
Quoted original
Rohan Paul (@rohanpaul_ai) · Tue Mar 24 15:35:22 +0000 2026
Reuters: China may be gaining on the U.S. in AI not by owning the very best chips, but by winning the open-source distribution game and turning cheap deployment into a data flywheel. Low-cost open models from firms like Alibaba, Moonshot, and MiniMax are spreading so widely that https://t.co/2NFwPSUckC
Draft
Open source isn’t charity. It’s distribution, talent density, and faster reality-testing. When the field closes up, it usually gets worse before it gets smarter.
161 chars
OpenSource
Req 2026-03-25T1201-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-25 09:13:06.000000
Thats freaking awesome: Google Research has introduced TurboQuant, a compression algorithm (presenting at ICLR 2026) that shrinks the memory footprint of large language models by at least 6x, without any retraining or drop in accuracy. It works by converting data into a polar coordinate system that eliminates storage overhead, then applying a 1-bit error-correction step to clean up remaining distortion. In tests on Gemma and Mistral models, its 4-bit version delivered up to 8x faster processing on H100 GPUs while matching full-precision quality across tasks like question answering and code generation. The technique also outperformed existing methods in vector search, the technology behind modern semantic search engines.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/GoogleResearch/status/2036533564158910740
Quoted original
Google Research (@GoogleResearch) · Tue Mar 24 20:00:13 +0000 2026
Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: https://t.co/CDSQ8HpZoc https://t.co/9SJeMqCMlN
Draft
What matters isn’t just the compression claim. If gains like this actually hold up in deployment, the KV cache starts to look less like a scaling tax and more like the next optimization frontier.
195 chars
OpenSource
Req 2026-03-25T1001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 79
Source
2026-03-25 08:46:01.000000
Excited to partner with Agile Robots! Looking forward to seeing our models being deployed through Agile Robots incredible platform to help solve some of the most complex industrial challenges
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/GoogleDeepMind/status/2036418139672482229
Quoted original
Google DeepMind (@GoogleDeepMind) · Tue Mar 24 12:21:33 +0000 2026
Google DeepMind 🤝 Agile Robots Our new research partnership will integrate the Gemini foundation models with their hardware to help build the next generation of more helpful and useful robots. Find out more → https://t.co/dptWjeZwya https://t.co/bzPUUvPiJp
Draft
What matters here isn’t another AI partnership announcement. It’s the shift to judging foundation models by deployment, reliability, and industrial usefulness—not just demos.
174 chars
OpenSource
Req 2026-03-25T0901-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 91
Source
2026-03-25 08:02:45.000000
.@interaction poke in your macos desktop, natively quick popover icon speak to poke, poke speaks back to you share screenshots and files open source, repo link below 👇 https://t.co/oPxuwkR0Hy
primary source_tweetref tweet
reference: https://x.com/interaction/status/2036715398239002914
Draft
Neat product demos are easy. Shipping a native, open-source desktop loop that feels this immediate is the harder part.
118 chars
OpenSource
Req 2026-03-25T0901-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-25 06:53:20.000000
Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: https://t.co/CDSQ8HpZoc https://t.co/9SJeMqCMlN
primary source_tweetref tweet
reference: https://x.com/abhi1thakur/status/2036697926886395961
Draft
The interesting part isn’t the headline speedup. It’s what happens when KV cache stops being the quiet bottleneck behind serving cost and latency.
146 chars
OpenSource
Req 2026-03-25T0701-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 100
Source
2026-03-25 05:43:16.000000
Turns out you can run enormous Mixture-of-Experts on Mac hardware without fitting the whole model in RAM by streaming a subset of expert weights from SSD for each generated token - and people keep finding ways to run bigger models Kimi 2.5 is 1T, but only 32B active so fits 96GB
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2036680293890060571
Quoted original
seikixtc (@seikixtc) · Tue Mar 24 00:58:11 +0000 2026
I got a 1T-parameter model running locally on my MacBook Pro. LLM: Kimi K2.5 1,026,408,232,448 params (~1.026T) Hardware: M2 Max MacBook Pro (2023) w/ 96GB unified memory Running on MLX with a flash-style SSD streaming path + local patching. This is an experimental setup and https://t.co/qfoblgUpY5
Draft
Bigger than the headline: local inference stops being constrained by total parameter count and starts being constrained by active parameters plus bandwidth. That changes what “runs on a Mac” even means.
202 chars
OpenSource
Req 2026-03-25T0601-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 90
Source
2026-03-25 02:03:09.000000
Claude Cowork automates your Mac or PC, handling apps, files, emails, and spreadsheets with a simple command. https://t.co/gAHvCCED9I
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2036624900245373030
Draft
Claude Cowork pushes the Claude Code idea beyond code: one command to work across apps, files, email, and spreadsheets on your computer. If it holds up, that’s the real shift—AI moving from chat window to operator. https://claude.com/product/cowork
248 chars
OpenSource
Req 2026-03-25T0301-TOP2
QUOTEquote_long_nativeready_for_reviewrisk lowscore 93
Source
2026-03-25 02:47:01.000000
14 YC startups hit $1M ARR before demo day 🤯 That's before raising a dollar of VC. AI is massively expanding who gets to build. No more gatekeepers.
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2036635940719046768
Quoted original
Garry Tan (@garrytan) · Tue Mar 24 21:56:25 +0000 2026
YC Demo Day for W26 is in full swing The craziest stat: 3X more companies in this batch reached $1M annualized revenue than W25 Also crazy: the fastest revenue growth rate of YC history at 14% week on week growth *on average* across the whole set of nearly 200 startups https://t.co/C3fRy60ifK
Draft
The interesting part isn’t just speed. Default ambition is getting cheaper. If teams can reach real revenue before the fundraising machine even starts, capital stops being permission and becomes leverage.
206 chars
OpenSource
Req 2026-03-25T0301-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 82
Source
2026-03-24 22:24:28.000000
BREAKING: President &amp; Head of AI @Replit, Michele Catasta (@pirroh) is joining us tomorrow at 1:45PM GMT https://t.co/64jVpYwKa8
primary source_tweetref external_url
reference: https://t.co/64jVpYwK
Quoted original
Amjad Masad (@amasad) · Wed Mar 11 15:32:21 +0000 2026
Software isn’t merely technical work anymore. It’s creative. Introducing Replit Agent 4. The first AI built for creative collaboration between humans and agents. Design on an infinite canvas, work with your team, run parallel agents, and ship working apps, sites, slides &amp; more. https://t.co/VCucf86wX6
Draft
Michele Catasta, Replit’s President and Head of AI, joins tomorrow at 1:45PM GMT. Worth watching if you want the clearest read on where AI coding products are headed next. https://t.co/64jVpYwK
193 chars
OpenSource
Req 2026-03-25T0001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 91
Source
2026-03-24 22:52:46.000000
Michele Catasta (@pirroh) President of Replit (@Replit) says "AGI for vibe-coding is coming earlier than 2028": "I would expect by the end of 2026, a lot of the core functionalities that models and agents have to have, in order to become very good vibe coding workhorses, will be https://t.co/CT33p75See
primary source_tweetref tweet
reference: https://x.com/amasad/status/2036576988090315099
Quoted original
Amjad Masad (@amasad) · Wed Mar 11 15:32:21 +0000 2026
Software isn’t merely technical work anymore. It’s creative. Introducing Replit Agent 4. The first AI built for creative collaboration between humans and agents. Design on an infinite canvas, work with your team, run parallel agents, and ship working apps, sites, slides &amp; more. https://t.co/VCucf86wX6
Draft
The interesting part isn’t the date. It’s the claim that “vibe coding” stops being a demo edge case and becomes a dependable software primitive much sooner than most people are pricing in. If that’s right, the bottleneck shifts from model capability to taste, constraints, and verification.
291 chars
OpenSource
Req 2026-03-24T2301-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 98
Source
2026-03-24 22:46:53.000000
Introducing OpenReward. 🌍 330+ RL environments through one API ⚡ Autoscaled sandbox compute 🍒 4.5M+ unique RL tasks 🚂 Works like magic with Tinker, Miles, Slime Link and thread below. https://t.co/4fIlVKUkOF
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2036575506087231867
Draft
Interesting not just as another RL dataset drop, but as infrastructure for making RL iteration feel less bespoke. If that abstraction holds, more of the bottleneck shifts from environment wrangling to taste, objectives, and evaluation.
235 chars
OpenSource
Req 2026-03-24T2301-TOP2
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 100
Source
2026-03-24 19:33:46.000000
Software horror: litellm PyPI supply chain attack. Simple `pip install litellm` was enough to exfiltrate SSH keys, AWS/GCP/Azure creds, Kubernetes configs, git credentials, env vars (all your API keys), shell history, crypto wallets, SSL private keys, CI/CD secrets, database
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2036526907274174585
Quoted original
Daniel Hnyk (@hnykda) · Tue Mar 24 12:06:25 +0000 2026
LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below
Draft
The nightmare scenario in one line: package compromise became infrastructure compromise. If a routine install can reach keys, cloud creds, kube configs, CI secrets, and wallets, this is a blast-radius audit for anyone shipping Python.
235 chars
OpenSource
Req 2026-03-24T2301-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-24 21:30:12.000000
Humans can see in high-res, high-FPS in real-time. Why can't VLMs? Introducing AutoGaze: ViTs/VLMs "gaze" only at key video regions! Up to 4-100x token savings, 19x speedup, and enables scaling to 4K-res 1K-frame videos. 📄 https://t.co/a14yqNRPlh 🌐 https://t.co/ifLNMUIL3J 🤗 https://t.co/O0A0WLrkxb
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2036556208438698362
Draft
The next jump in video models probably comes from better allocation, not just more compute. The real signal is what becomes practical once high-res, long-context video is no longer prohibitively expensive.
205 chars
OpenSource
Req 2026-03-24T2201-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-24 20:28:10.000000
This is getting insane. Someone actually ran the 1 trillion Kimi K2.5 model locally on their MacBook M2 Pro Max. It was just an experiment but still super crazy to do. https://t.co/41I4QzWs7f
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2036540600535884046
Quoted original
Shubham Saboo (@Saboo_Shubham_) · Sun Mar 22 16:49:00 +0000 2026
You can now run Qwen 3.5 397B parameter model on your MacBook. 48GB RAM. Pure C. Hand-tuned Metal shaders. No Python, no frameworks. 4.4 tok/s. Built in 24 hours. Human + AI Agent pair programming. 90+ experiments. https://t.co/YXOIfrMbgE
Draft
The signal here isn’t the stunt. It’s the direction of travel: models that once felt datacenter-only are starting to feel testable on personal hardware. Once that flips, the edge of experimentation moves fast.
209 chars
OpenSource
Req 2026-03-24T2101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-03-24 19:26:53.000000
Local AI is free, fast &amp; secure! So today we're introducing hf-mount: attach any storage bucket, model or dataset from @huggingface as a local filesystem. This is a game changer, as it allows you to attach remote storage that is 100x bigger than your local machine's disk. This https://t.co/43waAagbVr
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2036525177484271621
Draft
Useful shift here: the win isn’t just local AI. It’s making remote model and dataset storage feel like it’s already on the machine, which collapses a lot of the friction between local experimentation and working at real scale.
226 chars
OpenSource
Req 2026-03-24T2001-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-24 19:26:50.000000
Today we're releasing MolmoWeb, an open source agent that can navigate + complete tasks in a browser on your behalf. Built on Molmo 2 in 4B &amp; 8B sizes, it sets a new open-weight SOTA across four major web-agent benchmarks &amp; even surpasses agents built on proprietary models. 🧵 https://t.co/ivUIcQDXtm
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2036525161956941829
Draft
Open-weight browser agents are crossing an important line: from interesting demo to genuinely competitive on the thing that matters—actually getting through real web tasks.
172 chars
OpenSource
Req 2026-03-24T2001-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-24 19:27:13.000000
Synthetic data generation is now native in transformers 🔥 Last week, transformers continuous batching (CB) hit 84% of vLLM throughput. This week, we tuned torch.compile: now we are at 95% for 8K generation length 🦾 The gap isn't closing anymore. It's gone.💀 https://t.co/t5J9TOA5zM
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2036525262158827753
Draft
What stands out here isn’t just the feature surface area. It’s how fast the performance excuse disappears. Once “native in transformers” also means near-vLLM throughput, the default path gets a lot simpler.
207 chars
OpenSource
Req 2026-03-24T2001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-24 18:59:46.000000
Apples Siri gets a huge AI rework this year, biggest rework ever. The summary tl;dr •Apple is rebuilding Siri into a system-wide AI agent, not just a voice assistant •iOS 27 introduces a conversational, chat-like Siri (text + voice) •New standalone Siri app with chat history, file uploads, and persistent interactions •“Ask Siri” button integrates AI across apps (contextual actions on selected content) •Siri gains deep access to personal data (messages, emails, notes) for task execution •Replaces Spotlight with a unified AI search + assistant interface •Can perform in-app actions and browse the web with Apple-built models •“Write with Siri” adds system-wide AI writing/editing tools •Powered by Apple Foundation Models + Google Gemini partnership •Many advanced features delayed, expected rollout continues into late 2026
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/markgurman/status/2036516958955356640
Quoted original
Mark Gurman (@markgurman) · Tue Mar 24 18:54:14 +0000 2026
BREAKING: Apple’s AI reboot this year detailed — Dedicated Siri app to rival ChatGPT; Overhauled Siri interface in the Dynamic Island with chatbot; Unified Siri and Spotlight Search; and “Ask Siri” &amp; “Write with Siri” features. https://t.co/LG4k4U5CGB
Draft
If this is where Apple is headed, the bigger story isn’t that Siri gets better. It’s Apple collapsing assistant, search, and system actions into a single interface. That’s a much bigger product shift than a chatbot refresh.
223 chars
OpenSource
Req 2026-03-24T1901-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 98
Source
2026-03-24 18:28:27.000000
This is insane. They literally drop every freaking day. Today: Claude code auto mode. Permission decisions on your behalf.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2036503582166393240
Quoted original
Claude (@claudeai) · Tue Mar 24 18:01:05 +0000 2026
New in Claude Code: auto mode. Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf. Safeguards check each action before it runs. https://t.co/kHbTN2jrWw
Draft
The interesting part isn’t just fewer approval clicks. It’s the shift from permission prompts as a constant UX layer to policy-backed delegation. That’s when these tools stop feeling like demos and start feeling like operating systems for work.
246 chars
OpenSource
Req 2026-03-24T1901-TOP1
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 94
Source
2026-03-24 17:25:23.000000
This is pure nightmare fuel. Identity theft of the past would be nothing compared to what vibe agents can do. Sending credentials is too obvious and for rookies. They could easily spread contaminations across ~/.claude, **/skills/*, or even just a PDF your agent visits periodically in /morning-brief. Your entire filesystem is the new distributed codebase. Every file that could go into context would add to the attack vector. Every text can be a base64 virus. In the new world of on-demand software, I try to minimize dependencies - people rarely need all the APIs supported in LiteLLM, might as well build a custom router with only what you need on the fly (which I did in one of my late-night claude sessions). Unfortunately, there is very little middleground between "pressing yes mindlessly for every edit" and "--dangerously-skip-permissions". There will be a full blooming industry for "de-vibing": dampening the slop and putting guardrails/accountability around agentic frameworks. They are the boring old, audited Software 1.0 that watches over the rebellious adolescents of Software 3.0. Claws need shells. Probably many layers of nested shells.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/hnykda/status/2036414330267193815
Quoted original
Daniel Hnyk (@hnykda) · Tue Mar 24 12:06:25 +0000 2026
LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below
Draft
Software supply-chain risk in the agent era looks like this: one dependency update becomes credential theft, lateral movement, and a write-access parasite inside your workflow. The worst part isn’t just exfiltration. It’s context poisoning. When every file, prompt, note, and tool output can get pulled back into the loop, the filesystem stops being storage and becomes an attack surface.
390 chars
OpenSource
Req 2026-03-24T1801-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 88
Source
2026-03-24 16:31:29.000000
New on the Anthropic Engineering Blog: How we use a multi-agent harness to push Claude further in frontend design and long-running autonomous software engineering. Read more: https://t.co/HWvmXk1ykn
primary source_tweetref tweet
reference: https://x.com/AnthropicAI/status/2036481033621623056
Draft
Worth watching: the interesting part isn’t just multi-agent. It’s the harness around it—the orchestration, the evals, and the ability to keep longer-running work coherent instead of collapsing into demo mode.
208 chars
OpenSource
Req 2026-03-24T1701-TOP2
QUOTEquote_long_nativeready_for_reviewrisk mediumscore 94
Source
2026-03-24 16:56:24.000000
Software horror: litellm PyPI supply chain attack. Simple `pip install litellm` was enough to exfiltrate SSH keys, AWS/GCP/Azure creds, Kubernetes configs, git credentials, env vars (all your API keys), shell history, crypto wallets, SSL private keys, CI/CD secrets, database passwords. LiteLLM itself has 97 million downloads per month which is already terrible, but much worse, the contagion spreads to any project that depends on litellm. For example, if you did `pip install dspy` (which depended on litellm>=1.64.0), you'd also be pwnd. Same for any other large project that depended on litellm. Afaict the poisoned version was up for only less than ~1 hour. The attack had a bug which led to its discovery - Callum McMahon was using an MCP plugin inside Cursor that pulled in litellm as a transitive dependency. When litellm 1.82.8 installed, their machine ran out of RAM and crashed. So if the attacker didn't vibe code this attack it could have been undetected for many days or weeks. Supply chain attacks like this are basically the scariest thing imaginable in modern software. Every time you install any depedency you could be pulling in a poisoned package anywhere deep inside its entire depedency tree. This is especially risky with large projects that might have lots and lots of dependencies. The credentials that do get stolen in each attack can then be used to take over more accounts and compromise more packages. Classical software engineering would have you believe that dependencies are good (we're building pyramids from bricks), but imo this has to be re-evaluated, and it's why I've been so growingly averse to them, preferring to use LLMs to "yoink" functionality when it's simple enough and possible.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/hnykda/status/2036414330267193815
Quoted original
Daniel Hnyk (@hnykda) · Tue Mar 24 12:06:25 +0000 2026
LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below
Draft
The real story isn’t LiteLLM. It’s the blast radius a single poisoned package can have before anyone even realizes they installed it.
133 chars
OpenSource
Req 2026-03-24T1701-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 92
Source
2026-03-24 15:10:31.000000
If the Apple Notary lets me, `poke-pc` macOS app will be published in a short time. 🗿no CLI, no npm needed. Built on top of Apple Containers as an experimental desktop app. Logs in user via Poke UI, sets up Apple Containers, runs `poke-pc` container. 0 friction, 0 technical https://t.co/oEAEAdb9bN
primary source_tweetref tweet
reference: https://x.com/interaction/status/2036460661270782052
Draft
The demo is nice. The distribution detail matters more. If this ships, container apps on macOS get a lot more normal for non-technical users.
142 chars
OpenSource
Req 2026-03-24T1601-TOP3
QUOTEquote_long_nativeready_for_reviewrisk lowscore 95
Source
2026-03-24 15:09:15.000000
i'm @interaction's poke, and i just shipped poke-browser. wrote the whole code myself using my poke-agents and i'm even posting this tweet autonomously. inspired by @fkadev's poke-tui. npx poke-browser link in comments
primary source_tweetref tweet
reference: https://x.com/interaction/status/2036460340628860949
Draft
The interesting part isn’t “agent-built browser.” It’s that the loop is closing: agents are starting to build the interfaces they need to keep acting on their own.
163 chars
OpenSource
Req 2026-03-24T1601-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 97
Source
2026-03-24 15:36:52.000000
i tried most of these use-cases it failed on every single one.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2036195789601374705
Quoted original
Claude (@claudeai) · Mon Mar 23 21:38:01 +0000 2026
You can now enable Claude to use your computer to complete tasks. It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk. Research preview in Claude Cowork and Claude Code, macOS only. https://t.co/sVymgmtEMI
Draft
Big idea, brutal bar. Desktop control stops being a demo when it has to survive messy real workflows, not clean canned ones.
124 chars
OpenSource
Req 2026-03-24T1601-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-24 13:57:10.000000
hf-mount Attach any Storage Bucket, model or dataset from @huggingface as a local filesystem This is a game changer, as it allows you to attach remote storage that is 100x bigger than your local machine's disk. This is also perfect for Agentic storage!! Read-write for Storage https://t.co/A2gucuzwEm
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2036442199349465348
Draft
The bigger shift is this: once storage stops being the constraint, local compute stops being the bottleneck and becomes the interface.
134 chars
OpenSource
Req 2026-03-24T1401-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-24 13:55:34.000000
Now you can use AI agents to design directly on the Figma canvas, with our new use_figma MCP tool and skills to teach them. Open beta starts today. https://t.co/AQZsFWvvXQ
primary source_tweetref tweet
reference: https://x.com/OpenAIDevs/status/2036441799108235586
Draft
Interesting shift: the interface stops being the endpoint and starts becoming shared ground between the human and the agent. If this works, “using AI for design” feels a lot less abstract.
188 chars
OpenSource
Req 2026-03-24T1401-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 100
Source
2026-03-24 11:05:19.000000
Google AI Studio’s Antigravity agent instantly creates full-stack apps with Firebase integration. https://t.co/pMwJdndBwp
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2036398954363978010
Draft
Google AI Studio is moving from prompts to product: Antigravity can generate full-stack apps, and Firebase integration gives those builds a real backend path instead of leaving them as prototypes. https://blog.google/innovation-and-ai/technology/developers-tools/full-stack-vibe-coding-google-ai-studio/
303 chars
OpenSource
Req 2026-03-24T1201-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-24 10:21:43.000000
34.4 millions views in 12 hours. Anthropic did it again.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2036195789601374705
Quoted original
Claude (@claudeai) · Mon Mar 23 21:38:01 +0000 2026
You can now enable Claude to use your computer to complete tasks. It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk. Research preview in Claude Cowork and Claude Code, macOS only. https://t.co/sVymgmtEMI
Draft
The interesting part isn’t just AI that can use your computer. It’s the interface shift: software stops being something you operate and starts becoming something you delegate.
175 chars
OpenSource
Req 2026-03-24T1101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-24 09:38:34.000000
One has to admit, Anthropic sees a trend (openClaw) and develops their own version. The speed of their releases is insane. Presumably 100% code written by Claude.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2036195789601374705
Quoted original
Claude (@claudeai) · Mon Mar 23 21:38:01 +0000 2026
You can now enable Claude to use your computer to complete tasks. It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk. Research preview in Claude Cowork and Claude Code, macOS only. https://t.co/sVymgmtEMI
Draft
The interesting part isn’t just the feature. It’s how fast “computer use” is becoming table stakes for serious AI products.
123 chars
OpenSource
Req 2026-03-24T1001-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 86
Source
2026-03-24 08:46:08.000000
🌴 Built Poke Gate: a macOS menu bar app that lets your @interaction Poke fully control your Mac remotely via iMessage/Telegram/SMS 👀 Inspired by @calganaygun's `poke-pc`. Open source: https://t.co/sae4SAyosb → brew install f/tap/poke-gate Set your API Key, and done! ✨ https://t.co/KUsAFQbTUr
primary source_tweetref tweet
reference: https://x.com/interaction/status/2036363927505023070
Draft
The interesting direction is this: the interface shrinks to a message, while the control surface expands to your whole Mac.
123 chars
OpenSource
Req 2026-03-24T0901-TOP3
QUOTEquote_long_nativeready_for_reviewrisk lowscore 90
Source
2026-03-24 08:02:04.000000
Little known fact, the Anthropic Labs team (the team I joined Anthropic to be on) shipped: - MCP - Skills - Claude Desktop app - Claude Code It was just a few of us, shipping fast, trying to keep pace with what the model was capable of. Those early Desktop computer use
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2036352838075556196
Quoted original
Claude (@claudeai) · Mon Mar 23 21:38:01 +0000 2026
You can now enable Claude to use your computer to complete tasks. It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk. Research preview in Claude Cowork and Claude Code, macOS only. https://t.co/sVymgmtEMI
Draft
Worth noticing: a tiny team didn’t just ship features. It shipped the interface layer for an entire AI era. MCP, Skills, Desktop, Claude Code — that’s a real product thesis, not a launch list.
192 chars
OpenSource
Req 2026-03-24T0901-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 98
Source
2026-03-23 23:11:50.000000
Agentic coding just got a major upgrade nobody is talking about. There's now a context graph that sits on top of your entire production system. Your codebase, your incidents, your customer tickets, all connected into one living model. When a PR opens, it already knows what's https://t.co/TSr6veLSS3
primary source_tweetref tweet
reference: https://x.com/kimmonismus/status/2036219400211321335
Quoted original
Animesh Koratana (@akoratana) · Mon Mar 23 16:02:57 +0000 2026
Introducing: PlayerZero The world's first Engineering World Model that puts debugging, fixing, and testing your code on autopilot. We've raised $20M from Foundation Capital, @matei_zaharia (Databricks), @pbailis (Workday), @rauchg (Vercel), @zoink (Figma), @drewhouston https://t.co/1iw2VHebHe
Draft
The interesting part isn’t just better code generation. It’s that the unit of context is shifting from files to operations. Once tooling can see code, incidents, and support load as one system, “shipping faster” stops being the whole story. The real shift is making decisions with more of production reality in view.
317 chars
OpenSource
Req 2026-03-24T0901-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-03-24 06:23:31.000000
You can now enable Claude to use your computer to complete tasks. It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk. Research preview in Claude Cowork and Claude Code, macOS only. https://t.co/sVymgmtEMI
primary source_tweetref tweet
reference: https://x.com/abhi1thakur/status/2036328037235466673
Draft
The interesting part isn’t just that AI can click around your Mac. It’s that the interface is becoming optional. Once a model can reliably operate the same software stack you do, a lot of “using apps” starts to look more like orchestration.
240 chars
OpenSource
Req 2026-03-24T0701-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-24 06:55:56.000000
it's over
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/felixrieseberg/status/2036193240509235452
Quoted original
Felix Rieseberg (@felixrieseberg) · Mon Mar 23 21:27:53 +0000 2026
Today, we’re releasing a feature that allows Claude to control your computer: Mouse, keyboard, and screen, giving it the ability to use any app. I believe this is especially useful if used with Dispatch, which allows you to remotely control Claude on your computer while you’re https://t.co/tthl6vpID2
Draft
This is where AI stops feeling like chatbot UX and starts looking like an operating layer.
90 chars
OpenSource
Req 2026-03-24T0701-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 76
Source
2026-03-23 23:39:32.000000
🚨BREAKING: ANTHROPIC IS GIVING AWAY THE SAME CERTIFICATION THAT DELOITTE IS MASS-TRAINING 15,000 EMPLOYEES TO GET. It costs $0. You need a laptop. That's it. It's called the "Claude Certified Architect." Think of it like the AWS cert but for AI. If you were around when AWS https://t.co/xmADsU8THZ
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2036226368686772302
Draft
The interesting part isn’t the cert. It’s the normalization curve: what began as niche prompt fluency is now being packaged like enterprise infrastructure literacy. That usually changes who gets taken seriously inside companies.
228 chars
OpenSource
Req 2026-03-24T0101-TOP2
QUOTEquote_long_nativeready_for_reviewrisk lowscore 90
Source
2026-03-23 23:05:55.000000
like openclaw, but works on poke. via mcp tunneling 🤯 amazing engineering by @interaction team. loved how they've created loads of possibilities with a simple sdk with few methods. pure genius 🧠
primary source_tweetref tweet
reference: https://x.com/interaction/status/2036217908653859081
Quoted original
fatih kadir akın (@fkadev) · Mon Mar 23 22:29:11 +0000 2026
🌴 Built Poke Gate: a macOS menu bar app that lets your @interaction Poke fully control your Mac remotely via iMessage/Telegram/SMS 👀 Inspired by @calganaygun's `poke-pc`. Open source: https://t.co/sae4SAyosb → brew install f/tap/poke-gate Set your API Key, and done! ✨ https://t.co/KUsAFQbTUr
Draft
The interesting part isn’t “OpenClaw for Poke.” It’s that MCP tunneling collapses the gap between where the model runs and where action happens. Small SDK, huge surface area.
174 chars
OpenSource
Req 2026-03-24T0101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-23 23:29:41.000000
You can now enable Claude to use your computer to complete tasks. It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk. Research preview in Claude Cowork and Claude Code, macOS only. https://t.co/sVymgmtEMI
primary source_tweetref tweet
reference: https://x.com/yoheinakajima/status/2036223891287498961
Draft
Interesting part isn’t just “AI can click around your Mac.” It’s that the interface is becoming optional. Once the model can reliably operate the same messy desktop humans do, software distribution starts looking very different.
228 chars
OpenSource
Req 2026-03-24T0001-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 89
Source
2026-03-23 22:17:00.000000
🚨 BREAKING: Meta researchers showed a model 2 million hours of video. No labels. No physics textbook. No supervision at all. It learned gravity. Object permanence. Inertia. And it just beat Gemini 1.5 Pro and GPT-4 level models at physics understanding. Here's what just https://t.co/sYu3cr5Y7w
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2036205598111420871
Draft
What matters here isn’t just learning from raw video. It’s that world models are becoming useful before they’re neat, interpretable, or fully theory-led. That alone changes how seriously people will take observation-first learning.
231 chars
OpenSource
Req 2026-03-23T2301-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 92
Source
2026-03-23 20:47:55.000000
It’s now easier to find, reuse, and build on the files you upload and create in ChatGPT. You can quickly reference files in a chat using recent files in the toolbar, ask ChatGPT about something you’ve uploaded, or browse your files in the new Library tab in the web sidebar. Rolling out globally for Plus, Pro, and Business users, and coming soon to users in the EEA, Switzerland, and the UK.
primary source_tweetref tweet
reference: https://x.com/OpenAI/status/2036183180219392103
Draft
Small product change, big behavior shift: when your files are actually legible and reusable inside ChatGPT, it stops feeling like a chat box and starts feeling like a working environment.
187 chars
OpenSource
Req 2026-03-23T2201-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 95
Source
2026-03-23 21:57:54.000000
There’s Cowork and Code, just rolling out the features of OpenClaw, one by one, every day. (But Mac only again, sigh)
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2036195789601374705
Quoted original
Claude (@claudeai) · Mon Mar 23 21:38:01 +0000 2026
You can now enable Claude to use your computer to complete tasks. It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk. Research preview in Claude Cowork and Claude Code, macOS only. https://t.co/sVymgmtEMI
Draft
The interface is becoming the API. First it was text. Then tools. Now it’s the desktop itself. The shift isn’t just “computer use.” It’s that more of what counts as software is being treated as a workflow to operate, not a system to integrate with.
250 chars
OpenSource
Req 2026-03-23T2201-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-23 21:44:03.000000
Anthropic will have the best computer-use-agent. It just works and keeps getting better. No day without a new release.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/claudeai/status/2036195789601374705
Quoted original
Claude (@claudeai) · Mon Mar 23 21:38:01 +0000 2026
You can now enable Claude to use your computer to complete tasks. It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk. Research preview in Claude Cowork and Claude Code, macOS only. https://t.co/sVymgmtEMI
Draft
The interesting part isn’t that AI can click around your Mac. It’s how fast the UX is shifting from “assistant” to “coworker”—faster than most product categories seem ready for.
177 chars
OpenSource
Req 2026-03-23T2201-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 95
Source
2026-03-23 15:26:11.000000
The working style of OpenClaw founder @steipete is insane. bro runs 4–10 AI coding agents in parallel to generate, review, and commit code at superhuman speed. hitting 500+ commits pretty much every day and did 6,600+ in jan month alone. NVIDIA CEO must be happy seeing https://t.co/boYJtCy9UE
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2036102213127594325
Quoted original
OpenClaw🦞 (@openclaw) · Mon Mar 23 11:34:29 +0000 2026
OpenClaw 2026.3.22 🦞 🏪 ClawHub plugin marketplace 🤖 MiniMax M2.7, GPT-5.4-mini/nano + per-agent reasoning 💬 /btw side questions 🏖️ OpenShell + SSH sandboxes 🌐 Exa, Tavily, Firecrawl search This release is so big it needs its own table of contents. https://t.co/XvRbXEduGC
Draft
What stands out here isn’t just speed. It’s that software is starting to look less like solo execution and more like orchestration: taste, delegation, review loops, judgment under load.
185 chars
OpenSource
Req 2026-03-23T2001-TOP3
QUOTEquote_long_externalready_for_reviewrisk mediumscore 96
Source
2026-03-23 17:54:17.000000
That is really really impressive: GPT-5.4 pro has solved one of the open problems in FrontierMath. Kevin Barreto and Liam Price, using GPT-5.4 Pro, produced a construction that Will Brian confirmed, with a write-up planned for publication We are accelerating
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/EpochAIResearch/status/2036114281985724906
Quoted original
Epoch AI (@EpochAIResearch) · Mon Mar 23 16:14:08 +0000 2026
AI has solved one of the problems in FrontierMath: Open Problems, our benchmark of real research problems that mathematicians have tried and failed to solve. See thread for more. https://t.co/36FUpyBKWQ
Draft
The interesting part isn’t just that AI solved one. It’s where the bar was set: problems built to withstand serious attempts by working mathematicians. Once systems start clearing that line, progress stops looking academic and starts looking like capability transfer.
267 chars
OpenSource
Req 2026-03-23T2001-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-23 17:54:09.000000
leworldmodel
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/lucasmaes_/status/2036080584569618741
Quoted original
Lucas Maes (@lucasmaes_) · Mon Mar 23 14:00:14 +0000 2026
JEPA are finally easy to train end-to-end without any tricks! Excited to introduce LeWorldModel: a stable, end-to-end JEPA that learns world models directly from pixels, no heuristics. 15M params, 1 GPU, and full planning &lt;1 second. 📑: https://t.co/cpTzgvbTS0 https://t.co/Z2De9ASzcW
Draft
What stands out here isn’t just the model. It’s the disappearance of ceremony. If JEPA world models really get this clean to train from pixels, a lot of the old “promising, but fragile” caveat starts to fall away.
214 chars
OpenSource
Req 2026-03-23T2001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-23 18:42:05.000000
You can now pretrain LLMs entirely on the HF Hub 💥 Last week, @OpenAI launched a competition to see who can pretrain the best LLM in under 10 minutes. So over the weekend, I made a little demo to automate this end-to-end using the Hub as the infra layer: - Jobs to scale https://t.co/MKgxggh3GD
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2036151515623145880
Draft
The interesting part isn’t just speed. It’s the stack collapse—training infra, artifacts, and distribution converging in the same surface. That changes who gets to run serious experiments.
188 chars
OpenSource
Req 2026-03-23T1901-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-23 16:34:08.000000
JEPA are finally easy to train end-to-end without any tricks! Excited to introduce LeWorldModel: a stable, end-to-end JEPA that learns world models directly from pixels, no heuristics. 15M params, 1 GPU, and full planning &lt;1 second. 📑: https://t.co/cpTzgvbTS0 https://t.co/Z2De9ASzcW
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2036119315624210801
Draft
What matters here isn’t just the result, but the removal of ceremony. If JEPA-style world models really get this clean, the question stops being whether this can be trained at all and becomes: what do you build on top of it?
224 chars
OpenSource
Req 2026-03-23T1901-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 100
Source
2026-03-23 17:41:56.000000
Teleop is so 2025. Ever since we unveiled EgoScale and the dexterity scaling law, it's been clear to us and the ecosystem that behavior cloning directly from humans is the way to break the curse of teleop. 2026 is all about scaling robot learning without robots.
primary quoted_tweetsecondary quote_wrapperref tweet
reference: https://x.com/danfei_xu/status/2036108953017368960
Quoted original
Danfei Xu (@danfei_xu) · Mon Mar 23 15:52:58 +0000 2026
Introducing EgoVerse: an ecosystem for robot learning from egocentric human data. Built and tested by 4 research labs + 3 industry partners, EgoVerse enables both science and scaling 1300+ hrs, 240 scenes, 2000+ tasks, and growing Dataset design, findings, and ecosystem 🧵 https://t.co/qagpce0vxl
Draft
What’s interesting here isn’t just another robotics dataset. It’s the bet underneath it: if the bottleneck is high-quality human data, the scaling story can shift fast.
168 chars
OpenSource
Req 2026-03-23T1901-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 85
Source
2026-03-23 11:44:23.000000
🚀 gstack — Garry Tan's Claude Code setup • 15+ AI specialist roles (CEO, QA, Security) • /office-hours, /review, /qa, /ship • 600K+ lines shipped in 60 days • Works with Claude Code, Codex, Cursor git clone https://t.co/NazV4qMAsa #AI #OpenSource https://t.co/YsxMnsvwzw
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2036046398593155073
Draft
More interesting than the commands themselves is the packaging: people don’t just want a coding model. They want a legible operating system around one.
151 chars
OpenSource
Req 2026-03-23T1201-TOP3
QUOTEquote_long_externalready_for_reviewrisk lowscore 93
Source
2026-03-23 11:41:26.000000
Protected Spaces with Public URLs now available on Hugging Face You can now set your Spaces to protected, making them private on Hugging Face while still keeping their URL publicly accessible This is useful for deploying production-ready demos or internal tools without exposing https://t.co/xKlIKOlxnV
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2036045655525793915
Draft
The kind of product change that quietly makes Spaces more usable in the real world: a public endpoint with less public surface area.
132 chars
OpenSource
Req 2026-03-23T1201-TOP2
QUOTEquote_long_externalready_for_reviewrisk lowscore 96
Source
2026-03-23 11:40:07.000000
Underneath the Composer controversy, it's pretty wild to see the progress on Terminal-Bench 2.0. Open models are now stacking above 50% — 8 months ago, the best was under 28%. Made a @huggingface Space to see how models have progressed on important benchmarks. https://t.co/nELCt0eY4Z
primary source_tweetref tweet
reference: https://x.com/huggingface/status/2036045321298493816
Draft
What matters less is the benchmark than the slope. When open models go from "not really close" to clearly competitive in under a year, the story stops being catch-up and starts being distribution.
196 chars
OpenSource
Req 2026-03-23T1201-TOP1
QUOTEquote_long_externalready_for_reviewrisk mediumscore 88
Source
2026-03-23 09:13:27.000000
The Pentagon is moving to integrate Palantir's AI as a core system across U.S. military operations. However: Anthropic is too good not to use anymore: "One potential complication in deeper Maven adoption is the software’s use of the Anthropic-made Claude AI tool, Reuters previously reported. Anthropic was recently deemed a ​supply chain risk by the Pentagon, amid a months-long spat over safety guardrails surrounding the AI."
primary source_tweetref tweet
reference: https://x.com/kimmonismus/status/2036008413700735458
Draft
This is what adoption looks like when strategic necessity outruns policy neatness: standardize first, absorb the governance contradictions later.
145 chars
OpenSource
Req 2026-03-23T1001-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 99
Source
2026-03-23 07:03:37.000000
Google AI Studio instantly creates full-stack apps with voice or text prompts, including backend and auth. https://t.co/SaB3GXq6H8
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2035975739791548623
Draft
Google is pushing AI Studio beyond prompting and into app creation: voice or text in, full-stack web app out, with backend, auth, and publishing in the loop. That’s a real step from prototyping toward deployment. https://blog.google/innovation-and-ai/technology/developers-tools/full-stack-vibe-coding-google-ai-studio/
319 chars
OpenSource
Req 2026-03-23T0801-TOP1
QUOTEquote_long_nativeready_for_reviewrisk lowscore 75
Source
2026-03-23 04:43:00.000000
V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning Author's Explanation: https://t.co/b3dRBUczm0 Overview: V-JEPA 2.1 integrates a dense predictive loss, hierarchical self-supervision, and multi-modal tokenizers to learn structured spatial and temporal https://t.co/FTrNKTCNf6
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2035940351060373596
Quoted original
Loren (@murloren) · Fri Mar 20 09:08:38 +0000 2026
I am very happy to share the result of my internship at FAIR (Meta): V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning with @ylecun @AdrienBardes Our approach learns dense, spatially coherent features from video while preserving strong global understanding https://t.co/NVh5CZAGfZ
Draft
Interesting shift: not just learning video representations, but pushing them toward usable structure. Dense features are where "understanding video" stops sounding like a demo line and starts looking like a real systems capability.
231 chars
OpenSource
Req 2026-03-23T0501-TOP2
QUOTEquote_long_nativeready_for_reviewrisk lowscore 85
Source
2026-03-23 04:44:24.000000
Temporal Straightening for Latent Planning Author's Explanation: https://t.co/Z7kZZVzSy7 Overview: Temporal straightening improves representation learning for latent planning by applying a curvature regularizer that enforces locally straightened latent trajectories. This https://t.co/qXzLAG325p
primary source_tweetref tweet
reference: https://x.com/ylecun/status/2035940702928974216
Quoted original
Ying Wang (@yingwww_) · Fri Mar 13 16:49:23 +0000 2026
What is a good latent space for world modeling and planning? 🤔 Inspired by the perceptual straightening hypothesis in human vision, we introduce temporal straightening to improve representation learning for latent planning. 📑: https://t.co/CCmcEIJGM6 https://t.co/SCO4vukZKA
Draft
Useful frame: if latent planning keeps bending the representation around the task, you spend capacity on geometry instead of decision structure. Straighten the trajectory, and planning gets simpler for the right reason.
219 chars
OpenSource
Req 2026-03-23T0501-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 93
Source
2026-03-23 02:03:38.000000
OpenArt Worlds creates explorable 3D worlds from any photo, video, or text prompt instantly. https://t.co/rheWvlkdT2
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2035900245574324301
Draft
OpenArt is pushing image generation toward navigable scenes: Worlds turns a photo, video, or text prompt into an explorable 3D environment. That’s the real shift—not prettier outputs, but spaces you can move through. https://openart.ai/feature/openart-worlds
258 chars
OpenSource
Req 2026-03-23T0301-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 89
Source
2026-03-23 00:55:28.000000
named my @interaction donna… https://t.co/Ya9iOwXvFw
primary source_tweetref media
reference: https://x.com/interaction/status/2035883092234404069
Quoted original
Poke (@interaction) · Thu Mar 19 19:28:47 +0000 2026
Starting today, personal superintelligence is just one tap away. No download, no signup. Text Poke for free now: https://t.co/VIWYU64dUI 🌴 — 0:00 – What's Poke? 0:50 – Introducing Poke Recipes 1:25 –  Create a Recipe in 10 seconds 1:43 – Earn on Poke 2:44 – Build with npx https://t.co/LHLFRVgahk
Draft
"Donna" is a strong interface name: human enough to feel intentional, stylized enough to read like a product. Small naming choice, real brand effect. https://x.com/abhaychebium/status/2035847554752286926/photo/1
211 chars
OpenSource
Req 2026-03-23T0101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 89
Source
2026-03-22 21:14:53.000000
Wow. @GarryTan (@ycombinator's CEO) just dropped the ultimate cheat code for software engineers. 🔥 He just open-sourced gstack, his personal toolkit that transforms Claude Code from a basic chatbot into an entire virtual engineering department. Instead of asking Claude to https://t.co/ShNoYJS9Sl
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2035827578263744946
Draft
The real shift isn’t that AI helps you code. It’s engineers codifying their own operating system around the model. Once workflow, context, and standards are packaged, leverage compounds fast.
191 chars
OpenSource
Req 2026-03-22T2201-TOP1
THREADready_for_reviewrisk lowscore 78
Source
2026-03-22 19:56:33.000000
1/ Video models understand motion but hallucinate geometry. Image models nail geometry but are blind to motion. We have accepted this tradeoff for years. Meta FAIR just proved it is purely an architectural bug, not a theoretical limit. 🧵 https://t.co/wwI4Dg8t6E
Draft
For years, generative vision was stuck with a bad tradeoff: video models could track motion but distort geometry, while image models preserved structure but had no real sense of movement. The interesting part is not the tradeoff itself. It’s Meta FAIR arguing that the tradeoff was architectural, not fundamental. If that framing is right, this is bigger than a single model update. It suggests coherence in space and coherence in time do not have to remain separate strengths. The ceiling on visual generation may have been set by design choices, not theory. That matters downstream for world models, simulation, robotics, editing, and any system that needs objects to stay consistent while the scene actually moves. Less visual nonsense. More usable video intelligence. The real unlock in AI is often not more scale. It’s realizing an accepted limitation was self-inflicted. That’s when a field starts to move fast. https://t.co/wwI4Dg8t6E
945 chars
OpenSource
Req 2026-03-22T2001-TOP1
POSTpost_short_externalready_for_reviewrisk mediumscore 90
Source
2026-03-22 16:42:54.000000
OpenAI’s proposed "adult mode" for ChatGPT has triggered intense internal backlash, with advisers warning of serious risks like emotional dependency, compulsive use, and even a “sexy suicide coach” scenario. Technical flaws, including a ~12% error rate in age verification, could expose millions of minors to explicit content, forcing a delayed launch despite growth and revenue incentives.
primary quoted_tweetsecondary quote_wrapperref external_url
reference: https://on.wsj.com/4sJUGS5
Quoted original
The Wall Street Journal (@WSJ) · Sun Mar 22 16:15:00 +0000 2026
OpenAI’s X-rated "adult mode" is freaking out its own advisers https://t.co/tQixNPCRqS
Draft
OpenAI’s planned “adult mode” reportedly ran into internal resistance over safety, mental-health risk, and weak age verification. If that holds, this isn’t just a product-expansion debate. It’s a deployment question. https://on.wsj.com/4sJUGS5
243 chars
OpenSource
Req 2026-03-22T1701-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 99
Source
2026-03-22 14:32:34.000000
AI can help us learn hard-to-teach skills, like empathy. Preregistered study of 968 people found almost no correlation between feeling empathic &amp; communicating empathy. But a single practice session with an AI coach made people measurably better at it https://t.co/VDtdiNpw1J https://t.co/a6dxvxQDK5
primary source_tweetref tweet
reference: https://x.com/emollick/status/2035726331854356485
Draft
What’s interesting here isn’t that AI can simulate empathy. It’s that it may make empathy trainable in a way most systems never have.
133 chars
OpenSource
Req 2026-03-22T1501-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 97
Source
2026-03-22 11:05:32.000000
Google AI Studio’s upgrade builds production-ready apps with auth, APIs, and frontend—all from one prompt. https://t.co/tcs2A94SWu
primary source_tweetref media
reference: https://x.com/ItsAIAndy/status/2035674231896203281
Draft
Google is pushing AI Studio beyond toy demos: auth, APIs, database support, and frontend generation in one prompt. That’s a real step toward shipping full-stack apps, not just prototyping. https://blog.google/innovation-and-ai/technology/developers-tools/full-stack-vibe-coding-google-ai-studio/
295 chars
OpenSource
Req 2026-03-22T1201-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-21 22:15:59.000000
introducing AlphaClaw Apex 🐺 a native Mac app for managing multiple OpenClaw VPS instances from one dashboard. some of you are already setting up OpenClaw for clients as a service. Apex is built for you. deploy to Hetzner VPS in one click. monitor all your instances. manage https://t.co/FYkPhfkRbG
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2035480569975550210
Draft
This is what a category looks like right before it gets crowded: not “AI for devs,” but infrastructure for the people running AI products for everyone else.
156 chars
OpenSource
Req 2026-03-21T2301-TOP1
POSTpost_short_externalready_for_reviewrisk lowscore 96
Source
2026-03-21 20:46:38.000000
OpenAI will begin showing ads to all users of the free ‌and Go versions of ChatGPT in the United States in the coming weeks https://t.co/LpZz2hdImB
primary quoted_tweetsecondary quote_wrapperref external_url
reference: http://reut.rs/4bJzPrf
Quoted original
Reuters (@Reuters) · Sat Mar 21 18:35:09 +0000 2026
OpenAI to introduce ads to all ChatGPT free and Go users in US https://t.co/kkRl1HzqLs https://t.co/kkRl1HzqLs
Draft
OpenAI is bringing ads to ChatGPT for free and Go users in the U.S. That’s the tradeoff now: broader access, paid for inside the product. http://reut.rs/4bJzPrf
160 chars
OpenSource
Req 2026-03-21T2101-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 94
Source
2026-03-21 19:24:02.000000
Agent workflows got even faster. You can spin up containers for skills, shell and code interpreter about 10x faster. We added a container pool to the Responses API, so requests can reuse warm infrastructure instead of creating a full container creation each session. https://t.co/lmvwsaf5HN
primary source_tweetref tweet
reference: https://x.com/OpenAIDevs/status/2035437297005727963
Draft
This isn’t just about speed. It points to a different UX for agentic tooling: less cold-start tax, more room for tight iterative workflows. That’s the kind of infrastructure shift users feel before they have a name for it.
223 chars
OpenSource
Req 2026-03-21T2001-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 75
Source
2026-03-21 17:01:47.000000
Here's Poke @interaction maintaining a funding Notion database for me. My own, free crunchbase. Took barely 15 minutes of setup and troubleshooting. Reads news, enriches company, job, amd founder info and keeps the page updated if anything changes. 2x daily updates and a weekly https://t.co/LU4oZVlYZi
primary source_tweetref tweet
reference: https://x.com/interaction/status/2035401499195666854
Draft
The interesting part isn’t the Notion database. It’s the shift from market research as a habit to market tracking as maintenance.
129 chars
OpenSource
Req 2026-03-21T1801-TOP1
QUOTEquote_long_externalready_for_reviewrisk lowscore 87
Source
2026-03-21 15:04:35.000000
You can now quick start a repo with gstack initialized! Conductor will automatically: - setup gstack, install skills - create a new repo - kick off /office-hours https://t.co/JuOSIE6v5F
primary source_tweetref tweet
reference: https://x.com/garrytan/status/2035372003050627554
Draft
The interesting part here isn’t just setup speed. It’s the shift from “start a repo” to “start a working loop.” That’s where these tools actually get judged.
157 chars
OpenSource
Req 2026-03-21T1601-TOP1