๐Ÿ”ด High Significance

Model Releases

๐Ÿ”ด Gemini 3.5 Flash โ€” score 85 Sources: hackernews

๐Ÿ”ด Do you guys actually think AI agents can replace people for bigger tasks anytime soon? โ€” score 72 Sources: reddit/r/AIAgents

Not talking about small stuff like summarizing notes or drafting emails. I mean real work: * managing projects * handling operations * coordinating across tools * doing research end-to-end * dealing with messy real-world situations Because honestly my experience has been all over the place lol Tools

Developer Tools

๐Ÿ”ด Agentic Payments: How AI Agents Are Becoming New Players in the Payments Market โ€” score 89 Sources: reddit/r/AIAgents

๐Ÿ”ด got my first "rm -rf /" today โ€” score 82 Sources: reddit/r/LocalLLaMA

Agent decided to test if harmful command block worked by issuing a rm -rf / Thankfully it worked so only damage was a mild heart attack. I implemented a sandbox immediately afterwards. EDIT: for those wondering, I was implementing a bash command whitelist and also bubblewrap for isolation. I did the

๐Ÿ”ด Intel's Crescent Island PCB Leaks, Showing a Massive Xe3P GPU, 16-Pin Connector, 160GB LPDDR5X as Intel Sidesteps the HBM Shortage โ€” score 75 Sources: reddit/r/LocalLLaMA

Upcoming Intel Xe3P data center GPU with 20 8GBLPDDR5X modules for a total of 160GB, bypassing HBM shortages. Assuming a 32-bit interface, that's a 640-bit wide memory interface, or 10 channel memory interface if converted to the 64-bit wide desktop equivalent. At 8800-9500MT, that's a 704-760GB/s m

๐Ÿ”ด Show HN: Forge โ€“ Guardrails take an 8B model from 53% to 99% on agentic tasks โ€” score 75 Sources: hackernews

๐Ÿ”ด Alishahryar1/free-claude-code โ€” Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported) โ€” score 72 Sources: github_trending

Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)

Enterprise Adoption

๐Ÿ”ด The harmless prompt injection that leaked our system architecture โ€” score 89 Sources: reddit/r/AIAgents

Model cheerfully listed every internal API endpoint, database schema, integration paths, third party service names, even the staging environment urls. Nothing flagged as harmful by our safety layer. No toxic language, attempts to bypass etc. Just a helpful AI being too helpful. The request didn't tr

Research Papers

๐Ÿ”ด When Vision Speaks for Sound โ€” score 95 Sources: huggingface

Despite rapid progress in video-capable MLLMs, we find that their apparent audio understanding in videos is often vision-driven: models rely on visual cues to infer or hallucinate acoustic information, rather than verifying the audio stream. This issue appears across both state-of-the-art open-sourc

๐Ÿ”ด PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Dataset โ€” score 85 Sources: huggingface

Text-to-Image (T2I) models have recently seen notable progress around 1K and 2K resolution. With the extreme desire for better visual experience and the rapid development of imaging technology, the demand for Ultra-High-Resolution (UHR) image generation has grown significantly. However, UHR image ge

Other Signals

๐Ÿ”ด Iโ€™ve joined Anthropic โ€” score 95 Sources: hackernews

๐Ÿ”ด bytedance released an open source model that attempts to do just about anything with only 3b parameters โ€” score 89 Sources: reddit/r/LocalLLaMA

EDIT: working link https://huggingface.co/bytedance-research/Lance Lance is a lightweight native unified multimodal model that supports image and video understanding, generation, and editing within a single framework. * *Efficient at 3B scale.

๐ŸŸก Notable

Model Releases

๐ŸŸก LM Studio finally added support for MTP Speculative Decoding โ€” score 68 Sources: reddit/r/LocalLLaMA

https://preview.redd.it/1uuzjm0ll72h1.png?width=923&format=png&auto=webp&s=1af7d7594be1e08ff7ad6797e2bc53e9410769a3 update to 0.4.14 Build 2 (Beta) and make sure your llama.cpp engine is 2.15.0 https://preview.redd.it/x0vdwjb3n72h1.png?width=742&format=png&auto=webp&s=6367de4

๐ŸŸก Co-Scientist (Nature 2026-05-19): 5+1 Gemini agents, tournament-of-ideas, prod โ€” score 61 Sources: reddit/r/AIAgents

DeepMind published a Co-Scientist in Nature yesterday. It's a multi-agent system on Gemini with five role-specialised agents โ€” Generation, Reflection, Ranking, Evolution, Meta-review โ€” orchestrated by a Supervisor agent that breaks down high-level research goals into executable steps and coordinates

๐ŸŸก @OpenAI: People are generating over 1.5 billion images a week in ChatGPT. Researcher @kenjihata joins Product lead @adele__li and host @AndrewMayne to explore the new use cases and trends emerging since the l โ€” score 60 Sources: twitter_rss

People are generating over 1.5 billion images a week in ChatGPT. Researcher @kenjihata joins Product lead @adele__li and host @AndrewMayne to explore the new use cases and trends emerging since the launch of Images 2.0.

๐ŸŸก @OpenAI: Introducing OpenAI Guaranteed Capacity: a new offering that enables customers to guarantee long-term access to OpenAI compute. Weโ€™ve made long-term investments in infrastructure, partnerships, and ca โ€” score 60 Sources: twitter_rss

Introducing OpenAI Guaranteed Capacity: a new offering that enables customers to guarantee long-term access to OpenAI compute. Weโ€™ve made long-term investments in infrastructure, partnerships, and capacity planning to help customers scale reliably. Now, Guaranteed Capacity helps customers plan ahead

๐ŸŸก Carbon: Decoding the Language of Life โ€” score 54 Sources: reddit/r/LocalLLaMA

https://preview.redd.it/rajj11v7j42h1.png?width=1744&format=png&auto=webp&s=72381de22a9bac4b30a59498d549bb09df075df3 Hey, it's loubna from Hugging Face. Very happy to share our latest release: Carbon ๐Ÿงฌ, a family of open DNA foundation models. Carbon-3B matches the current SOTA (Evo2-7B)

Omitted 6 additional model releases items from the main section; see raw data and source-specific sections below.

Developer Tools

๐ŸŸก Remove-AI-Watermarks โ€“ CLI and library for removing AI watermarks from images โ€” score 65 Sources: hackernews

๐ŸŸก OpenAI Adopts Google's SynthID Watermark for AI Images with Verification Tool โ€” score 62 Sources: hackernews ยท lab_blog/OpenAI

OpenAI advances AI content provenance with Content Credentials, SynthID, and a verification tool to help people identify and trust AI-generated media.

๐ŸŸก All fundamental knowledge in ML Course by Andrew NG that I noted and create into a repo github [R] โ€” score 56 Sources: reddit/r/MachineLearning

https://preview.redd.it/mikhasjiq32h1.png?width=572&format=png&auto=webp&s=4c053200dbd9852bebf083550e2144b31579d497 https://preview.redd.it/bay5r3njq32h1.png?width=575&format=png&auto=webp&s=2823db3d6bc534ef00330528a200cba2aca1c5d3 https://preview.redd.it/dm40ntdkq32h1.png?wi

๐ŸŸก alirezarezvani/claude-skills โ€” 313+ Claude Code skills & agent skills & plugins for Claude Code, Codex, Gemini CLI, Cursor, and 8 more coding agents โ€” engineering, marketing, product, compliance, C-level advisory, research, business operations, commercial & finance, and your daily productivity skills. โ€” score 52 Sources: github_trending

313+ Claude Code skills & agent skills & plugins for Claude Code, Codex, Gemini CLI, Cursor, and 8 more coding agents โ€” engineering, marketing, product, compliance, C-level advisory, research, business operations, commercial & finance, and your daily productivity skills.

๐ŸŸก Got my agent to audit MCP servers for trust issues .. how do you handle it? โ€” score 44 Sources: reddit/r/AIAgents

Got my agent to audit MCP servers for trust issues (credential exposure, permission scope, data isolation). Here's what 20 popular servers scored: โ€ข docker-mcp: 18/100 โ€” credential exposure across all operations โ€ข Fetch: 84/100 โ€” clean but limited scope The MCP ecosystem is growing fast but there's

Omitted 1 additional developer tools items from the main section; see raw data and source-specific sections below.

Infrastructure & Compute

๐ŸŸก unslothai/unsloth โ€” Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally. โ€” score 50 Sources: github_trending

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

๐ŸŸก The next phase of OpenAIโ€™s Education for Countries โ€” score 50 Sources: lab_blog/OpenAI

OpenAI advances Education for Countries, expanding AI adoption in schools with new partnerships, teacher training, and tools to improve global learning outcomes.

๐ŸŸก Google AI Edge Gallery v1.0.13 & v1.0.14 updates: Gemma 4 Multi-Token Prediction, Pixel TPU support, experimental MCP, new skills, now saves chat history โ€” score 46 Sources: reddit/r/LocalLLaMA

Research Papers

๐ŸŸก TideGS: Scalable Training of Over One Billion 3D Gaussian Splatting Primitives via Out-of-Core Optimization โ€” score 65 Sources: huggingface

Training 3D Gaussian Splatting (3DGS) at billion-primitive scale is fundamentally memory-bound: each Gaussian primitive carries a large attribute vector, and the aggregate parameter table quickly exceeds GPU capacity, limiting prior systems to tens of millions of Gaussians on commodity single-GPU ha

๐ŸŸก SAGA: A Sequence-Adaptive Generative Architecture for Multi-Horizon Probabilistic Forecasting with Adaptive Temporal Conformal Prediction โ€” score 58 Sources: huggingface ยท arxiv/cs.LG

Microsimulation models used by ministries of finance and central banks rely on parametric processes for lifetime earnings that capture only first and second moments of the conditional distribution and miss long-range nonlinear structure. We propose SAGA, a decoder-only transformer for irregular tabu

๐ŸŸก Omni-DuplexEval: Evaluating Real-time Duplex Omni-modal Interaction โ€” score 45 Sources: huggingface

Real-time duplex interaction is essential for multimodal AI systems operating in real-world scenarios, where models must continuously process streaming inputs and respond at appropriate moments. However, most existing multimodal large language models (MLLMs) are evaluated in offline settings, where

๐ŸŸก Where Does Authorship Signal Emerge in Encoder-Based Language Models? โ€” score 42 Sources: huggingface ยท arxiv/cs.CL

Authorship attribution models fine-tuned with the same pretrained encoder, data, and loss can differ four-fold in performance depending only on their scoring mechanism. We use mechanistic interpretability tools to explain this gap. Stylistic features such as word length, punctuation density, and fun

๐ŸŸก CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning โ€” score 42 Sources: huggingface ยท arxiv/cs.AI

Chain-of-thought (CoT) is a standard approach for eliciting reasoning capabilities from large language models (LLMs). However, the common CoT paradigm treats thinking as a prerequisite for answering, which can delay access to plausible answers and incur unnecessary token costs even when the model is

Other Signals

๐ŸŸก 48GB VRAM users, what are your daily drivers? Do you wish you had more VRAM? What would you run if you did? โ€” score 61 Sources: reddit/r/LocalLLaMA

Iโ€™m upgrading from 32 to 48 soon and am excited but Iโ€™m curious what yโ€™all run!

๐ŸŸก ICML Proceedings-only [D] โ€” score 44 Sources: reddit/r/MachineLearning

For proceedings-only papers, do we need to make a poster and submit it to the portal? Has anyone asked this question to ICML Program Chair?

๐ŸŸข Incremental

Model Releases

๐ŸŸข Qwen3.7 Max scored by Artificial Analysis, 27B/35B waiting room โ€” score 39 Sources: reddit/r/LocalLLaMA

https://preview.redd.it/42ak5qmus82h1.png?width=1133&format=png&auto=webp&s=744ea3dfc06c83d0c4d8aa128c39b3238b17d7be Qwen 3.7 Max sitting at 5th, pretty much on par with GPT 5.4 (xhigh) and a notch above the just released Gemini 3.5 Flash. On the other end, we see DSV4 Flash and Qwen3.6

๐ŸŸข Gemini CLI will stop working from June 18, 2026 โ€” score 35 Sources: hackernews

๐ŸŸข Letโ€™s talk quants of Gemma and Qwen - 16 vs Q8 vs Q4 - any experiences? โ€” score 18 Sources: reddit/r/LocalLLaMA

Some people say theyโ€™d never go under Q8, and others say they find Q3 acceptable! Whatโ€™s your take?

๐ŸŸข Gemma 4 MTP with LlamaCPP โ€” score 4 Sources: reddit/r/LocalLLaMA

I am running Gemma 4 31B for a project using LlamaCPP. There is no integrated main model + MTP drafter GGUF. And from what I can tell, LlamaCPP was updated to not accept a separate MTP drafter GGUF but instead to use a combined GGUF for main+drafter. So how can I use Gemma 4 31B with MTP on LlamaCPP

Developer Tools

๐ŸŸข New SOTA 1B model? HRM-text โ€” score 32 Sources: reddit/r/LocalLLaMA

Saw this video by them. Seems interesting but Tbh the benchmarks seem too good to be true. I'm not super knowledgeable on how models think so can anyone more knowledgeable explain what exactly is happening. And it's pros and cons? GitHub: https: //github.com/sapientinc/HRM-Text Hugging face: https:/

๐ŸŸข Machine Learning on Spherical Manifold [R] โ€” score 31 Sources: reddit/r/MachineLearning

Hi, I'm interested in geometric deep learning (due to Michael M. Bronstein's book and Maurice Weiler's PhD thesis), and in order not to write projects to nowhere, I decided to keep a technical blog. I started with a short note about machine learning on spherical manifolds, but it's a pretty simple t

๐ŸŸข HanaokaYuzu/Gemini-API โ€” โœจ Reverse-engineered Python API for Google Gemini web app โ€” score 22 Sources: github_trending

โœจ Reverse-engineered Python API for Google Gemini web app

๐ŸŸข dmtrKovalenko/fff โ€” The fastest and the most accurate file search toolkit for AI agents, Neovim, Rust, C, and NodeJS โ€” score 22 Sources: github_trending

The fastest and the most accurate file search toolkit for AI agents, Neovim, Rust, C, and NodeJS

๐ŸŸข [ECCV 2026] No modified date next to reviews [D] โ€” score 19 Sources: reddit/r/MachineLearning

On Openreview, you can see modified date next to the review. This modified date should be recent (anything 12th May or newer) which means that reviewer gave a final justification and may have increased their score or kept the same score. In either case, it means they read the rebuttal and justified

Omitted 5 additional developer tools items from the main section; see raw data and source-specific sections below.

Infrastructure & Compute

๐ŸŸข Michael-A-Kuykendall/shimmy โ€” โšก Python-free Rust inference server โ€” OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever. โ€” score 36 Sources: github_trending

โšก Python-free Rust inference server โ€” OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.

๐ŸŸข Running DeepSeek-V4 locally with 4x legacy RTX 2080 Ti ($2k budget setup). Custom Turing kernels, W8A8 quantization, and 255 prefill tok/s! โ€” score 25 Sources: reddit/r/LocalLLaMA

Hey r/DeepSeek, Who says we need an H100 cluster or the latest expensive GPUs to run frontier MoE models? I wanted to see how far we could push a single node of consumer legacy hardware, so we spent less than $2,500 total to build a budget machine that successfully runs DeepSeek-V4-Flash (284B t

Other Signals

๐ŸŸข Growing Neural Cellular Automata โ€” score 25 Sources: hackernews

๐ŸŸข authentication/sesh timeouts in multi step browser agents โ€” score 17 Sources: reddit/r/AIAgents

hey guys, building a custom multi-step agent atm that needs to navigate a bunch of different vendor sites to scrape data and pull invoices. the problem isn't the actual navigation (using standard gpt-4o calls for that), it's the absolute mess of handling weird login flows, random 2FA prompts, and ag

๐ŸŸข Infomaniak transitions to a foundation model to protect user data privacy โ€” score 15 Sources: hackernews

๐ŸŸข Show HN: The AI Quant Desk for Onchain Finance โ€” score 5 Sources: hackernews

RepoDescriptionStars TodayLanguage
Alishahryar1/free-claude-codeUse claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)563python
alirezarezvani/claude-skills313+ Claude Code skills & agent skills & plugins for Claude Code, Codex, Gemini CLI, Cursor, and 8 more coding agents โ€” engineering, marketing, product, compliance, C-level advisory, research, business operations, commercial & finance, and your daily productivity skills.157python
unslothai/unslothUnsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.156python
Michael-A-Kuykendall/shimmyโšก Python-free Rust inference server โ€” OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.108rust
HanaokaYuzu/Gemini-APIโœจ Reverse-engineered Python API for Google Gemini web app59python
dmtrKovalenko/fffThe fastest and the most accurate file search toolkit for AI agents, Neovim, Rust, C, and NodeJS59rust
screenpipe/screenpipeYC (S26) | Give AI the ability to live your experience. Records everything you do, say, hear 24/7, local, private, secure29rust

๐Ÿ“„ New Papers

TitleCategoryHotnessLink
When Vision Speaks for Soundresearch_paper40Open
PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Datasetresearch_paper6Open
TideGS: Scalable Training of Over One Billion 3D Gaussian Splatting Primitives via Out-of-Core Optimizationresearch_paper3Open
SAGA: A Sequence-Adaptive Generative Architecture for Multi-Horizon Probabilistic Forecasting with Adaptive Temporal Conformal Predictionresearch_paper2Open
Position: Let's Develop Data Probes to Fundamentally Understand How Data Affects LLM Performancecs.AI0Open
Operationalizing Document AI: A Microservice Architecture for OCR and LLM Pipelines in Productioncs.AI0Open
Evaluating the Utility of Personal Health Records in Personalized Health AIcs.AI0Open
Learn-by-Wire Training Control Governance: Bounded Autonomous Training Under Stress for Stability and Efficiencycs.AI0Open
AgentNLQ: A General-Purpose Agent for Natural Language to SQLcs.AI0Open
KAN-MLP-Mixer: A comprehensive investigation of the usage of Kolmogorov-Arnold Networks (KANs) for improving IMU-based Human Activity Recognitioncs.AI0Open
Trustworthy Agent Network: Trust in Agent Networks Must Be Baked In, Not Bolted Oncs.AI0Open
Interference-Aware Multi-Task Unlearningcs.AI0Open
Embedding by Elicitation: Dynamic Representations for Bayesian Optimization of System Promptscs.AI0Open
DecisionBench: A Benchmark for Emergent Delegation in Long-Horizon Agentic Workflowscs.AI0Open
POLAR-Bench: A Diagnostic Benchmark for Privacy-Utility Trade-offs in LLM Agentscs.AI0Open

๐Ÿข Lab Blog Posts

๐Ÿฆ Twitter/X Highlights

AccountTweet Summary
OpenAIPeople are generating over 1.5 billion images a week in ChatGPT. Researcher @kenjihata joins Product lead @adele__li and host @AndrewMayne to explore the new use cases and trends emerging since the launch of Images 2.0. Post
OpenAIIntroducing OpenAI Guaranteed Capacity: a new offering that enables customers to guarantee long-term access to OpenAI compute. Weโ€™ve made long-term investments in infrastructure, partnerships, and capacity planning to help customers scale reliably. Now, Guaranteed Capacity helps customers plan ahead Post
AnthropicAIOver the past few months, we've been holding dialogues with scholars, philosophers, clergy, and ethicists on the questions AI raisesโ€”starting with how good character forms. Read more about how weโ€™re widening the conversation on frontier AI: https://www.anthropic.com/news/widening-conversation-ai Post
GoogleDeepMindBuild your next story with Gemini Omni. Post
GoogleDeepMindGemini 3.5 Flash ๐Ÿค @Antigravity Watch how the model deploys multiple subagents to design and build an entire city. Post
xaiStarting today, use your Grok or X Premium subscription in @openclaw. Chat with your agent, generate images and videos, or search for X posts. http://x.ai/news/grok-openclaw Post

Repeated From Recent Briefings