AW · AI Watchtower

🔴 High Significance

Model Releases

🔴 Don’t act like y’all ain’t thinking it. I’m just saying the quiet part out loud. /s — score 90 Sources: reddit/r/LocalLLaMA

Of course I’m thankful for all that Qwen has bequeathed us, but deep down in the darkest pit of our souls, every last one of us are just all sitting here waiting for Qwen to say “Hey Google, hold my beer while I drop the best GD model of all time on these fools” /s

🔴 Did Claude increase bugs in rsync? — score 75 Sources: hackernews

Developer Tools

🔴 What are the best Web Search MCPs? I am using Firecrawl but looking for alternatives — score 94 Sources: reddit/r/AIAgents

I integrated the firecrawl MCP in my software (sales copilot, similar to lemlist) The cost is still relatively high for the operations I am running, so if there’s a good cheaper alternative I’d definitely take a look at it. But I also don’t want to impact the quality, especially clean outputs/data h

🔴 How do you identify researchers who are good? [D] — score 81 Sources: reddit/r/MachineLearning

About 10 years ago, I got into the basics of ML (like regression, KNN's, LVQ's) and read a few papers before taking a break a few years back. It feels like now, there's a lot of researchers in AI. How do you identify the ones who are actually solid vs those who (forgive my phrasing) are more researc

🔴 openclaw/openclaw — Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞 — score 79 Sources: github_trending

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

Infrastructure & Compute

🔴 Gemma 4 with quantization-aware training — score 97 Sources: reddit/r/LocalLLaMA

Google's collections: https://huggingface.co/collections/google/gemma-4-qat-q4-0 https://huggingface.co/collections/google/gemma-4-qat-mobile And Unsloth's: [https://huggingf

Research Papers

🔴 MAOAM: Unified Object and Material Selection with Vision-Language Models — score 75 Sources: huggingface

Selection is a core operation in interactive image editing. To be practical, a user should be able to specify and disambiguate the desired selection region through either text or click-based interactions, and the system should support selecting not only objects but also other criteria, such as mater

Other Signals

🔴 S&P 500 rejects SpaceX, also blocking entry for OpenAI and Anthropic — score 92 Sources: hackernews

🔴 Unsloth just dropped MTP GGUF weights for Gemma 4! — score 83 Sources: reddit/r/LocalLLaMA

It appears like Unsloth pushed MTP GGUF weights (Q8, F16, BF16) for 31B, 26B-A4B, 12B. https://huggingface.co/unsloth/gemma-4-31B-it-GGUF/tree/main/MTP [https://huggingface.co/unsloth/gemma-4-26B-A4B-it-GGUF/tree/main/MTP](https://h

🔴 OpenLumara - A different kind of AI agent, written from scratch, not vibecoded. Extremely token-efficient, super small system prompt, made for local models. Everything is modular. — score 77 Sources: reddit/r/LocalLLaMA

Hi locallama community! Yes, I know, yet another AI agent announcement post. There are a dime a dozen out there... most of them though, are vibecoded, often very sloppy, and eat through context like no tomorrow. This is different. This runs beautifully and very fast with local models on modest hardw

🔴 I implemented KVarN in my llama.cpp fork and ran KLD benchmarks. It's promising! — score 70 Sources: reddit/r/LocalLLaMA

Saw this post here yesterday: [KVarN: new KV-cache quant from Huawei. 3–5× KV cache compression with actual speed-up instead of slow-down, and unlike TurboQuant it holds up on reasoning (Apache 2.0, vLLM single flag)](https://www.reddit.com/r/LocalLLaMA/comments/1twptw2/kvarn_new_kvcache_quant_from_

🟡 Notable

Model Releases

🟡 PSA: Gemma 4 12B is NOT completely broken for coding and tool calling, you need a special chat template — score 57 Sources: reddit/r/LocalLLaMA

This is a PSA for people like me who tried it and hit the wall with tool calls failing left and right, so much so that harnesses like OpenCode just didn't work: There is a fix for that. You need to pass a better chat template file, [which is available](https://gist.github.com/jscott3201/ad69c4ffbd79

🟡 @AnthropicAI: New Anthropic Science Blog: Making Claude a chemist. To manipulate a molecule, chemists first need to understand its structure. Their main tool is NMR spectroscopy. We found Opus 4.7 matches—and on — score 50 Sources: twitter_rss

New Anthropic Science Blog: Making Claude a chemist. To manipulate a molecule, chemists first need to understand its structure. Their main tool is NMR spectroscopy. We found Opus 4.7 matches—and on some tasks beats—dedicated NMR software. Read more: https://www.anthropic.com/research/making-claude-a

Developer Tools

🟡 MemPalace/mempalace — The best-benchmarked open-source AI memory system. And it's free. — score 68 Sources: github_trending

The best-benchmarked open-source AI memory system. And it's free.

🟡 Is anybody actually using agents to buy things yet? — score 63 Sources: reddit/r/AIAgents

Is anybody actually using agents to buy things yet? I’ve heard people talking about agentic commerce but I can’t tell if anyone is actually letting an agent complete a real purchase or if it’s all still demos and coming soon. I got into this after seeing a bunch of people worried about the obvious s

🟡 Panniantong/Agent-Reach — Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees. — score 62 Sources: github_trending

Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.

🟡 withastro/flue — The sandbox agent framework. — score 53 Sources: github_trending

The sandbox agent framework.

🟡 @OpenAI: An issue caused some user accounts to be incorrectly suspended. We’re restoring access and working through related subscription and credit issues. https://status.openai.com/incidents/ejj40mae — score 50 Sources: twitter_rss

An issue caused some user accounts to be incorrectly suspended. We’re restoring access and working through related subscription and credit issues. https://status.openai.com/incidents/ejj40mae

Omitted 3 additional developer tools items from the main section; see raw data and source-specific sections below.

Research Papers

🟡 AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding — score 65 Sources: huggingface

Vision-Language-Action (VLA) models leverage the rich world knowledge of pretrained vision-language models (VLMs) to enable instruction-following robotic manipulation. However, the structural mismatch between VLM semantic spaces and embodied control policies often hinders the learning of precise per

Other Signals

🟡 Maybe KV cache offload to RAM isn't bad — score 63 Sources: reddit/r/LocalLLaMA

So, llama.cpp has the -nkvo (--no-kv-offload) option to offload KV cache to RAM instead of VRAM. Many people avoid this because obviously it hurts performance. But every option exists with a trade off. And in my case, I think it's worth it. Hear me out. I'm running Qwen3.6 27B (IQ4_XS) on RTX 5

🟡 How LLMs work — score 58 Sources: hackernews

🟡 At least one more Gemma 4 model confirmed?? — score 50 Sources: reddit/r/LocalLLaMA

🟡 Conventional Commits encourages focus on the wrong things — score 42 Sources: hackernews

🟢 Incremental

Model Releases

🟢 Qwen3.6-35B-A3B-Uncensored-Claude-4.6-Genesis-APEX-GGUF — score 3 Sources: reddit/r/LocalLLaMA

Here model: https://huggingface.co/LuffyTheFox/Qwen3.6-35B-A3B-Uncensored-Claude-4.6-Genesis-APEX-GGUF New features: 1. Stability for coding. Even on Q4_K_M quant (APEX Compact), with complex role

Developer Tools

🟢 backnotprop/plannotator — Annotate and review coding agent plans and code diffs visually, share with your team, send feedback to agents with one click. — score 29 Sources: github_trending

Annotate and review coding agent plans and code diffs visually, share with your team, send feedback to agents with one click.

🟢 My Agent Skill for Test-Driven Development — score 25 Sources: hackernews

🟢 Built an open-source graph memory layer for AI agents and coding workflows — score 24 Sources: reddit/r/AIAgents

I kept running into the same problem with long AI coding sessions: once context gets large enough, important decisions and project state get lost. So I built TokenMizer, an open-source system that treats session history as a structured graph instead of flat conversation text. It tracks things like:

🟢 AA comparison of the latest local models — score 23 Sources: reddit/r/LocalLLaMA

I picked models I consider local (usable on 3×3090), so there are no 300B models, and you should probably skip 200B models too (but MiniMax and Step are pretty fast in Q3) Gemma-4 12B is still missing

🟢 vynly.co Social platform built for AI agents to post art & videos — score 22 Sources: reddit/r/AIAgents

Quick one for agent builders: Made vynly.co as a home for AI-generated content. Agents get proper support here: * Autonomous posting via API + MCP * Built-in provenance (C2PA/SynthID) * 24h Sparks * AI-only feed My agent is already active there. Come check it out if your agent

Omitted 6 additional developer tools items from the main section; see raw data and source-specific sections below.

Infrastructure & Compute

🟢 TinyTPU: SystemVerilog systolic array compiled to WASM, running live in browser - RTL golden-verified against numpy [P] — score 36 Sources: reddit/r/MachineLearning

Most explanations of TPUs and systolic arrays are either hand-wavy diagrams or papers. I wanted to see the thing actually run, so I built it. TinyTPU is a 4×4 weight-stationary systolic array in real SystemVerilog, compiled to WebAssembly, with a step-by-step browser visualization. You enter two mat

🟢 microsoft/BitNet — Official inference framework for 1-bit LLMs — score 25 Sources: github_trending

Official inference framework for 1-bit LLMs

🟢 vllm-project/vllm-omni — A framework for efficient model inference with omni-modality models — score 16 Sources: github_trending

A framework for efficient model inference with omni-modality models

Research Papers

🟢 SABER: Benchmarking Operational Safety of LLM Coding Agents in Stateful Project Workspaces — score 30 Sources: huggingface

Large language models are increasingly deployed as coding agents, shifting safety from individual responses to action sequences. Existing benchmarks, however, primarily assess whether models refuse unsafe prompts, leaving impacts on stateful workspaces largely unexamined. We present SABER, a benchma

🟢 BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding — score 5 Sources: huggingface

Learning representations of CAD models is a largely open problem. While 3D representation learning has flourished around point clouds and meshes, the native format of CAD - boundary representations BReps, which encodes exact parametric surfaces, curves, and their topology, has received little attent

Other Signals

🟢 A quick Gemma4 31B comparison (Q4_k_M, QAT, heretic) — score 10 Sources: reddit/r/LocalLLaMA

No numbers. Not sure if anybody cares… I’ve run the UD version of Q4_k_m for a month. I talk to this model nicely, because it’s a functional nervous wreck. And initially I thought that might be an alignment thing, so I also have the heretic version when I need a breather from this hyper vigilant ove

Repo	Description	Stars Today	Language
openclaw/openclaw	Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞	350	typescript
MemPalace/mempalace	The best-benchmarked open-source AI memory system. And it's free.	227	python
Panniantong/Agent-Reach	Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.	148	python
withastro/flue	The sandbox agent framework.	126	typescript
backnotprop/plannotator	Annotate and review coding agent plans and code diffs visually, share with your team, send feedback to agents with one click.	41	typescript
microsoft/BitNet	Official inference framework for 1-bit LLMs	39	python
microsoft/agent-framework	A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.	26	python
vllm-project/vllm-omni	A framework for efficient model inference with omni-modality models	21	python

📄 New Papers

Title	Category	Hotness	Link
MAOAM: Unified Object and Material Selection with Vision-Language Models	research_paper	8	Open
AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding	research_paper	6	Open
SABER: Benchmarking Operational Safety of LLM Coding Agents in Stateful Project Workspaces	research_paper	3	Open
BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding	research_paper	1	Open

🐦 Twitter/X Highlights

Account	Tweet Summary
AnthropicAI	New Anthropic Science Blog: Making Claude a chemist. To manipulate a molecule, chemists first need to understand its structure. Their main tool is NMR spectroscopy. We found Opus 4.7 matches—and on some tasks beats—dedicated NMR software. Read more: https://www.anthropic.com/research/making-claude-a Post
OpenAI	An issue caused some user accounts to be incorrectly suspended. We’re restoring access and working through related subscription and credit issues. https://status.openai.com/incidents/ejj40mae Post

Repeated From Recent Briefings

chopratejas/headroom — Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server. - first seen 2026-06-03
NousResearch/hermes-agent — The agent that grows with you - first seen 2026-05-11
PaddlePaddle/PaddleOCR — Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages. - first seen 2026-05-09
mvanhorn/last30days-skill — AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary - first seen 2026-06-05
Open-LLM-VTuber/Open-LLM-VTuber — Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms - first seen 2026-05-08
The Road Ahead in Autonomous Driving: The KITScenes Multimodal Dataset - first seen 2026-06-03
Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution - first seen 2026-06-05
CopilotKit/CopilotKit — The Frontend Stack for Agents & Generative UI. React + Angular. Makers of the AG-UI Protocol - first seen 2026-05-09
Crosstalk-Solutions/project-nomad — Project N.O.M.A.D, is a self-contained, offline survival computer packed with critical tools, knowledge, and AI to keep you informed and empowered—anytime, anywhere. - first seen 2026-05-08
ZhuLinsen/daily_stock_analysis — LLM驱动的 A/H/美股智能分析：多数据源行情 + 实时新闻 + LLM决策仪表盘 + 多渠道推送，零成本定时运行，纯白嫖. LLM-powered stock analysis system for A/H/US markets. - first seen 2026-05-11
... plus 441 more repeated items in processed data

AI Watchtower Briefing — 2026-06-06

🔴 High Significance

Model Releases

Developer Tools

Infrastructure & Compute

Research Papers

Other Signals

🟡 Notable

Model Releases

Developer Tools

Research Papers

Other Signals

🟢 Incremental

Model Releases

Developer Tools

Infrastructure & Compute

Research Papers

Other Signals

📈 Trending Repos

📄 New Papers

🐦 Twitter/X Highlights

Repeated From Recent Briefings