AI Watchtower

🔴 High Significance

Model Releases

🔴 I have DeepSeek V4 Pro at home — score 88 Sources: reddit/r/LocalLLaMA

Just wanted to share that I used u/LegacyRemaster's slightly modified (Q4_K_M conversion support) DeepSeek V4 CUDA repo (based on u/antirez's work) to convert and run Q4_K_M [De

🔴 How I set up an AI agent to handle invoicing, bill pay, and expense tracking through my bank via MCP — score 83 Sources: reddit/r/AIAgents

I run a B2B lead gen agency and was spending hours a week on invoicing clients, paying contractors, tracking expenses, and bookkeeping. I wanted to automate the financial operations side so I could focus on actual client work. Here's how I set it up and what I learned. The setup: I use Meow for banki

Developer Tools

🔴 NousResearch/hermes-agent — The agent that grows with you — score 99 Sources: github_trending

🔴 Tokens are not positively correlated with Quality and should not be a success metric — score 94 Sources: reddit/r/AIAgents

I think Anthropic and OpenAI have successfully turned token consumption into a status symbol. Like it’s somehow a metric of productivity. I’ve been building agents since 2022 for my own projects, SMBs, and government clients. I’ve been using LangChain since Harrison was running the Discord and the c

🔴 anthropics/skills — Public repository for Agent Skills — score 82 Sources: github_trending

🔴 yikart/AiToEarn — Let's use AI to Earn! — score 77 Sources: github_trending

🔴 AI meeting assistants make more sense once you use them as agent input — score 72 Sources: reddit/r/AIAgents

I used to think AI meeting tools were mostly just smart note takers, but lately I’ve been looking at them differently. What’s been more useful for me is having meeting context automatically saved and searchable later: transcripts, summaries, action items, and decisions, all in one place. I’ve been using

Research Papers

🔴 MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation — score 95 Sources: huggingface

With the rise of online dance-video platforms and rapid advances in AI-generated content (AIGC), music-driven dance generation has emerged as a compelling research direction. Despite substantial progress in related domains such as music-driven 3D dance generation, pose-driven image animation, and au

Other Signals

🔴 Local AI needs to be the norm — score 88 Sources: hackernews

🔴 OpenClaw is trending down and will disappear soon — score 81 Sources: reddit/r/LocalLLaMA

🔴 PhD students in ML, how many hours on average do you work? [D] — score 81 Sources: reddit/r/MachineLearning

I generally work around 9–10 hours a day, but not contiguously. I can usually carve out a dedicated chunk of time in the morning, take lab or project meetings in the afternoon, and block out around 6–8 PM for commute, exercise, socializing, and dinner. I also get more work done in the evening, since

🔴 Getting a feel for how fast X tokens/second really is. — score 76 Sources: reddit/r/LocalLLaMA

I love following all your adventures with local LLM setups. Quality and size of the models are important, but so is performance. Numbers don't really convey the experienced speed well, however. If someone claims they run Qwen 3.6-27B at 21 tokens/second, how fast is that? Is 10 tokens/second unusabl
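One way to get a feel for a tokens/second figure is to convert it into wall-clock time and an approximate reading speed. The sketch below is illustrative only: the function names are hypothetical, and the default of 0.75 words per token is a common rule of thumb for English text, not a number taken from the post.

```python
def seconds_to_generate(num_tokens: int, tokens_per_second: float) -> float:
    """Wall-clock seconds needed to stream num_tokens at a steady rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


def approx_words_per_minute(tokens_per_second: float,
                            words_per_token: float = 0.75) -> float:
    """Rough words-per-minute equivalent.

    words_per_token is an assumed ratio (rule of thumb for English);
    real values depend on the tokenizer and the text.
    """
    return tokens_per_second * words_per_token * 60


if __name__ == "__main__":
    # Compare the two rates mentioned in the excerpt for a 500-token reply.
    for rate in (10, 21):
        secs = seconds_to_generate(500, rate)
        wpm = approx_words_per_minute(rate)
        print(f"{rate} tok/s: 500 tokens in {secs:.1f} s, about {wpm:.0f} words/min")
```

Under these assumptions, the 21 tokens/second quoted in the post works out to roughly 24 seconds for a 500-token reply.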

🔴 Running Qwen3.6 35b a3b on 8gb vram and 32gb ram ~190k context — score 73 Sources: reddit/r/LocalLLaMA

If anyone is looking for a good high-speed setup with ~190k context, this config has been working insanely well for me. I’m using my laptop as a server over Tailscale. Installed Linux on it and running:
- Qwen3.6 35B A3B
- RTX 4060 8GB VRAM
- 32GB DDR5 5600MHz RAM
- Q5 quant models

Current mode

🟡 Notable

Model Releases

🟡 Any implementations similar to D4RT? [D] — score 56 Sources: reddit/r/MachineLearning

DeepMind released a paper on D4RT at the start of this year which crucially enabled a “4D” understanding of the world via structure from motion and generating:
1. Point cloud reconstruction from 2D videos (not static scenes)
2. Camera pose estimation

You could pass in a video of a dog walking on a b

🟡 Put my 4 years of SEO experience into a Claude skill so you don't have to figure it out yourself — score 56 Sources: reddit/r/AIAgents

I condensed my SEO experience into a Claude Code skill that actually does keyword research and writes articles the right way, and open-sourced it. Most AI writing tools I came across gave really shallow output. They go straight from keyword to article with no research in between. No competitor analy

🟡 ExLlamaV3 Major Updates! — score 42 Sources: reddit/r/LocalLLaMA

Turboderp has been on an absolute tear recently, in the endless battle to cram new llamas into smaller, faster boxes. We started off last month with the release of [gemma 4 support](https://github.com/turboderp-org/exllamav3/releases/tag/v0

Developer Tools

🟡 open-webui/open-webui — User-friendly AI Interface (Supports Ollama, OpenAI API, ...) — score 64 Sources: github_trending

🟡 An AI coding agent, used to write code, needs to reduce your maintenance costs — score 62 Sources: hackernews

🟡 How are you operating local AI agents after the first demo works? — score 56 Sources: reddit/r/AIAgents

I’m curious how other people here are handling the operational side of local/self-hosted agents. The demo phase is usually fun: wire a model to tools, get it to use a browser or files or an MCP server, watch it complete a task. But the next phase gets messy fast:
- what agents are installed?
- what

🟡 tinyhumansai/openhuman — Your Personal AI super intelligence. Private, Simple and extremely powerful. — score 56 Sources: github_trending

🟡 ZhuLinsen/daily_stock_analysis — LLM-driven smart analysis of A-share, Hong Kong, and US stocks: multi-source market data + real-time news + LLM decision dashboard + multi-channel push notifications, with zero-cost scheduled runs, entirely free of charge. LLM-powered stock analysis system for A/H/US markets. — score 51 Sources: github_trending

Omitted 3 additional developer tools items from the main section; see raw data and source-specific sections below.

Infrastructure & Compute

🟡 jundot/omlx — LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar — score 68 Sources: github_trending

Enterprise Adoption

🟡 How enterprises are scaling AI — score 50 Sources: lab_blog/OpenAI

How enterprises scale AI: from early experiments to compounding impact through trust, governance, workflow design, and quality at scale.

Research Papers

🟡 Rethinking State Tracking in Recurrent Models Through Error Control Dynamics — score 68 Sources: huggingface · arxiv/cs.CL

The theory of state tracking in recurrent architectures has predominantly focused on expressive capacity: whether a fixed architecture can theoretically realize a set of symbolic transition rules. We argue that equally important is error control, the dynamics governing hidden-state drift along the d

🟡 Gated QKAN-FWP: Scalable Quantum-inspired Sequence Learning — score 60 Sources: huggingface · arxiv/cs.AI

Fast Weight Programmers (FWPs) encode temporal dependencies through dynamically updated parameters rather than recurrent hidden states. Quantum FWPs (QFWPs) extend this idea with variational quantum circuits (VQCs), but existing implementations rely on multi-qubit architectures that are difficult to

Other Signals

🟡 MTP benchmark results: the nature of the generative task dictates whether you will benefit (coding) or get slower inference (creative) from speculative inference. No other factor comes close. — score 65 Sources: reddit/r/LocalLLaMA

I recently published MTP quants of Qwen 3.6 27B and I was surprised by the reports here on Reddit, and on HF, of users who were experiencing worse speed with speculative inference than without. Th
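Why speculative inference can help on one task and hurt on another can be illustrated with the standard back-of-the-envelope model for speculative decoding. This is a simplified sketch, not the benchmark method from the post: it assumes every drafted token is accepted independently with the same probability, and the names `accept_rate`, `draft_len`, and `draft_cost_ratio` are hypothetical parameters chosen for illustration.

```python
def expected_tokens_per_step(accept_rate: float, draft_len: int) -> float:
    """Expected tokens produced per target-model verification step.

    Assumes each of the draft_len drafted tokens is accepted independently
    with probability accept_rate (a simplification of real behavior).
    """
    if not 0.0 <= accept_rate <= 1.0:
        raise ValueError("accept_rate must be in [0, 1]")
    if draft_len < 0:
        raise ValueError("draft_len must be non-negative")
    if accept_rate == 1.0:
        # Every drafted token is kept, plus one token from the target model.
        return float(draft_len + 1)
    return (1 - accept_rate ** (draft_len + 1)) / (1 - accept_rate)


def estimated_speedup(accept_rate: float, draft_len: int,
                      draft_cost_ratio: float) -> float:
    """Estimated speedup over plain decoding.

    draft_cost_ratio is the assumed cost of drafting one token relative to
    one target-model step. Values below 1.0 mean a slowdown.
    """
    step_cost = draft_len * draft_cost_ratio + 1
    return expected_tokens_per_step(accept_rate, draft_len) / step_cost


if __name__ == "__main__":
    # Predictable output (high acceptance) versus unpredictable output (low acceptance).
    for label, rate in (("high acceptance", 0.9), ("low acceptance", 0.2)):
        print(label, round(estimated_speedup(rate, draft_len=3, draft_cost_ratio=0.1), 2))
```

In this model, a low acceptance rate means the drafting overhead outweighs the few extra tokens gained, so the estimate drops below 1.0, which matches the direction of the slowdowns users reported.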

🟡 I Think I Spent Way Too Much Time Messing with Local LLMs — score 58 Sources: reddit/r/LocalLLaMA

Guys, I'm hearing coil whine in my sleep. Help /s

🟢 Incremental

Model Releases

🟢 What's actually moving the needle on agent token bills? — score 39 Sources: reddit/r/AIAgents

I've been researching how teams handle FinOps and cost optimization on agentic workflows in production. Wanted to share what keeps coming up and ask what's actually working in your setup. Most stacks I've looked at have the same starting kit: a cheaper model for routing or sub-tasks (haiku, gpt-4o-min

🟢 How Fast Does Claude, Acting as a User Space IP Stack, Respond to Pings? — score 38 Sources: hackernews

🟢 The Qwen 3.6 35B A3B hype is real!!! — score 35 Sources: reddit/r/LocalLLaMA

My personal test for small local LLM intelligence is to check whether a model has any ability to understand the code that I write for my own academic research. My research is on some pretty niche topics and I doubt that anything like it is substantively present in the training sets for LLMs. A few m

🟢 Show HN: adamsreview – better multi-agent PR reviews for Claude Code — score 12 Sources: hackernews

Developer Tools

🟢 MemoriLabs/Memori — Memori is agent-native memory infrastructure. An LLM-agnostic layer that turns agent execution and conversation into structured, persistent state for production systems. — score 39 Sources: github_trending

🟢 apify/crawlee — Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation. — score 30 Sources: github_trending

🟢 Has anyone built scripts or seeded test workspaces for Gmail, Slack, Teams, Notion, GitHub etc.? — score 17 Sources: reddit/r/AIAgents

We're building an AI agent platform and need realistic test environments across a bunch of SaaS tools. Empty sandboxes don't cut it — we need accounts that look like they've actually been used. Specifically looking for any of the following: * Scripts that use APIs to seed realistic data (emails,

🟢 Would you trust an AI agent to monitor flight deals and book them for you? — score 17 Sources: reddit/r/AIAgents

I joined a startup team, because everyone on the team shares the same travel frustration: there’s just no time to keep checking flight apps every day. A lot of flight alert tools are too slow. Sometimes airlines or travel platforms briefly show unusually low fares or even prices caused by a system b

🟢 Why your AI agent needs a dedicated inbox, not a shared mailbox (and how to wire it up) — score 17 Sources: reddit/r/AIAgents

Been building agent workflows that send and receive email for a while now. One of the most common mistakes I see is routing all agent email through a single shared inbox like [email protected] or a team Gmail account.

The problem with shared inboxes

When your agen

Omitted 2 additional developer tools items from the main section; see raw data and source-specific sections below.

Research Papers

🟢 Sparse Autoencoders as Plug-and-Play Firewalls for Adversarial Attack Detection in VLMs — score 38 Sources: huggingface · arxiv/cs.AI

Vision-language models (VLMs) have advanced rapidly and are increasingly deployed in real-world applications, especially with the rise of agent-based systems. However, their safety has received relatively limited attention. Even the latest proprietary and open-weight VLMs remain highly vulnerable to

🟢 R^3-SQL: Ranking Reward and Resampling for Text-to-SQL — score 25 Sources: huggingface

Modern Text-to-SQL systems generate multiple candidate SQL queries and rank them to judge a final prediction. However, existing methods face two limitations. First, they often score functionally equivalent SQL queries inconsistently despite identical execution results. Second, ranking cannot recover

Other Signals

🟢 unsloth/MiMo-V2.5-GGUF · Hugging Face — score 27 Sources: reddit/r/LocalLLaMA

can you run it?

🟢 Any news (or hope) of Qwen-3.6 14B and 9B distills for local coding? — score 12 Sources: reddit/r/LocalLLaMA

As the title suggests. I'm already testing (with some success, and a few challenges) usage of Qwen-3.5 9B with a new work laptop that I've received with RTX 1000 6GB VRAM (I know it seems like a joke in today's time and age). I am using it with `pi` as the terminal coding harness. The issue I am fac

🟢 Why is human LLM annotation so expensive? [D] — score 6 Sources: reddit/r/MachineLearning

Scale AI and similar services charge a lot for annotation. MTurk is cheap but the quality is horrible for anything requiring real domain understanding. For small teams that need a few thousand labeled examples to calibrate their evals or fine-tune a model, there seems to be no good middle ground. Ho

🟢 Has anyone been able to get Draft Models to load in LM Studio? — score 4 Sources: reddit/r/LocalLLaMA

Per title. Been trying to load Gemma E2b as draft model for 26b as target using LM Studio's UI but it can't seem to recognise what's already been downloaded. Any advice on how to get this to work?

📈 Trending Repos

Repo	Description	Stars Today	Language
NousResearch/hermes-agent	The agent that grows with you	1496	python
anthropics/skills	Public repository for Agent Skills	509	python
yikart/AiToEarn	Let's use AI to Earn!	397	typescript
jundot/omlx	LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar	185	python
open-webui/open-webui	User-friendly AI Interface (Supports Ollama, OpenAI API, ...)	174	python
tinyhumansai/openhuman	Your Personal AI super intelligence. Private, Simple and extremely powerful.	154	rust
ZhuLinsen/daily_stock_analysis	LLM-driven smart analysis of A-share, Hong Kong, and US stocks: multi-source market data + real-time news + LLM decision dashboard + multi-channel push notifications, with zero-cost scheduled runs, entirely free of charge. LLM-powered stock analysis system for A/H/US markets.	141	python
MemoriLabs/Memori	Memori is agent-native memory infrastructure. An LLM-agnostic layer that turns agent execution and conversation into structured, persistent state for production systems.	62	python
apify/crawlee	Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.	42	typescript
alibaba/page-agent	JavaScript in-page GUI agent. Control web interfaces with natural language.	12	typescript

📄 New Papers

Title	Category	Hotness
MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation	research_paper	70
Rethinking State Tracking in Recurrent Models Through Error Control Dynamics	research_paper	7
Gated QKAN-FWP: Scalable Quantum-inspired Sequence Learning	research_paper	3
GraphDC: A Divide-and-Conquer Multi-Agent System for Scalable Graph Algorithm Reasoning	cs.AI	0
More Thinking, More Bias: Length-Driven Position Bias in Reasoning Models	cs.AI	0
Fast and Effective Redistricting Optimization via Composite-Move Tabu Search	cs.AI	0
State Representation and Termination for Recursive Reasoning Systems	cs.AI	0
Hidden Coalitions in Multi-Agent AI: A Spectral Diagnostic from Internal Representations	cs.AI	0
CASCADE: Case-Based Continual Adaptation for Large Language Models During Deployment	cs.AI	0
From Storage to Experience: A Survey on the Evolution of LLM Agent Memory Mechanisms	cs.AI	0
When Does a Language Model Commit? A Finite-Answer Theory of Pre-Verbalization Commitment	cs.AI	0
Weblica: Scalable and Reproducible Training Environments for Visual Web Agents	cs.AI	0
When Does Critique Improve AI-Assisted Theoretical Physics? SCALAR: Structured Critic–Actor Loop for Agentic Reasoning	cs.AI	0
Towards Security-Auditable LLM Agents: A Unified Graph Representation	cs.AI	0
Uneven Evolution of Cognition Across Generations of Generative AI Models	cs.AI	0

🏢 Lab Blog Posts

OpenAI: OpenAI Campus Network: Student club interest form
OpenAI: How enterprises are scaling AI

Repeated From Recent Briefings

anthropics/financial-services - first seen 2026-05-07
My experience interviewing with Huawei Vancouver for an ML research role: strong mismatch between how it was pitched and how it was evaluated [D] - first seen 2026-05-09
farion1231/cc-switch — A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official website: ccswitch.io - first seen 2026-05-08
datawhalechina/hello-agents — 📚 "Building Agents from Scratch": a from-scratch tutorial on agent principles and practice - first seen 2026-05-09
bytedance/UI-TARS-desktop — The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra - first seen 2026-05-09
rohitg00/agentmemory — #1 Persistent memory for AI coding agents based on real-world benchmarks - first seen 2026-05-09
DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents - first seen 2026-05-07
Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers - first seen 2026-05-08
rowboatlabs/rowboat — Open-source AI coworker, with memory - first seen 2026-05-10
millionco/react-doctor — Your agent writes bad React. This catches it - first seen 2026-05-10
... plus 105 more repeated items in processed data

AI Watchtower Briefing — 2026-05-11
