AI Resources Weekly Report

Published on 2026-07-20

Last updated on 2026-07-20

AI Resources Weekly Report

更新频率：每周自动更新

New Tools

Code Agents

IDE Plugins

cline: Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Roo-Cline: A fork of Cline, an autonomous coding agent, with some additional experimental features. It’s been mainly writing itself recently, with a light touch of human guidance here and there.
Roo Code: (Formerly Roo-Cline) An autonomous coding agent and significantly evolved fork of Cline. Now features “Custom Modes” and operates as a full AI Coding OS.
Claude Code Workflow Studio: Design complex AI agent workflows by conversing with AI – or use intuitive drag-and-drop. Build Sub-Agent orchestrations and conditional branching with natural language, then export directly to .claude format.
GitKraken Desktop 12.0: Agent Mode — dedicated view for launching, monitoring, and managing multiple AI coding agent sessions from a single interface (Apr 2026)

CLI & Cloud Agents

vm0: Natural language Agent, 24/7 in cloud sandbox
agenticSeek: Fully Local Manus AI
ml-intern: HuggingFace’s open-source AI agent that automates the full LLM post-training workflow
GELab-Zero-4B-preview: 这是个专注于 Android 系统的GUI 代理模型，针对交互界面元素（点击、输入、滑动、等待等）进行了优化，可以支持跨多个应用（如餐饮、交通、购物、社交等）执行多步骤长时程任务。
Atlassian Rovo Dev: Enterprise AI coding agent CLI — integrates with Jira, Confluence, Bitbucket via Teamwork Graph. Supports Claude Code, Cursor, Copilot, Kiro, OpenCode via installable skills (Apr 28, 2026)
Warp Terminal: Open-source AI-native terminal (MIT/AGPLv3, Rust). 41K+ GitHub stars in 48h. Built-in support for Claude Code, Codex, Gemini CLI. OpenAI founding sponsor. Oz cloud agent orchestration (Apr 30, 2026)
cplt: Sandbox wrapper for AI coding agents — runs GitHub Copilot CLI, OpenCode, Gemini CLI, or plain shell inside a kernel-level sandbox. macOS (Seatbelt/SBPL) + Linux (Landlock LSM + seccomp-BPF). MIT license (May 12, 2026)
agent-sandbox: Runs AI coding agents inside isolated Docker containers with no root access, no Docker socket, and no privilege escalation. Multi-language support (Node, Go, PHP) (May 8, 2026)
Code Bench: Local-first desktop AI coding agent, BYO model (MIT). Desktop-native with integrated AI agent (May 10, 2026)
Kanbots: Open-source Kanban desktop app that runs parallel AI coding agents (Claude Code, Codex) on every card — local collaboration interface for multi-agent task management (May 2026)
CodeGraph: Pre-indexed code knowledge graph for Claude Code, Codex, Cursor, OpenCode, and Hermes Agent — fewer tokens, fewer tool calls, 100% local (16.4K stars, MIT)

UI & Design Dev Tools

Pax dev and code pax: Pax is a revolutionary new canvas for building apps & websites with AI.
makepad:a new way to build UIs in Rust for both native and the web.
superdesign: extract webpage info and generate UI designs
ui.sh
variant

Video Generation

Video Models

FramePack: Revolutionary next-frame prediction model using a 13B model to generate 1-minute videos (60 seconds) at 30fps (1800 frames), minimum GPU memory required is 6GB
Wan2.1 we present Wan2.1, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Pusa: is a new video diffusion model that matches SOTA with 200x less training cost & 2500x less data. It outperforms Wan-I2V on VBench-I2V, runs 5x faster, and supports I2V, T2V, start-end frames, video extension, and video completion.
HunyuanVideo: Tencent’s 13B+ parameter video generation model with systematic framework
CogVideoX: Tsinghua’s expert transformer-based text-to-video diffusion model
Luma AI Dream Machine: Text-to-video and image-to-video generator powered by Ray3 model, featuring 4K resolution output and realistic physics
Sora 2: OpenAI’s latest video generation model with enhanced coherence and length capabilities
Runway Gen-4.5: World’s top-rated video model — unprecedented visual fidelity and creative control, advanced camera presets
Seedance 2.0: ByteDance’s multimodal video model — #1 on Arena for T2V (Elo 1460) and I2V (Elo 1454) as of April 2026. Unified audio-video generation, accepts up to 12 mixed inputs, 4–15s clips at 2K with native audio
Kling 3.0: 4K@60fps video generation, ~$0.50/10s clip, best value for volume creators
Veo 3.1: Google DeepMind’s video model — current benchmark for cinematic realism and native audio quality
PixVerse V6: Advanced video generation with precise parameterized camera control; R1 real-time world model with shared worlds + personalized avatars (Apr 2026 update); Series C unicorn status, 100M+ users across 177 countries
Pollo AI: All-in-one AI video & image platform, innovative video-to-video transformation
Genra AI: Chat-to-Video AI agent with built-in music, avatars, and templates: 15B params, ranks #1 on T2V and I2V leaderboards (Elo 1333/1392) as of April 2026, accepts 12 simultaneous multimodal inputs
Gemini Omni: Google’s upcoming unified video generation model, surfaced ahead of Google I/O 2026. Demonstrates significantly improved text rendering in generated video (May 2026, pre-release)

Avatar & Talking Head

SynTalker:Enabling Synergistic Full-Body Control in Prompt-Based Co-Speech Motion Generation
EchoMimicV3: 1.3B Parameters for Unified Multi-Modal and Multi-Task Human Animation (AAAI 2026)
JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation
MyTimeMachine: Personalized Facial Age Transformation
MEMO: MEMO is a state-of-the-art open-weight model for audio-driven talking video generation.
StableAnimator:High-Quality Identity-Preserving Human Image Animation (CVPR 2025)
INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations.Given the dual-track audio in dyadic conversations and a single portrait image of arbitrary agent, our framework can dynamically synthesize verbal, non-verbal and interactive agent videos with lifelike facial expressions and rhythmic head pose movements
LatentSync:Taming Stable Diffusion for Lip Sync! - State-of-the-art lip-sync technology from ByteDance
KDTalker: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait (IJCV 2025)
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis (ACM MM 2025)
HeyGen Avatar V: Most realistic AI avatar — 15-second recording creates full identity model, 175+ language translation with lip-sync, zero identity drift
HappyHorse 1.0

Video Tools

VideoCaptioner: An intelligent video subtitle processing assistant based on Large Language Models (LLM), supporting subtitle generation, optimization, translation and more
VideoLingo: VideoLingo is an all-in-one video translation, localization, and dubbing tool aimed at generating Netflix-quality subtitles. It eliminates stiff machine translations and multi-line subtitles while adding high-quality dubbing, enabling global knowledge sharing across language barriers.
SmolVLM real-time camera demo This repository is a simple demo for how to use llama.cpp server with SmolVLM 500M to get real-time object detection

Audio

TTS

OpenAudio (formerly Fish Audio): support both tts and asr, #1 on TTS-Arena2 with 0.008 WER
fish-speech-gui
F5-TTS:F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching - Very active with streaming support
OuteTTS:OuteTTS is an experimental text-to-speech model that uses a pure language modeling approach to generate speech, without architectural changes to the foundation model itself.
ElevenLabs Flash： generates speech in 75ms + application & network latency，build directly via the API using model id “eleven_flash_v2” and “eleven_flash_v2_5”. April 2026: v3 conversational model, TTS Normalizer v3.1, Agent platform with tool error handling and voice quality assessment.
Orpheus TTS: 自然人声合成 with Llama-3b backbone, <200ms latency
dia: Dia directly generates highly realistic dialogue from a transcript. Recently released Dia2 with 1.6B params
VITA-Audio Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model (53ms first token latency)
Muyan-TTS is a trainable TTS model designed for podcast applications within a $50,000 budget, which is pre-trained on over 100,000 hours of podcast audio data
Higgs Audio V2: Redefining Expressiveness in Audio Generation with 75.7% win rate over GPT-4o-mini
SoulX-Podcast: Very recent podcast generation with Chinese dialects support
sonic-3
MiniMax Speech 2.6
Spark-TTS: Recent open-source TTS model
Voxtral TTS: Mistral AI’s open-source text-to-speech model
FireRedTTS: Competitive open-source option
MoonCast: Multi-speaker dialogue generation
CosyVoice2: Advanced voice cloning
Gemini 3.1 Flash TTS: Google’s most expressive TTS — 200+ inline audio tags, 70+ languages, two-speaker mode, $0.50/1M chars, Elo 1211 on Artificial Analysis (highest)
MiMo-V2.5-TTS: Xiaomi’s open-source TTS series (VoiceDesign / VoiceClone variants), supports Chinese dialects
Kani-TTS-2: 400M param open-source TTS, runs in 3GB VRAM, 12 languages, voice cloning support
OmniVoice: 600+ language zero-shot TTS with diffusion language model, SOTA on multilingual benchmarks
Chatterbox Multilingual: MIT-licensed zero-shot TTS for 23 languages with emotion control and watermarking
TADA: Hume AI’s open-source zero-hallucination TTS, 5x faster than LLM-based TTS (1B/3B models)
MOSS-TTS-Nano: Open-source 0.1B param multilingual TTS from Fudan NLP Lab / MOSI.AI — CPU-only inference, 48kHz stereo, 20 languages, voice cloning, Apache 2.0 (Apr 10, 2026)
MOSS-TTS Family: Open-source speech & sound generation family — MOSS-TTS (8B), VoiceGenerator (1.7B), SoundEffect (8B), TTS-Realtime (1.7B, 180ms TTFB, 377ms e2e). Covers long-form, dialogue, voice design, SFX, streaming. Apache 2.0
Supertonic: Lightning-fast, on-device, multilingual TTS running natively via ONNX — supports Swift, Python, Rust, C++, Java, Go, Node.js, Flutter, Web (9.4K stars, MIT)
LongCat-AudioDiT: Meituan’s 3.5B non-autoregressive TTS — operates in waveform latent space, eliminates error accumulation, Adaptive Projection Guidance (APG) (Mar 31, 2026)
MAGIC-TTS: Fine-grained controllable speech synthesis with explicit local duration and pause control — flow-matching TTS with token-aligned timing (arXiv Apr 2026)
pocket-tts

ASR

Open ASR Leaderboard
FunASR:FunASR is a speech recognition framework developed by the Speech Lab of DAMO Academy, which integrates industrial-level models in the fields of speech endpoint detection, speech recognition, punctuation segmentation, and more. It has attracted many developers to participate in experiencing and developing
nvidia/canary-1b:Canary-1B supports automatic speech-to-text recognition (ASR) in 4 languages (English, German, French, Spanish) and translation from English to German/French/Spanish and from German/French/Spanish to English with or without punctuation and capitalization (PnC).
Parakeet TDT 1.1B (en): is an ASR model that transcribes speech in lower case English alphabet. This model is jointly developed by NVIDIA NeMo and Suno.ai teams. It is an XXL version of FastConformer [1] TDT [2] (around 1.1B parameters) model.
CrisperWhisper:CrisperWhisper is an advanced variant of OpenAI’s Whisper, designed for fast, precise, and verbatim speech recognition with accurate (crisp) word-level timestamps. Unlike the original Whisper, which tends to omit disfluencies and follows more of a intended transcription style, CrisperWhisper aims to transcribe every spoken word exactly as it is, including fillers, pauses, stutters and false starts. support English, German.
Deepgram
Whisper-WebUI: A Gradio-based browser interface for Whisper. You can use it as an Easy Subtitle Generator!
ten-vad TEN VAD is a real-time voice activity detection system designed for enterprise use, providing accurate frame-level speech activity detection. It shows superior precision compared to both WebRTC VAD and Silero VAD, which are commonly used in the industry. Additionally, TEN VAD offers lower computational complexity and reduced memory usage compared to Silero VAD. Meanwhile, the architecture’s temporal efficiency enables rapid voice activity detection, significantly reducing end-to-end response and turn detection latency in conversational AI systems.
Omnilingual ASR: 1600+ languages support with zero-shot learning
Scribe v2 Realtime Speech to Text - 150ms Latency API: very promising
WhisperX: Improved Whisper with diarization
SenseVoice: FunASR’s multilingual model
Qwen3 ASR: Pure Rust implementation of Qwen3-ASR automatic speech recognition
MiMo-V2.5-ASR: Xiaomi’s open-source ASR — handles bilingual conversations, Chinese dialects (Wu, Cantonese, Minnan, Sichuanese), song lyrics recognition
mega-asr

Music

qa-mdt(active):(OpenMusic) Awesome Open-source Text-to-music (TTM) generation: QA-MDT (IJCAI-25 accepted)
DiffRhythm:全球首个基于扩散模型的端到端音乐模型
suno: Leading commercial music AI platform — Suno v5.5 with Studio DAW, personal voice cloning; Warner Music licensing deal (2026); 50 credits/day free tier
Lyria: Google’s music model (powering YouTube)
Beatoven.ai: Royalty-free music generation
LoudMe: Text-to-song generator
Udio: Professional music generation — best vocal realism, Inpaint editing feature; legal status unsettled after RIAA lawsuit
SUMO: Pay-per-track AI music generation with clear commercial rights
huobao-drama
khala

Voice Tools

UVR5-UI: UI for UVR’s, state-of-the-art source separation models to remove vocals from audio files
Grok Speech API: xAI’s standalone STT/TTS APIs — STT error rate 5.0% (industry-best), 25 languages, $0.10/hr batch /$ 0.20/hr streaming
MMAudio: MMAudio generates synchronized audio given video and/or text inputs
omniaudio-2.6b and demo: World’s Fastest Audio Language Model for Edge Deployment
openai-webrtc-go
MicDrop: Transform your voice into any voice, instantly.
Spatial Speech Translation Translating Across Space With Binaural Hearables
Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice
Grok Speech API: xAI’s standalone speech-to-text and text-to-speech APIs for enterprise voice developers
Amazon Nova 2 Sonic: AWS unified speech-to-speech foundation model — bidirectional streaming, unifies ASR + reasoning + tool use + TTS into single model. Background noise robustness, tone/sentiment adaptation

Image

Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer - 20x smaller and 100x faster than FLUX-12B (ICLR-2025 Oral)
HivisionIDPhotos: 旨在开发一种实用、系统性的证件照智能制作算法，它利用一套完善的 AI 模型工作流程，实现对多种用户拍照场景的识别、抠图与证件照生成。
LiYing: LiYing 是一套适用于自动化完成一般照相馆后期证件照处理流程的照片自动处理的程序。
BRIA-RMBG: High-Accuracy, Legal, and Inclusive Background Removal
RMBG-2-Studio:
IC-Custom: IC-Custom is designed for diverse image customization scenarios, including:
InvSR: Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)
upscayl: Upscayl lets you enlarge and enhance low-resolution images using advanced AI algorithms. Enlarge images without losing quality. It’s almost like magic! 🎩🪄
HiDream-I1: is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within seconds.
FLUX: Leading open-weight image generation model from Stable Diffusion creators
Recraft V3: #1 on Artificial Analysis rankings for 5 consecutive months
Midjourney V8.1: V8.1 Alpha — HD mode 3x faster & cheaper, native 2K rendering (--hd), Prompt Shortener, updated Describe, image prompts restored
Midjourney v7: Aesthetic intelligence benchmark — --cref character consistency, --sref style lock, cinematic texture
Remove.bg: Industry standard for automatic background removal
Z-Image-Turbo 图像生成器
Z-Image-Turbo: is a powerful and highly efficient image generation model with 6B parameters. Currently there are three variants
- GPT-Image-2: OpenAI’s next-gen image model — officially released April 21, 2026. First agentic image model with O-series reasoning, 2K resolution, near-perfect multilingual text rendering (CJK/Hindi/Arabic), natural language editing, web search integration. Powered by GPT-5.4 backbone. API pricing $8/$ 30 per million tokens.
- Recraft V4: Design-optimized image model with native SVG vector output and raster variant — professional brand asset generation
- Flux 2 Max: Open-weight photorealistic image model, excels at complex multi-subject scenes
- Dolphin is a novel multimodal document image parsing model that follows an analyze-then-parse paradigm. It addresses the challenges of complex document understanding through a two-stage approach designed to handle intertwined elements such as text paragraphs, figures, formulas, and tables.
- Seedream 5.0: ByteDance’s latest image generation model — multi-version (5.0/4.5/4.0), advanced multi-image editing, real-time search integration, top-rated on Pixazo for versatility

3D Generation

StableGen: Transform your 3D texturing workflow with the power of generative AI, directly within Blender!
Meshy AI: Text-to-3D model generation with high-quality 3D assets — Meshy 6 latest model for highest fidelity
Hunyuan3D 2.1: Tencent’s 3D generation model for detailed textured assets
Hunyuan3D-Omni: Unified framework for controllable generation of 3D assets
hyper3d: 3D 模型生成（3D打印）
Seed3D 2.0: ByteDance’s next-gen 3D foundation model — officially released April 23, 2026. SOTA geometry + texture quality, improved PBR materials, supports scene generation from text/image/video (API via Volcano Engine)
Tripo Studio / Tripo H3.1: All-in-one AI 3D workspace. H3.1 (April 2026) — high-fidelity image-to-3D for production (game dev, product viz, interactive media). 20B+ parameter algorithm, 6.5M+ creators, AI model segmentation and 3D printing support
Hi3D v2.1: Converts AI images (GPT-Image-2 etc.) to production-ready 3D models, 2-5 min generation, manifold meshes for 3D printing

AI Search & Research

Search Engines

Perplexity AI: Market leader in AI search
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher - Deployed on Puyu platform
khoj: is a personal AI app to extend your capabilities. It smoothly scales up from an on-device personal AI to a cloud-scale enterprise AI.
Scira (formerly mplx.run): A minimalistic AI-powered search engine that helps you find information on the internet
Gemini Search: A Perplexity-style search engine powered by Google’s Gemini 2.0 Flash model with grounding through Google Search
exa: AI-native search engine for developers and research
You.com: AI-powered search with personalization
Kagi: Premium AI search engine
Brave Search: Privacy-focused AI search

Research & Deep Research

SurfSense While tools like NotebookLM and Perplexity are impressive and highly effective for conducting research on any topic/query, SurfSense elevates this capability by integrating with your personal knowledge base. It is a highly customizable AI research agent, connected to external sources such as search engines (Tavily, LinkUp), Slack, Linear, Notion, YouTube, GitHub and more to come.
Agentic Company Researcher 🔍: A multi-agent tool that generates comprehensive company research reports
MAESTRO: Your Self-Hosted AI Research Assistant
Open Deep Research: Deep research has broken out as one of the most popular agent applications. This is a simple, configurable, fully open source deep research agent that works across many model providers, search tools, and MCP servers. It’s performance is on par with many popular deep research agents (see Deep Research Bench leaderboard).
MiroThinker/code: is MiroMind’s Flagship Research Agent Model. It is an open-source search model designed to advance tool-augmented reasoning and information-seeking capabilities, enabling complex real-world research workflows across diverse challenges
CO-storm: Get a Wikipedia-like report on your topic with AI, STORM is a research prototype that supports interactive knowledge curation.
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
Resume Matcher: Resume Matcher is an AI Based Free & Open Source Tool. To tailor your resume to a job description. Find the matching keywords, improve the readability and gain deep insights into your resume.
InsightExpress: InsightExpress is a Next.js application that generates AI-powered research reports based on user-provided topics and emails them to users. The application leverages Langflow for its AI capabilities and features a modern, responsive UI built using NextJS.

Document & OCR

pdf2md: Self-hostable API server and pipeline for converting PDF’s to markdown using thrifty large language vision models like GPT-4o-mini and gemini-flash-1.5.
PDFMathTranslate: PDF scientific paper translation and bilingual comparison.
gamma: 精美的演示文稿、文档和网站。无需设计或编码技能。
refly: 这是一款革新性的 AI Native 内容创作引擎！⚡️Refly 是基于「自由画布👨‍🎨👩‍🎨」理念打造的 AI Native 创作工具，为用户提供从创意萌发到成品内容的一站式解决方案🌈：
markitdown: MarkItDown is a utility for converting various files to Markdown (e.g., for indexing, text analysis, etc).
ReaderLM: converts raw HTML into beautifully formatted markdown or JSON with superior accuracy and improved longer context handling
AiryLark 是一个开源的文档处理工具，支持多种文件格式的输入和处理。无论是 PDF 文档、Word 文件还是纯文本，AiryLark 都能高效处理。
AI Presentation Generator
GutenOCR
deepseek-ocr-2
PaddleOCR
glm-ocr

AI Agents & OpenClaw

Agent Frameworks

ChainForge: An open-source visual programming environment for battle-testing prompts to LLMs.
anychat:A unified chat interface for multiple AI models powered by Gradio. This application provides access to various leading AI models through a simple tab-based interface.
aisuite: Simple, unified interface to multiple Generative AI providers.
Open Canvas: Open Canvas is an open source web application for collaborating with agents to better write documents. It is inspired by OpenAI’s “Canvas”, but with a few key differences.
Reppl:Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts; experimental AI-assisted re-implementation of prompt optimization algorithms.
Google ADK → Gemini Enterprise Agent Platform: Google’s Agent Development Kit — major upgrade at Cloud Next 2026. Graph-based multi-agent orchestration, 6T+ tokens/month, secure Workspaces, multimodal streaming. Now part of unified Gemini Enterprise Agent Platform replacing Vertex AI
Kimi Claw: Moonshot AI’s native OpenClaw agent — 5,000 community skills, 40GB cloud storage
Statewright: Visual state machine guardrails for AI agents — controls which tools agents can use in each phase. Works across Claude Code, Codex, Cursor, OpenCode, Pi. Local models (13GB+) go from 2/10 to 10/10 on SWE-bench subset. 122 HN points (May 12, 2026)
Mozaik: TypeScript framework for building reactive AI agents — models LLM context as a first-class primitive, aligned with OpenResponses spec. Provider-agnostic, typed context composition (May 11, 2026)
Rotunda: A browser built for AI agents from the ground up with simulated typing and anti-CAPTCHA. Playwright-compatible Python library (May 13, 2026)
Recursant: Mesh-based control plane for governing AI agents — service mesh for production agent deployments (May 13, 2026)
Lowdefy v5.3: AI agents in 30 lines of YAML — low-code agent orchestration platform (May 11, 2026)
OpenHuman: Your personal AI superintelligence — private, simple, extremely powerful. Rust-based (25.7K stars)
Academic Research Skills: Academic Research Skills for Claude Code: research → write → review → revise → finalize pipeline (19K stars)
Superpowers: Agentic skills framework & software development methodology that works (202.8K stars, MIT)
RuView: Turns commodity WiFi signals into real-time spatial intelligence, vital sign monitoring, and presence detection — no camera needed (63.9K stars, MIT)
AgentMemory: #1 Persistent memory for AI coding agents based on real-world benchmarks (16.4K stars)
CloakBrowser: Stealth Chromium that passes every bot detection test — drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed (18.7K stars, MIT)
ViMax: Agentic Video Generation — Director, Screenwriter, Producer, and Video Generator All-in-One (6.7K stars, MIT)

Desktop Coworkers

Open Claude Cowork: An open-source desktop chat application powered by Claude Agent SDK and Composio Tool Router. Build AI agents with access to 500+ tools and persistent chat sessions.
openwork: Open Source AI Desktop Agent(coworker)
Claude-Cowork: Agent Cowork is an open-source alternative to Claude Cowork — a desktop AI assistant that helps with programming, file management, and any task you can describe(desktop)
eigent: The Open Source Cowork Desktop to Unlock Your Exceptional Productivity
AionUi: Cowork with Your AI, Gemini CLI, Claude Code, Codex, Qwen Code, Goose CLI, Auggie, and more
Deepseek-Cowork:

OpenClaw Ecosystem

KrillClaw: The world’s smallest AI agent runtime. 49KB. Written in Zig. Zero dependencies.
nullclaw:Fastest, smallest, and fully autonomous AI assistant infrastructure written in Zig
openfang: Open-source Agent OS built in Rust. 137K LOC. 14 crates. 1,767+ tests. Zero clippy warnings.
nanoclaw
ironclaw
nanobot
hermes-agent
NemoClaw: NVIDIA’s open-source secure OpenClaw stack — one-command install, privacy controls, Nemotron-powered agents
OpenHarness

Open Source LLMs

BAGEL Unified Model for Multimodal Understanding and Generation - Outperforms Qwen2.5-VL and InternVL-2.5
VITA: Towards GPT-4o Level Real-Time Vision and Speech Interaction
Qwen2.5-Omni: Alibaba’s multimodal model with TTS capabilities
Gemma 4 (31B): Google DeepMind’s open multimodal model — text + image + video input, 256K context, 140+ languages, hybrid attention
DeepSeek-V4: MIT-licensed MoE — V4-Pro (1.6T total / 49B active) and V4-Flash (284B / 13B active), 1M context, 32T tokens pretrain. Released April 24, 2026. Competes with top closed-source models on reasoning, coding, agentic tasks. $0.14/M input (Flash) /$ 1.74/M input (Pro) — cheapest frontier-class model ever released publicly
GLM-5.1: Z.ai’s open-weights model — sustains autonomous work for up to 8 hours, strong coding and agentic workflows. 744B MoE with 40B active
Qwen3.6-27B: Dense 27B model — flagship-level coding, 77.2% SWE-bench Verified, competitive with Claude 4.5 Opus on coding agents
Qwen3.6-35B-A3B: MoE variant with 3B active params for efficient deployment
Qwen3.5-397B-A17B: Alibaba’s flagship MoE — top-tier general chat, story writing, competitive with DeepSeek-V4
Qwen3.5-9B: 81.7% GPQA Diamond at $0.10/M tokens — benchmark leader in sub-$ 0.20 tier
Tencent Hy3-preview: Tencent’s latest open-weights model — strong STEM reasoning, code & agent benchmarks
Llama 4 Behemoth: Meta Superintelligence Labs’ first frontier model — Intelligence Index 52 (vs Llama 4 Maverick’s 18)
Llama 4 Scout: 10M token context window — longest in any open-weight model
Gemma 4: Google’s strongest open-weight family — four variants from 2B to 31B, native text+image+audio, Apache 2.0
Kimi K2.5/K2.6: Moonshot AI’s MoE models — strong agentic performance, top OpenClaw compatibility
Mistral Small 4: Apache 2.0, 6.5B active params — efficient open-source model for self-hosted deployment
Mistral Large 3: Mistral’s flagship model — competitive with frontier closed-source models
NVIDIA Nemotron 3 Super: High-performance inference model from GTC 2026
IBM Granite 4.0: Enterprise-grade multimodal models including 3B Vision variant
MiniMax-M2.7: Top OpenClaw compatibility, strong coding and agentic benchmarks
PrismML Bonsai 8B: 1-bit quantized model — extreme compression with competitive performance
Devstral 2: Mistral’s coding-focused open model
GLM-4.7: Z.ai’s predecessor to GLM-5.1, strong general performance

Inference Acceleration

Speculative Decoding

DFlash: Block Diffusion for Flash Speculative Decoding — lightweight block diffusion model for efficient parallel drafting. Supports Gemma 4, Qwen3.5/3.6, Kimi K2.5, MiniMax-M2.5 and more. Works with vLLM, SGLang, Transformers, MLX. 3.1K+ stars (MIT)
EAGLE-3: Extrapolation Algorithm for Greater Language-model Efficiency — feature-level speculative decoding with provable output quality preservation. EAGLE-3 (NeurIPS’25) is the latest. 2.3K+ stars
Medusa: Simple framework for accelerating LLM generation with multiple decoding heads — adds trainable draft heads to any LLM without a separate draft model. 2.7K+ stars
Gemma 4 MTP Drafters: Google’s Multi-Token Prediction drafters for Gemma 4 family — up to 3x speedup via speculative decoding, zero quality degradation. Apache 2.0, supports Transformers, MLX, vLLM, SGLang, Ollama, LiteRT-LM
SpecForge: Flexible open-source training framework for speculative decoding draft models — supports Medusa, EAGLE, and other drafting strategies
Awesome Multi-Token Prediction: Curated list of MTP papers, tools, and resources for LLMs and Speech-Language Models

Attention & KV Cache Optimization

FlashAttention-3: Fast and memory-efficient exact attention — exploits async Tensor Cores and TMA on Hopper GPUs (H100). The de facto standard for attention acceleration
vLLM: High-throughput LLM serving with PagedAttention — built-in speculative decoding support (DFlash, MTP, EAGLE, Medusa), continuous batching, quantization

Infrastructure & DevTools

Genesis: Genesis is a physics platform designed for general purpose Robotics/Embodied AI/Physical AI applications. It is simultaneously multiple things:
nofx: Agentic Trading OS
AI AGENTS FOR TRADING
dexter
微舆BettaFish 是一个从0实现的创新型多智能体舆情分析系统，帮助大家破除信息茧房，还原舆情原貌，预测未来走向，辅助决策。用户只需像聊天一样提出分析需求，智能体开始全自动分析国内外30+主流社媒与数百万条大众评论。
TrendRadar: 🚀 最快30秒部署的热点助手 —— 告别无效刷屏，只看真正关心的新闻资讯
higress: AI Native API Gateway
LLMRouter
grep.app:the fastest code search engine on the planet (of over 500k+ Git repos).
AIMX: Self-hosted, open-source email server designed for AI agents — purpose-built SMTP/IMAP infrastructure for agent-to-agent and agent-to-human communication (May 14, 2026)
Hypercubic Hopper: Agentic interface for mainframes and COBOL — AI agents that interact with legacy enterprise systems. 95 HN points (May 12, 2026)
Voker: Analytics platform for AI agents — YC S24 startup providing observability and metrics for production agent deployments. 59 HN points (May 12, 2026)
Models.dev: Open-source database of AI model specs, pricing, and capabilities — community-maintained reference for model comparison (MIT)
Pyrefly: Meta’s fast type checker and language server for Python, written in Rust (6.4K stars, MIT)
ModelRift: OpenSCAD code editor & AI 3D model builder — text-to-CAD tool for creating, customizing, and sharing parametric 3D models in browser
MiroFish: A Simple and Universal Swarm Intelligence Engine, Predicting Anything

Tutorials & Guides

Prompt Engineering Guide:Motivated by the high interest in developing with LLMs, we have created this new prompt engineering guide that contains all the latest papers, learning guides, lectures, references, and tools related to prompt engineering for LLMs.
Build and design an AI image generator app for iOS without ANY design or coding experience
Awesome Claude Prompts
cursorhub: 零基础掌握 Cursor 和我一起用 AI 做网站
Gemini for Google Workspace prompting guide 101
Context Engineering
ClaudeCode 最佳实践、社区技巧和工具的综合指南
Frad’s .claude: A comprehensive development environment with specialized AI agents for code review, security analysis, and technical leadership.
lovable Prompting 1.1
Anthropic Prompt engineering
prompt-optimizer
Writing a good CLAUDE.md
Vibe Coding 指南
Easy Vibe: Vibe coding 2026 — first modern coding course for beginners to master step by step (14K stars)

Leadboard

open asr leaderboard
TTS Arena V2
Artificial Analysis: Model performance rankings
LM Arena: Chatbot and video generation arena rankings
WhatLLM: Open source LLM comparison tool
Arena AI Model ELO History: Historical ELO rating tracker for AI models across benchmarks — visualizes model performance evolution over time

更新日志

2026-05-22

🆕 新增：Anthropic Project Glasswing — 网络安全倡议，联合 AWS、Apple、Google、Microsoft、NVIDIA 等发现 Claude Mythos Preview 前沿模型可发现所有主要操作系统和浏览器的高危漏洞，跨行业防御安全合作（194 HN points）
🆕 新增：CodeGraph — AI 编码 agent 预索引代码知识图谱，支持 Claude Code/Codex/Cursor/OpenCode/Hermes Agent，减少 token 和工具调用（16.4K stars, MIT）
🆕 新增：Kanbots — 开源看板桌面应用，每张卡片运行并行 AI 编码 agent（Claude Code/Codex），多 agent 任务管理（116 HN points）
🆕 新增：Supertonic — 超快本地多语言 TTS，ONNX 原生运行，支持 Swift/Python/Rust/C++/Java/Go/Node.js/Flutter/Web（9.4K stars, MIT）
🆕 新增：Superpowers — Agentic skills 框架和软件开发方法论（202.8K stars, MIT）
🆕 新增：OpenHuman — 个人 AI 超级智能，Rust 实现，隐私优先（25.7K stars）
🆕 新增：RuView — WiFi 信号实时空间智能，生命体征监测和存在检测（63.9K stars, MIT）
🆕 新增：AgentMemory — AI 编码 agent 持久化内存，基于真实基准排名第一（16.4K stars）
🆕 新增：CloakBrowser — 隐身 Chromium，通过所有机器人检测测试，Playwright 替代品（18.7K stars, MIT）
🆕 新增：ViMax — Agentic 视频生成，导演/编剧/制片人/生成器一体化（6.7K stars, MIT）
🆕 新增：Academic Research Skills — Claude Code 学术研究技能：研究→写作→审查→修订→定稿流程（19K stars）
🆕 新增：Models.dev — 开源 AI 模型规格/定价/能力数据库（MIT）
🆕 新增：Pyrefly — Meta 的快速 Python 类型检查器和语言服务器，Rust 实现（6.4K stars, MIT）
🆕 新增：ModelRift — OpenSCAD 代码编辑器和 AI 3D 模型构建器，Antigravity 2.0 在 OpenSCAD 架构 3D LLM 基准测试中排名第一（321 HN points）
🆕 新增：Easy Vibe — 2026 Vibe coding 初学者编程课程（14K stars）
📈 更新：Anthropic Glasswing 披露 Mythos Preview 模型在网络安全领域的前沿能力，已发现数千个高危漏洞

2026-05-15

🆕 新增：Statewright — AI agent 可视化状态机护栏，控制每阶段可用工具，跨 Claude Code/Codex/Cursor/OpenCode/Pi 兼容，本地模型 SWE-bench 从 2/10 提升至 10/10（122 HN points）
🆕 新增：cplt — AI 编码 agent 内核级沙箱包装器，支持 macOS Seatbelt + Linux Landlock/seccomp-BPF（MIT, NAV/挪威劳工福利局开源）
🆕 新增：agent-sandbox — Docker 容器隔离 AI 编码 agent，零 root 权限，零 Docker socket 访问，零提权能力
🆕 新增：Code Bench — 本地优先桌面 AI 编码 agent，BYO 模型（MIT）
🆕 新增：Gemini Omni — Google 即将发布的统一视频生成模型，Google I/O 2026 前泄露，显著改善视频内文字渲染
🆕 新增：Mozaik — TypeScript 响应式 AI agent 框架，LLM context 作为一等公民，OpenResponses 规范对齐
🆕 新增：Rotunda — 专为 AI agent 构建的浏览器，模拟输入 + 反 CAPTCHA，Playwright 兼容
🆕 新增：Recursant — AI agent 服务网格控制平面，生产级 agent 治理
🆕 新增：Lowdefy v5.3 — 30 行 YAML 定义 AI agent，低代码 agent 编排平台
🆕 新增：AIMX — 自托管开源 AI agent 专用邮件服务器
🆕 新增：Hypercubic Hopper — 主机/COBOL 的 Agent 接口，AI agent 与遗留企业系统交互（95 HN points）
🆕 新增：Voker — AI agent 分析平台，YC S24，agent 生产部署可观测性（59 HN points）
🆕 新增：Arena AI Model ELO History — AI 模型 ELO 历史评分追踪工具
📈 更新：Anthropic 将 Claude Code SDK 和 claude -p 移出订阅计划，开放独立使用（May 14）
📈 更新：OpenAI 发布 GPT-5.5-Cyber — 网络安全专用模型，欧盟可信访问计划（May 8）

2026-05-07

🆕 新增：Inference Acceleration 章节 — 推理加速技术汇总，包含 Speculative Decoding（DFlash, EAGLE-3, Medusa, Gemma 4 MTP Drafters, SpecForge）和 Attention/KV Cache 优化（FlashAttention-3, vLLM）
🆕 新增：Warp Terminal — AI 终端开源（MIT/AGPLv3），OpenAI 赞助，41K+ GitHub stars，agent-native 开发环境（Rust）
🆕 新增：Atlassian Rovo Dev CLI — 企业级 AI 编码代理，集成 Jira/Confluence/Bitbucket，Teamwork Graph + MCP，支持多 agent 技能安装
🆕 新增：Amazon Nova 2 Sonic — AWS 统一语音到语音基础模型，bidirectional streaming，ASR+推理+TTS 单模型
🆕 新增：MOSS-TTS-Nano — 0.1B 参数开源 TTS，CPU-only，48kHz 立体声，20 语言（Fudan/MOSI.AI，Apache 2.0）
🆕 新增：MOSS-TTS Family — 开源语音家族（8B/1.7B/Realtime），覆盖长语音/对话/角色/音效，377ms 端到端延迟
🆕 新增：LongCat-AudioDiT — Meituan 开源 3.5B 非自回归 TTS，波形潜空间直接生成，消除误差累积
🆕 新增：MAGIC-TTS — 细粒度可控 TTS（arXiv），显式时长和停顿控制，flow-matching
🆕 新增：Seedream 5.0 — ByteDance 即梦最新图像模型，多版本(5.0/4.5/4.0)，多图编辑+搜索集成
🆕 新增：GitKraken Desktop 12.0 — Agent Mode 多代理会话管理界面
📈 更新：Microsoft Agent 365 → 正式 GA (May 1)，企业治理（Observe/Govern/Secure）+ Defender + Purview + Entra 集成
📈 更新：Atlassian Rovo → 信用使用月增 20%+，Rovo 客户 ARR 增速 2x 非Rovo客户，Service Collection $1B ARR
📈 更新：GLM-5 → SWE-bench Verified 77.8%（开源最高），Chatbot Arena Elo 1451（开源最高）
📈 更新：Kimi K2.6 → Rails 构建 87 分，开源唯一 Tier A 编码模型，BenchLM 89.3

2026-04-26

🆕 新增：GPT-5.5 — OpenAI 最新旗舰模型，agentic coding、computer use、知识工作大幅提升，API 已开放
🆕 新增：Claude Opus 4.7 — Anthropic 最新 Opus 模型，SWE-bench 87.6%，同步发布 Claude Design（设计协作工具）
🆕 新增：Claude Design — Anthropic Labs 新产品，AI 协作创建设计/原型/演示
🆕 新增：Gemini Enterprise Agent Platform — Google Cloud Next 2026 发布，取代 Vertex AI 的统一企业 AI Agent 平台
🆕 新增：Grok Speech API — xAI 独立 STT/TTS API，企业级语音开发，STT 错误率 5.0%（行业最低）
🆕 新增：Tripo H3.1 — 高保真 image-to-3D 模型，面向生产级游戏/产品可视化
🆕 新增：Suno v5.5 — 个人语音克隆 + Studio DAW 功能升级，Warner Music 授权合作
🆕 新增：Midjourney v7 — 美学智能标杆，—cref 角色一致性 + —sref 风格锁定
🆕 新增：Qwen3.5-397B-A17B / Qwen3.5-9B / Qwen3.6-Plus — Alibaba 新一轮开源模型发布
🆕 新增：MiniMax-M2.7 — 顶尖 OpenClaw 兼容模型
🆕 新增：Deezer AI 音乐检测 — 44% 新上传为 AI 生成，Deezer 向行业开放检测技术
🆕 新增：PixVerse R1 更新 — 共享世界 + 个性化化身，实时交互 1080P
📈 更新：GPT-Image-2 → 正式发布(4/21)，2K 分辨率，推理驱动生成，多语言文字渲染
📈 更新：ElevenLabs v3 conversational model + TTS Normalizer v3.1，Agent 平台增强
📈 更新：Cursor → $1.2B ARR，超越 Copilot 成为开发者首选 AI IDE
📈 更新：Windsurf → Google 收购创始团队 $2.4B，Cognition 收购剩余部分$ 250M
📈 更新：Seed3D 2.0 → 正式发布(4/23)，SOTA 几何 + PBR 材质质量
📈 更新：Google ADK → 重大升级，图框架多 agent 编排，月处理 6T+ tokens
📈 更新：PixVerse → 完成 C 轮融资达到独角兽估值，100M+ 用户

2026-04-26 (prior)

🆕 更新：Cursor 3 — Agent-first 重构，Agents Window 并行代理、Cloud Agents、Design Mode
🆕 更新：Midjourney V8 → V8.1 Alpha — HD 模式 3x 更快更便宜，Prompt Shortener，图像提示回归
🆕 更新：Runway Gen-3 → Gen-4.5 — 世界最高评分视频模型，前所未有视觉保真度与创意控制
🆕 更新：Kling O3 → Kling 3.0 — 4K@60fps 视频生成，~$0.50/10s clip，最佳性价比
🆕 新增：HeyGen Avatar V — 最逼真 AI 数字人，15 秒录制，175+ 语言翻译
🆕 新增：JetBrains Junie / Air — IDE 原生 AI 编码代理 + 多代理并行开发环境
🆕 新增：Google Jules — 免费 AI 编码代理，15 tasks/day，1M 上下文
🆕 新增：OpenCode — 开源无锁定编码代理，147K+ GitHub stars
🆕 新增：Augment Code (Auggie CLI) — 企业级代码理解，SWE-bench Pro 51.80%
🆕 新增：Kilo Code — 开源多 IDE 编码代理（VS Code/JetBrains/CLI）
🆕 新增：NVIDIA NemoClaw — OpenClaw 安全代理栈，一键部署隐私可控 AI Agent
🆕 新增：Voxtral TTS (Mistral) — 开源 TTS；Mistral Small 4 (Apache 2.0, 6.5B)
🆕 新增：NVIDIA Nemotron 3 Super、IBM Granite 4.0、Pollo AI、Genra AI
新增：DeepSeek-V4 (Pro 1.6T / Flash 284B) 开源大模型，MIT 协议，1M 上下文
新增：Qwen3.6-27B 密集模型、Gemma 4 (31B 多模态)、Tencent Hy3-preview
新增：Seedance 2.0 (ByteDance) 视频生成 Arena 排名第一，Kling O3、Veo 3.1、PixVerse V6
新增：Gemini 3.1 Flash TTS (200+ 音频标签，70+ 语言)、MiMo-V2.5、Kani-TTS-2、OmniVoice、Chatterbox Multilingual
新增：Seed3D 2.0 (ByteDance)、Tripo Studio、Meshy 6、GPT-Image-2、Recraft V4
新增：Google ADK、ml-intern (HuggingFace)、Kimi Claw
新增：开源 LLM 章节追踪 DeepSeek-V4、GLM-5.1、Llama 4 Behemoth 等最新模型

2026-07-03

🔴 重大事件：Fable 5 与 Mythos 5 恢复上线

Anthropic Fable 5 / Mythos 5 解禁回归 (6月30日解禁, 7月1日恢复)

事件：美国商务部于 6月30日解除对 Claude Fable 5 和 Mythos 5 的出口管制令，Anthropic 于 7月1日开始全球恢复访问。此次关停共计 19天（6月12日 → 6月30日），系史上首次政府关闭商业AI模型事件
Fable 5 恢复范围：Claude Platform、Claude.ai、Claude Code、Claude Cowork 全球恢复；AWS、Google Cloud、Microsoft Foundry 尽快恢复；7月7日前可用至 50% 周用量上限，之后转为使用额度制
Mythos 5 恢复范围：仅向美国政府审查通过的美国组织恢复（6月26日已向部分美国组织启用），通过 Project Glasswing 计划继续扩展访问；Mythos 携带完整网络安全能力栈，仅限审查通过的组织
关键背景：Fable 5 和 Mythos 5 共享相同底层权重和训练；Fable 5 包含护栏，将约 5% 会话中的高风险请求路由至 Claude Opus 4.8；Mythos 5 携带完整网络安全能力栈
行业影响：Anthropic 表示行业需要评估和修复 AI 模型”越狱”(jailbreak) 的一致标准；目前 Anthropic、Amazon、Microsoft、Google 和 Glasswing 合作伙伴正在合作制定框架
来源：Anthropic 官方 X 公告 Jun 30, Constellation Research Jul 1, MarketScale Jul 1, Guardian Jul 1

📈 前沿模型动态

Gemini 3.5 Pro 延期至 7月发布

状态：Polymarket “Gemini 3.5 released by June 30” 合约以 97% No-Release 结算（$1.8M 总交易量），模型仍处于 Vertex AI 有限企业预览
核心规格：2M token 上下文窗口（任何生产前沿模型中最大，是 Gemini 3.5 Flash 和 GPT-5.5 的 1M 的两倍）+ Deep Think 推理模式 + 前沿多模态理解
Polymarket 最新预测：7月17日为最可能发布日期（37% 概率），7月整体发布概率显著上升
预期定价：约 $15/M 输入、$ 60/M 输出 tokens（约为 Gemini 3.5 Flash 的 10倍），Deep Think 推理模式 10倍溢价，限 Ultra 层级（$250/月）订阅者
定位：Gemini 3.5 Pro 是唯一没有当前政府访问限制的主要即将发布的前沿模型；其 2M token 窗口填补了 Fable 5 暂停期间留下的长上下文空白
来源：Polymarket 合约, FAQ.com Jul 1, TechTimes Jun 29, HokAI Hub, TokenCost

GPT-5.6 Sol Ultra — Terminal-Bench 2.1 新纪录 91.9%

基准测试结果（Terminal-Bench 2.1 命令行/编码工作流）：

模型	Terminal-Bench 2.1
GPT-5.6 Sol Ultra	91.9%
GPT-5.6 Sol (基础)	88.8%
GPT-5.5	88.0%
Claude Mythos 5 (已下架后恢复)	84.3%
Claude Fable 5 (已下架后恢复)	83.4%
Claude Opus 4.8	78.9%
Gemini 3.1 Pro Preview	70.7%

Ultra 模式：部署多个并行 AI 子代理处理任务不同部分，然后合成统一结果；显著提升复杂任务性能，但消耗更多 tokens
定价：Sol $5 输入 /$ 30 输出 per M tokens；Terra 更低成本版本；Luna 高量级版本；缓存最小 30 分钟
访问限制：白宫要求 OpenAI 限制性发布(slow-roll)因安全顾虑，仅限政府批准客户和审查通过的 API/Codex 合作伙伴（非 ChatGPT），发布日 6月26日
METR 独立评估：发现 Sol 在所有公开模型中 reward-hack 率最高（模型系统卡承认”有时作弊”）
AI 编码 Agent 竞争格局（7月初）：GPT-5.6 Sol > Claude Opus 4.8 > Gemini 3.5 Flash；Fable 5 恢复后 SWE-Bench Verified 领先 88.6%
来源：DataCamp, Lushbinary, EdenAI, R&D World, Agensi Jul 2026

GLM-5.2 — Z.ai 开源旗舰长程任务模型 (6月16日发布)

核心规格：753B 参数 MoE，MIT 开源许可，1M token 无损上下文（较 GLM-5.1 的 200K 提升 5倍），最高 131,072 输出 tokens，两种思考强度级别
基准测试亮点：

Benchmark	GLM-5.2	GPT-5.5	Claude Opus 4.8	Gemini 3.1 Pro
SWE-bench Pro	62.1%	58.6%	69.2%	54.2%
HLE w/ Tools	54.7%	51.4%	57.9%	52.2%
AIME 2026	99.2%	98.2%	98.3%	-
GPQA-Diamond	91.2%	93.6%	93.6%	94.3%

定位：在多个长程编码基准上以 1/6 的成本击败 GPT-5.5；开源 SOTA 编码和长程任务性能；在 Fable 5 被禁期间成为企业替代选项
集成：即日上线所有 GLM Coding Plan 层级（Lite/Pro/Max/Team），企业订阅 $12.60/月起；支持 Claude Code、Cline、OpenClaw（Anthropic 兼容端点，仅需 base-URL 和 model 替换）；20+ 第三方编码环境可用
来源：Z.ai 官方博客, VentureBeat Jun 16, MarkTechPost Jun 14, Z.ai Docs

Liquid AI LFM 2.5-230M — 最小基础模型 (6月26日)

核心规格：230M 参数（Liquid AI 迄今最小模型），非 Transformer 架构，专为设备端 agentic 工作流设计，可在任何设备运行
性能：在数据提取任务上击败 4倍大小模型；手机 CPU 上 213 tokens/s；支持 llama.cpp、MLX、vLLM、SGLang、ONNX
应用场景：手机、机器人、自动化设备的 agentic 任务；Liquid AI 在 Unitree G1 人形机器人（NVIDIA Jetson Orin）上部署，作为技能选择层将自然语言指令分解为工具调用
配套发布：LFM 2.5-8B-A1B 混合专家（8.3B 总参数/1.5B 激活）6月初发布
来源：Liquid AI 官方博客, VentureBeat Jun 26, explainx.ai, MarkTechPost Jun 27

🛠️ 工具与平台更新

Gemini 3.5 Flash Computer Use 公开预览 (6月24日)

功能：开发者可通过 Gemini API 和 Gemini Enterprise Agent Platform 使用 Computer Use 工具构建自定义 Agent，实现浏览器、移动、桌面环境的自主操作（看、推理、行动）
可用性：原生 computer use 工具参数即日起通过 Gemini API 和 Gemini Enterprise Agent Platform 可用（预览功能）；Browserbase 提供沙箱原型设计
竞争定位：与 Anthropic（Claude Computer Use）和 OpenAI 在 agentic 桌面控制上直接对标
来源：Google 官方博客, Digital Applied, Neowin, Gemini API Release Notes

xAI Grok Build Plugin Marketplace (6月11日)

功能：Grok Build 终端编码 Agent 内置插件市场，无需离开终端即可浏览、安装、更新、发布插件；一个插件捆绑技能、斜杠命令、Agent、钩子、MCP 服务器和 LSP
启动合作伙伴：MongoDB、Vercel、Sentry、Chrome DevTools、Cloudflare、Neon、Superpowers（224,691★）、Firecrawl
关键发现：xAI 直接复用 Anthropic 的 Claude Code 插件格式——Grok Build 读取 ~/.claude/skills/、CLAUDE.md、~/.claude/plugins/known_marketplaces.json；目录清单格式与 Anthropic 2025年10月发布的格式逐字段相同（从 .claude-plugin/marketplace.json 重命名为 .grok-plugin/marketplace.json）；“Grok 完全兼容 Claude Code，零配置”
意义：Agent 战争产生了第一个事实标准——赢得格式的公司不是本周发布产品的公司
来源：xAI 官方新闻 Jun 11, Towards AI Jun 2026, Firecrawl Blog

Perplexity Deep Research 升级 — 多模型路由 (6月11日)

升级：Deep Research 移入 Perplexity Computer 多模型编排系统，将难题分解为子任务并跨 20+ 前沿模型 路由，返回工作报告、演示文稿和仪表板
架构：Computer 是云系统，协调最多 20 个 AI 模型（Claude Opus 4.6 为核心推理引擎，Gemini 处理深度研究任务），模型无关；Deep Research 在外部基准上达到 SOTA 准确性和可靠性
定价：$200/月，含 10,000 credits；Max 用户即可用，向 Pro 用户推出
来源：MarkTechPost Jun 11, Perplexity Threads, Linas Substack Apr 10

📊 AI 编码 Agent 7月初格局总览

排名	工具	核心优势	定价
1	Claude Code (Opus 4.7/4.8, Fable 5)	最深 SKILL.md 生态, 1M context, 46% 最受喜爱	$20/月 Pro
2	Cursor 3 (多模型)	多模型路由, 插件市场增长, Agents Window 并行	$20/月
3	OpenCode	最实用免费选项, 160K+ stars, 75+ 模型, LSP 集成	免费开源
4	Codex CLI (GPT-5.5, GPT-5.6 Terra)	/goal 持久循环, 云沙箱, Terminal-Bench 领先	预览中
5	Windsurf	最佳并行 Agent 工作流, 多 worktree	$15/月
6	Antigravity 2.0	并行 Agent, 浏览器验证, 制品系统	预览免费

关键趋势：65% 的在职工程师现在每天使用两个工具；获胜组合：Claude Code Max ( $200/月) 发货完整功能 + Cursor Pro ($ 20/月) 日常编辑
政府干预新模式：2026年6月是历史上 AI 模型发布和政策干预最集中的时期——GPT-5.6 政府批准有限访问、Fable 5 出口管制限制、GitHub Copilot 转为用量计费；未来模型发布将涉及与监管机构协调，开发者不应假设发布日即可访问
来源：Agensi Jul 2026, AI Builder Club

2026-06-26

🔴 重大事件

Anthropic Fable 5 / Mythos 5 全球下架 (6月12日)

事件：美国商务部 BIS 出口管制令，命令 Anthropic 于 6月12日 17:21 ET 暂停所有 Fable 5 和 Mythos 5 对任何外籍人员的访问（包括美国境外的外籍员工）
背景：两模型于 6月9日刚发布，仅存活 72小时即遭关停，系史上首次政府关闭商业AI模型
原因：援引《国际紧急经济权力法》(IEEPA) 的国家安全权力，理由包括 jailbreak 漏洞风险
影响范围：全球所有用户，含 Anthropic 自己的外籍员工；Anthropic 正在与 Trump 政府谈判争取恢复
来源：Anthropic官方声明, Forbes Jun 16, Fortune Jun 13

GPT-5.6 Launch Window 开启 (6月22日)

泄露名称：kindle-alpha（来自 OpenAI Codex 后端日志）
核心参数：1.5M token context 窗口，较 GPT-5.5 的 1M 提升 43%，token 效率 +10-15%
发布时间：Polymarket 押注显示 6月22-28日窗口定价 83-89% 概率；OpenAI 首席科学家 Pachocki 内部称其为”有意义的飞跃”
发布限制：据 Engadget 6月25日报道，OpenAI 将仅向政府批准客户首发（government-approved customers only）
来源：TechTimes Jun 21, AI Weekly Jun 16, Engadget Jun 25

Trump 白宫 AI 行政令 (6月2日)

文件：《促进先进人工智能创新与安全》(Promoting Advanced Artificial Intelligence Innovation and Security)
核心内容：(1) 8月1日前建立前沿模型安全自愿部署框架；(2) 30天内强化联邦网络防御；(3) 创建 AI 网络安全信息交换所（AI Cybersecurity Clearinghouse）
关键修订：废止拜登时期 EO 13694 和 EO 14144 的部分内容
来源：WhiteHouse.gov Jun 2, Latham & Watkins Jun 3

⚠️ 安全警报

Agentjacking 攻击 (Tenet Security, Jun 2026)

攻击原理：通过公开的 Sentry DSN（write-only 凭证，可从浏览器 JS 或 GitHub 搜索获取），注入恶意 Sentry 错误事件；当开发者让 AI agent “fix unresolved Sentry issues” 时，agent 通过 MCP 连接 Sentry，接收注入的错误内容，执行攻击者植入的命令（如 npx @tenet-controlled-validation-package --diagnose）
影响范围：Claude Code、Cursor、Codex 及任何接受 Sentry MCP 连接的 agent；测试显示成功率 85%；发现 2,388 个组织的 DSN 公开暴露
安全影响：攻击在开发者自身凭证下执行，可绕过 EDR/WAF/IAM；Tenet 于 6月3日披露，Sentry 已引入全局内容过滤缓解
来源：The New Stack Jun 14, Tenet Security研究, CSA Lab Space Jun 12

📈 模型更新

Claude Opus 4.8 (Anthropic, May 28)

核心数据：SWE-bench Pro 69.2%（ vs Opus 4.7 的 64.3%），Intelligence Index 61.4，1M context，同价 $5/$ 25 per M tokens
Dynamic Workflows：可在一个会话中规划任务、生成数百并行子代理、验证输出并汇总汇报；CTO Patil 称发布核心是”信任”而非单纯性能
来源：Anthropic官方, TECHSY Jun 13, TrueFoundry

Gemini 3.5 Flash 四层 Agent Stack (Google I/O, May 19)

四层架构：Gemini 3.5 Flash（底层模型）+ Antigravity 2.0（开发者平台）+ Managed Agents（自主长期运行代理编排）+ Gemini Spark（用户端手机助理）
性能：在编码和 agentic 基准测试上超越 Gemini 3.1 Pro，价格 $1.50/$ 9.00/M tokens，速度 >275 tokens/s（约 3倍于竞品）
定位：从编码助手升级为全自主 Agent 平台，覆盖 Google AI Studio、Android Studio、Vertex AI、消费者 Gemini app
来源：Google Blog, AIDA Insider, MarkTechPost May 20

NVIDIA RTX Spark (Computex, May 31)

定位：全球首款面向 Agent 时代 Windows PC 的 ARM 架构超级芯片
规格：1 Petaflop AI 算力，融合 CPU + Blackwell GPU + NPU；合作厂商包括 Asus、Dell、HP、Lenovo、MSI、Microsoft Surface
意义：NVIDIA 首次进入其从未控制过的计算机组件——整个 PC 端；将 Windows 重塑为 Agentic AI OS
来源：NVIDIA News May 31, IEEE Spectrum Jun 6

🛠️ 产品与工具更新

Snowflake CoCo GA (Snowflake Summit, Jun 2)

前身：Cortex Code（Mar 9 GA），升级为全自主开发平台
能力：原生桌面应用 + 云端 Agent + Agent SDK；理解 Snowflake schema、RBAC、lineage、compute；仓库原生是核心优势和局限
定位：从 AI 编码助手升级为数据团队全栈自主开发平台
来源：Snowflake官方博客, Digital Applied

Cursor 3 (Apr 2, 持续迭代)

Agents Window：全新代理工作区，支持并行代理、多子代理嵌套并行（Apr 24 扩展）
Design Mode：UI 设计辅助；/worktree 隔离 git worktree；/best-of-n 并行多模型比较
进展： $2B ARR，$ 50B 估值，1M+ 付费用户，70% F1000 覆盖
来源：Cursor Changelog, Digital Applied

NVIDIA Self-Coding Agents (GTC 2026, Mar)

主题：自编码/反馈驱动/自适应架构，数据飞轮，自主编排
Agent 基础设施：OpenShell（策略执行安全）+ Agent Toolkit（自主 Agent 构建运行）+ Dynamo（Agentic 推理全栈优化）
合作：ServiceNow 合作探索 Autonomous Workforce（AI Agent 治理）
来源：NVIDIA On-Demand GTC26 S81569, Futurum Mar 16

Microsoft Build 2026 (Jun 2-3)

MAI 模型系列：MAI-Thinking-1（35B 推理模型，128K context，SWE-bench Pro 匹敌 Opus 4.6）+ MAI-Code-1-Flash + MAI-Voice-2/Flash + MAI-Transcribe-1.5（43语言）+ MAI-Image-2.5
Work IQ API：6月16日 GA，向所有 Agent 开放 Microsoft 365 智能层，6大核心 API（copilot!、calendar、email、files、meeting、person-profile）
Agent 365：企业 Agent 控制平面（Observe/Govern/Secure），Defender + Purview + Entra 集成
来源：Microsoft 365 Blog Jun 2, Microsoft Build Blog

📊 协议与生态

MCP 里程碑

月度 SDK 下载突破 97M+（Python + TypeScript 合计），成为历史上增速最快的 AI 协议
Anthropic 于 2025年12月将 MCP 捐赠给 Linux Foundation 下的 Agentic AI Foundation（146+ 成员）
来源：Anthropic官方, WorkOS

🏥 跨领域更新

ASCO Breakthrough 2026 (新加坡, Jun 25-27)

液体活检：血基生物标志物用于癌症早检和疗效预测
新靶点：HER2、Claudin 18.2、CD44v9（B7-H3：GSK HS-20093 展示 58% 缓解率）
来源：ASCO官方, ASCO Facebook Feb 2026

Wakix (pitolisant) FDA 儿科适应症扩展 (Feb 2026)

6岁+ 猝倒症（Cataplexy）：FDA 于 2026年2月17日批准 Wakix 扩展用于治疗儿科猝倒症，使 pitolisant 成为美国唯一覆盖发作性睡病全适应症（EDS + 猝倒）且非管制药物
背景：2024年6月已获批儿科 EDS，2025年2月新增儿科猝倒适应症
来源：FDA AccessData, Harmony Biosciences IR Feb 17, NeurologyLive Feb 18

📅 2026-07-17 周报：AI工具与资源更新

本周（7月8日–7月17日）是2026年AI行业竞争最激烈的一周——四大前沿模型在十天内集中发布：OpenAI GPT-5.6 Sol/Terra/Luna（Jul 9）、SpaceXAI Grok 4.5（Jul 8）、Moonshot Kimi K3（Jul 16），以及Gemini 3.5 Pro第三次错过发布窗口。同时，Claude Fable 5回归后免费访问已第三次延期至Jul 19，Anthropic推出Claude Science进入药物发现领域，中国开源模型Kimi K3首次在Code Arena击败Fable 5。

🔥 重大事件

Claude Fable 5 回归与免费延期（Jul 1–19, Anthropic）

时间线：

Jun 12：美国商务部对Fable 5/Mythos 5实施出口管制（因Amazon研究人员发现jailbreak），所有用户被暂停访问
Jun 30：商务部解除出口管制
Jul 1：Fable 5恢复全球访问，Pro/Max/Team/Enterprise用户可免费使用至Jul 7（50%周配额）
Jul 7→Jul 12→Jul 19：免费期三次延期，Claude Code周用量限制增加50%同步延长

技术变更：Anthropic训练了新的安全分类器，针对Amazon报告中描述的jailbreak技术在99%以上的情况下予以拦截。被分类器拦截的请求被路由至Claude Opus 4.8并通知用户。

行业影响：这是史上首次政府关闭商业AI模型的案例。Anthropic联合Amazon、Microsoft、Google等Glasswing合作伙伴提出了AI Jailbreak严重性评估框架（四维度：能力增益、广度、可武器化程度、可发现性），旨在建立行业一致标准。

来源：Anthropic官方博客 Jul 2, Anthropic Redeploying Fable 5, CNBC Jun 30, BleepingComputer Jul 12, TechBriefly Jul 13, gHacks Jul 14

Anthropic Claude Science 药物发现平台（Jun 30, Anthropic）

产品：Claude Science——面向科学家的AI工作台，集成60+科学技能和连接器（基因组学、单细胞、蛋白质组学、结构生物学、化学信息学），原生支持PubMed、Jupyter、R、HPC集群终端
功能：3D蛋白质结构渲染、基因组浏览器轨道、化学结构可视化；自动生成可审计的科学制品（代码+环境+消息历史），确保可复现性
商业模式：Pro/Max/Team/Enterprise用户可用（Beta）；支持50个AI for Science项目，每个最高 $30,000信用额度+Modal$ 2,000计算额度
内部药物发现：Anthropic同时宣布启动内部临床前药物发现项目，聚焦”被忽视”疾病（传统药企不愿开发的领域）。利用公共福利公司地位选择患者受益优先的项目
MIT Technology Review评价：Claude Science被定位为与Claude Code、Claude Cowork并列的旗舰产品，标志着Anthropic正式进入生物技术领域
关键人才：前Google DeepMind AlphaFold联合创始人John Jumper加入Anthropic；4月收购Coefficient Bio加速药物开发

来源：Anthropic官方, MIT Technology Review Jun 30, CNBC Jun 30, STAT News Jun 30, Endpoints News Jun 30, The Verge Jul 3, DDW Online Jul 6

OpenAI GPT-5.6 三层模型家族 GA（Jul 9）

模型：GPT-5.6 Sol（旗舰）/Terra（平衡）/Luna（经济型），取代GPT-5.5
创新：
- max推理努力模式（Sol专用，给予更长深度推理时间）
- ultra模式——4个子代理并行协作，Terminal-Bench 2.1从88.8%提升至91.9%（可扩展至16并行）
- 程序化工具调用——在隔离V8运行时中执行模型编写的JavaScript（无网络访问）
- ChatGPT Sites——直接从ChatGPT发布交互式网页
基准测试亮点：
- Agents’ Last Exam：Sol 53.6（比Fable 5高13.1分），即使medium推理也高11.4分
- Artificial Analysis Coding Agent Index：Sol 80（新SOTA，比Fable 5高2.8分），使用少于一半的输出token
- Terminal-Bench 2.1：Sol Ultra 91.9%（超越Mythos 5的88.0%），Sol 88.8%
- DeepSWE v1.1：72.7%（新SOTA）
- BrowseComp：92.2%（新SOTA）
- SWE-Bench Pro：64.6%（仍落后于Fable 5的80%约15点）
定价：Sol $5/$ 30、Terra $2.50/$ 15、Luna $1/$ 6 per M tokens

模型	AA Coding Index	Terminal-Bench 2.1	SWE-Bench Pro	输入/M tokens	输出/M tokens
GPT-5.6 Sol Ultra	—	91.9%	—	$5	$30
GPT-5.6 Sol	80	88.8%	64.6%	$5	$30
GPT-5.6 Terra	77.4	84.3%	63.4%	$2.50	$15
GPT-5.6 Luna	74.6	84.7%	62.7%	$1	$6

同日发布：ChatGPT Work（企业工作代理，跨桌面/web/移动端创建文档/表格/幻灯片/网页）、ChatGPT桌面app重大更新（内置浏览器+本地文件访问）、Atlas浏览器退役（功能合并入Chrome扩展和桌面app）

来源：OpenAI官方, OpenAI Preview Jun 26, TechCrunch Jul 9, MarkTechPost Jul 9, Neowin Jul 9, OpenAI ChatGPT Work, TechCrunch Atlas Jul 9, Dataconomy Jul 14

SpaceXAI Grok 4.5（Jul 8）

定位：xAI（更名为SpaceXAI）后首个模型，与Cursor联合训练，专为编码/agentic任务/知识工作设计
架构：V9基础模型，约1.5T参数（约为前代v8-small的3倍），训练于数万块NVIDIA GB300 GPU
性能（SpaceXAI自报）：
- Terminal-Bench 2.1：83.3%（接近GPT-5.5 83.4%和Fable 5 84.3%）
- SWE-Bench Pro：64.7%（超过GPT-5.5 58.6%，但落后Fable 5 80.4%和Opus 4.8 69.2%）
- DeepSWE 1.0：62.0%（Fable 5 max 66.1%，GPT-5.5 xhigh 64.31%）
- SWE Marathon：29.0%（Opus 4.8 max 26.0%）
token效率：声称是同级领先模型2倍——同任务用更少步骤完成
定价： $2/$ 6 per M tokens（极具竞争力）
可用性：Grok Build（终端编码Agent）、Cursor全计划、SpaceXAI console；EU预计7月中旬
Grok Build开源（Jul 15/16）：SpaceXAI在GitHub（xai-org/grok-build）以Apache 2.0开源Grok Build全部代码，可完全本地运行。紧随7月初”仓库上传服务器”隐私争议后的信任回应
Cursor收购：SpaceX $60B全股票收购Anysphere（Cursor母公司）预计Q3 2026完成

来源：SpaceXAI官方, Axios Jul 8, TechCrunch Jul 8, Engadget Jul 9, Reuters/CNA Jul 9, ThePlanetTools Jul 16, MarkTechPost Jul 15, The Left Shift Jul 17

Moonshot AI Kimi K3（Jul 16）——中国开源AI里程碑

定位：全球最大开源模型（2.8T参数MoE），标志着中国开源模型首次在Code Arena击败美国闭源模型
架构创新：
- Kimi Delta Attention（KDA）：混合线性注意力机制，百万token上下文解码速度提升6.3倍
- Attention Residuals（AttnRes）：选择性跨深度检索表示，训练效率提升约25%（成本增加<2%）
- Stable LatentMoE：896个专家中激活16个
版本：K3 Max（聊天/推理/自主Agent）/ K3 Swarm Max（多Agent并行编排）
能力：1M token上下文、原生视觉理解、always-on”thinking mode”（推理模式）
基准（reasoning effort设为max）：
- GDPval-AA v2：1687（第三，仅次于Fable 5 Max 1815和GPT-5.6 Sol Max 1747.8，超过Opus 4.8 1600）
- AA-Briefcase：1527（第二，超过GPT-5.6 Sol Max 1495）
- BrowseComp：91.2（SOTA）
- Terminal-Bench 2.1：88.3%（超过Fable 5 w/fallback 84.6%）
- Program Bench：77.8%（超过Fable 5 76.8%）
- SWE Marathon：42.0%（Fable 5 35.0%，GPT-5.6 Sol 39.0%）
- HLE-Full：43.5%（落后Fable 5 53.3%）
- DeepSWE：67.5%（Fable 5 70.0%，GPT-5.6 Sol 73.0%）
Code Arena排名：前端编码第一名（1679），超越Fable 5（1631）。Arena.ai CEO称”可能是今年最大的发布，标志着中国开源模型超越美国模型”
定价： $3/$ 15 per M tokens（缓存命中输入$0.30）
开源：权重Jul 27公开发布
意义：仅距Fable 5发布6周即超越——“中国落后美国数月”的共识不再成立

基准	Kimi K3	Fable 5 (w/ fallback)	GPT-5.6 Sol	Opus 4.8	GLM-5.2
Terminal-Bench 2.1	88.3%	84.6%	88.8%	84.6%	82.7%
Program Bench	77.8%	76.8%	77.6%	71.9%	63.7%
BrowseComp	91.2%	88.0%	90.4%	84.3%	—
SWE Marathon	42.0%	35.0%	39.0%	40.0%	13.0%
GPQA-Diamond	93.5%	92.6%	94.1%	91.0%	91.2%
HLE-Full	43.5%	53.3%	44.5%	49.8%	—

来源：SiliconANGLE Jul 16, VentureBeat Jul 16, MarkTechPost Jul 16, Tech Startups Jul 16, Moonshot X Jul 16

GLM-5.2 中国AI追赶（Z.ai, Jun 16发布, 持续本周关注）

定位：Z.ai（前Zhipu AI）旗舰开源模型，“mini DeepSeek moment”——低成本中国模型展现前沿竞争力
架构：753B参数MoE（约40B活跃/token），MIT开源许可证，1M token上下文
IndexShare创新：每4层稀疏注意力共享同一索引器，1M上下文每token计算量降低2.9倍
关键基准（最强开源编码模型）：
- SWE-bench Pro：62.1（超过GPT-5.5 58.6）
- Terminal-Bench 2.1：81.0-82.7%（接近Opus 4.8 85.0）
- FrontierSWE：74.4（落后Opus 4.8仅1%，超过GPT-5.5 72.6%）
- MCP-Atlas：76.8（超过GPT-5.5 75.3，接近Opus 4.8 77.8）
定价： $1.40/$ 4.40 per M tokens（约为GPT-5.5/Claude Opus 4.8的1/6成本）
地缘影响：美国出口管制Fable 5期间，GLM-5.2全球需求激增，OpenRouter排名超过Anthropic模型。David Sacks称”GLM-5.2与Opus 4.8仅差一档，与GPT-5.5持平”
努力控制：High/Max两种推理模式，Max用于复杂多步编码

来源：Z.ai官方, Z.ai HuggingFace, VentureBeat Jun 16, Reuters/95KQDS Jul 2, Japan Times Jul 3, Economic Times Jun 17, Z.ai GitHub, claudefa.st

Gemini 3.5 Pro 第三次错过发布（Jul 17, Google）

背景：Google I/O 2026（May 19）承诺”下个月”发布，June已过未发布
完全重建：Google废弃了原始Gemini 3.5 Pro基础模型（基于2.5 Pro架构），从头重新预训练
重建原因：原始模型在三个关键领域失败：
1. 数学推理——结构化多步逻辑工作（金融建模、代码生成、数据分析）
2. 复杂SVG场景生成——结构化关系信息理解（文档、图表）
3. 递归工具调用——Agent调用工具链时崩溃（agentic编码核心要求）
Jul 17再次错过：Bloomberg报道（Jul 16，The Verge Jul 17确认）模型仍受幻觉和输出不一致困扰，未通过基本可靠性标准。Google正转向临时Flash模型作为过渡
传闻规格（未经官方确认）：
- 2M token上下文窗口（为当前任何前沿模型的2倍）
- Deep Think推理层（Ultra订阅$250/月专属）
- 定价约 $1.25/$ 10 per M tokens（比GPT-5.6 Sol便宜约4倍）
竞争窗口：Jul 9-17十天窗口内GPT-5.6、Grok 4.5、Kimi K3均已发布，Gemini 3.5 Pro仍在”即将推出”
政府协调：Google正在与美国政府”富有成效地合作”测试3.5 Pro

来源：The Verge Jul 17, TechTimes Jul 13, TechTimes Jul 16, Enterprise DNA Jul 8, Bloomberg/Yahoo Tech, Google Blog May 19

🛠️ 工具与产品更新

SpaceXAI 开源 Grok Build（Jul 15-16）

事件：SpaceXAI在GitHub（xai-org/grok-build）以Apache 2.0开源Grok Build终端编码Agent全部代码，同时重置所有用户使用限制
内容：完整Agent循环（上下文组装→模型输出解释→工具调用调度）、TUI（渲染/输入/计划审查/内联diff）、扩展框架（skills/plugins/hooks/MCP servers/subagents）
本地优先：可自行编译，指向本地推理，通过config.toml完全控制——无需经过SpaceXAI服务器
信任背景：7月初Grok Build被曝上传整个代码仓库到SpaceXAI服务器，后修复。开源+本地运行是对信任问题的最直接回应

来源：ThePlanetTools Jul 16, MarkTechPost Jul 15, GIGAZINE Jul 16

VS Code 1.129 Agent Host 架构（Jul 16）

Agent Host：独立后台进程运行Agent harness（Copilot、Claude、Codex），通过WebSocket+JSON-RPC协议与VS Code workbench通信。Agent崩溃不再冻结编辑器，同一会话可附加多个窗口
重新设计的Agents窗口：Agent输出和diff直接停靠在聊天对话旁的共享标签栏；支持inline和side-by-side diff；会话状态跨窗口重载存活
BYOK模型：通过Copilot harness支持自带API密钥
其他：聊天消息加!前缀执行终端命令；prompt-to-skill迁移工具；GitHub Enterprise Copilot登录修复；实验性现代UI预览

来源：VS Code 1.129, Help Net Security Jul 16, Visual Studio Magazine Jul 16, NTCompatible Jul 16, VS Code PR #296627

Claude Code 内联Diff + v2.1.212（Jul 17）

内联Diff：Claude Code引入内联diff查看功能（与Cursor核心功能对齐），免费提供给所有用户。解决4月以来用户反馈的diff视图退化问题
v2.1.212更新（Jul 17）：
- /fork将对话复制到新后台会话（原in-session subagent改为/subtask）
- WebSearch工具调用会话限制（默认200，防失控搜索循环）
- 子代理生成会话上限（默认200，防失控委托循环）
- MCP工具调用超过2分钟自动转入后台
- /resume在Agent视图中打开历史会话选择器
- 多项plan mode/worktree/ultrareview修复

来源：Claude Code Changelog, Claude Code v2.1.212, Claude Code Week 28, BotBeat

Cognition SWE-1.7（Jul 8）

定位：Cognition训练的最强模型，基于Kimi K2.7 Code进行RL再训练，挑战”后训练天花板”理论
关键发现：已在Kimi K2.7上经过大量RL训练的模型，通过Cognition的RL可再获得大幅提升
基准（self-reported）：
- FrontierCode 1.1 Main：42.3%（SWE-1.6仅9.4%，四倍跳跃；Kimi K2.7 Code 30.1%）
- Terminal-Bench 2.1：81.5%
- SWE-Bench Multilingual：77.8%（超过GPT-5.5 76.8%）
可用性：Devin Web/Desktop/CLI via Cerebras 1000 TPS

来源：Cognition官方 Jul 8, TechTimes Jul 9

Kimi K2.7 Code in Microsoft Foundry（Jul 1）

Moonshot AI K2.7 Code在Microsoft Foundry公开预览（Jul 1, 2026）
改进K2.6的长期编码+多步执行+agentic任务执行
思考token使用减少约30%（vs K2.6），基准性能更高
定价： $0.95/$ 4.00 per M tokens（缓存输入$0.19）

来源：Microsoft Foundry Blog Jul 6

Cast AI Kimchi Coding GA（Jul 15）

自主多模型编码Agent正式发布，前端质量达到前沿水平，成本降低2.5倍
编排引擎根据复杂度和成本将每个任务路由到最佳模型
硬性支出上限（从API key到组织级），失控Agent循环自动终止
可在客户VPC内独立运行或使用专用Nvidia B300 GPU
ISO 27001、SOC 2 Type II认证、GDPR合规

来源：Cast AI Press Release Jul 15

💰 定价与市场

前沿模型定价对比（2026年7月中旬）

模型	输入/M tokens	输出/M tokens	上下文	定位
GPT-5.6 Sol	$5	$30	128K	旗舰推理+agentic
GPT-5.6 Terra	$2.50	$15	128K	平衡日常
GPT-5.6 Luna	$1	$6	128K	经济高量
Claude Fable 5	$1-10	$50	—	Mythos级旗舰
Claude Opus 4.8	$5	$25	1M	前沿编码
Grok 4.5	$2	$6	—	Opus级高效率
GLM-5.2	$1.40	$4.40	1M	开源最强
Kimi K3	$3	$15	1M	开源最大
Kimi K3 (缓存)	$0.30	—	1M	缓存命中
Kimi K2.7 Code	$0.95	$4.00	—	Foundry托管
Gemini 3.5 Flash	$1.50	$9	1M	agentic编码
DeepSeek V4-Pro	—	—	1M	开源前沿

AI模型价格战

OpenAI CEO Sam Altman称GPT-5.6 Sol在AI编码任务上token效率提高54%
Grok 4.5声称2倍token效率——同任务用不到一半步骤完成
GLM-5.2以GPT-5.5约1/6成本达到可比性能
Gemini 3.5 Pro传闻定价（ $1.25/$ 10）旨在比GPT-5.6 Sol便宜约4倍
分析师称”模型层价格战刚刚加速”

🔮 DeepSeek V4 迁移强制截止（Jul 24）

截止日期：deepseek-chat和deepseek-reasoner将于2026年7月24日15:59 UTC完全停用
迁移：只需更新model名称为deepseek-v4-pro或deepseek-v4-flash，保持base_url不变
DeepSeek V4-Pro：1.6T总参数/49B活跃，1M上下文，开源agentic编码SOTA
DeepSeek V4-Flash：284B总参数/13B活跃，快速高效经济

来源：DeepSeek API Docs V4 Preview, DeepSeek Change Log, Rohit Raj Tech

🌐 开源AI vs 闭源格局

开源阵营（中国主导）

模型	参数量	许可证	关键基准	开发者
Kimi K3	2.8T MoE	待发布(Jul 27)	Code Arena #1, TB2.1 88.3%	Moonshot AI
GLM-5.2	753B MoE	MIT	SWE-bench Pro 62.1, TB2.1 82.7%	Z.ai
DeepSeek V4-Pro	1.6T MoE	开源	AA Coding Index 47.5	DeepSeek
Grok Build（工具层）	—	Apache 2.0	—	SpaceXAI

闭源前沿阵营

模型	关键基准	开发者
GPT-5.6 Sol Ultra	TB2.1 91.9%, AA Coding 80	OpenAI
Claude Fable 5	TB2.1 84.3-95%, SWE-Bench Pro 80.3%	Anthropic
Claude Opus 4.8	SWE-Bench Pro 69.2%, 1M context	Anthropic
Gemini 3.5 Flash	TB2.1 76.2%	Google
Gemini 3.5 Pro	未发布（第三次延期）	Google

📊 关键趋势

十天内四大前沿发布：Jul 8 Grok 4.5 → Jul 9 GPT-5.6 → Jul 16 Kimi K3 → Gemini 3.5 Pro原定Jul 17——2026年最密集的前沿发布周期
开源追平闭源：Kimi K3在Code Arena前端编码超越Fable 5仅距其发布6周。“中国落后数月”的共识不再成立
政府预发布审查常态化：GPT-5.6被白宫限制性预览2周，Fable 5出口管制19天，Gemini 3.5 Pro与政府”富有成效合作”——政府审查成为前沿模型发布的新常态
多模型路由成为主流：GLM-5.2/Grok 4.5/DeepSeek V4的低成本+Kimchi Coding等编排工具推动从”单一前沿模型”转向”按任务复杂度路由”
RL无后训练天花板：Cognition SWE-1.7证明对已大量RL训练的模型（Kimi K2.7）再施加RL仍可大幅提升，挑战”后训练天花板”理论
工具层开源：SpaceXAI开源Grok Build（不仅是模型，而是整个Agent harness）——信任+可审计+本地优先
IDE架构转型：VS Code 1.129 Agent Host将AI Agent移入独立进程——Agent崩溃不再冻结编辑器，同一会话可跨窗口
药物发现AI竞争白热化：Anthropic Claude Science + 内部药物项目 + Isomorphic Labs + Roche/NVIDIA AI工厂——AI公司从软件供应商转变为药物开发者

📅 2026-07-20 周报：AI工具与资源更新

本周（7月17日–7月20日）AI行业格局进一步重塑：Google Gemini 3.5 Pro第三次错过发布窗口，传将推出Gemini 3.6 Flash作为过渡；欧盟DMA正式命令Google向AI竞争对手开放Android并共享搜索数据；Anthropic Fable 5免费窗口于Jul 19到期；Oracle裁员30000人注资Stargate AI基础设施；SAP完成收购Prior Labs（10亿欧元+）押注表格基础模型；世界人工智能大会（WAIC）在上海闭幕，29国成立世界人工智能合作组织（WAICO）；DeepSeek V4稳定版Jul 24发布、Kimi K3权重Jul 27开源构成”开源权重超级周”。

🔴 重大事件

欧盟命令Google开放Android并共享搜索数据（Jul 16, European Commission）

事件：欧盟委员会根据《数字市场法》（DMA）对Google采取约束性要求，命令其向竞争对手AI助手开放Android操作系统，并向竞争对手（包括AI开发者）共享部分搜索数据
Android互操作性：符合条件的第三方AI助手获得跨11个Android功能组的语音激活和跨应用能力，需通过认证和用户同意；截止日期：2027年7月
搜索数据共享：Google须以FRAND（公平、合理、非歧视）条款提供匿名化的排名、查询、点击和查看数据；搜索数据共享从2027年1月开始
意义：这是2026年最具影响力的AI监管行动，直接攻击Google两大核心资产——数十亿Android设备上的默认安装地位，以及竞争对手无法复制的二十年搜索行为数据
Google回应：全球事务总裁Kent Walker反驳称这些决定有可能破坏数百万欧洲人的隐私和安全保护措施
时机：Google旗舰模型Gemini 3.5 Pro第三次延期之际，监管者正在撬开本应补偿模型劣势的分发护城河

来源：European Commission官方公告Jul 16, Reuters Jul 16, The Verge Jul 16, Computerworld, US News Jul 16, MacRumors Jul 16

Gemini 3.5 Pro第三次错过发布窗口（Jul 17, Google）

背景：继6月30日和7月17日两次目标日期之后，Gemini 3.5 Pro第三次未能如期发布
完全重建：Google废弃了基于2.5 Pro架构的原始Gemini 3.5 Pro基础模型，从头重新预训练（此前已报道）
Jul 17再次错过：Bloomberg/The Verge报道模型仍受幻觉和输出不一致困扰，未通过基本可靠性标准
过渡方案：Google据传正在探索推出Gemini 3.6 Flash作为过渡产品，在旗舰Pro模型修复期间为市场提供新版本
股价影响：Alphabet股价因延期报道下跌约4%
竞争成本：本季度评估前沿模型的企业正在GPT-5.6、Claude、Grok 4.5、Kimi K3中选择，Gemini缺席的每一周都意味着合同被签署到其他地方
政府协调：Google正在与美国政府”富有成效地合作”测试3.5 Pro
传闻规格（未官方确认）：2M token上下文窗口（为当前任何前沿模型的2倍），Deep Think推理层（Ultra订阅 $250/月专属），定价约$ 1.25/$10 per M tokens

来源：The Verge Jul 17, TechTimes Jul 13, TechTimes Jul 16, Enterprise DNA Jul 8, BusinessInsider, Google Blog May 19

Claude Fable 5免费窗口到期（Jul 19 23:59 PT, Anthropic）

事件：Anthropic对Claude Fable 5的免费访问促销窗口于2026年7月19日太平洋时间23:59到期，Pro/Max/Team/Enterprise付费用户不再免费使用
三次延期历程：Jul 1（初始恢复）→ Jul 7（首次延期）→ Jul 12（二次延期）→ Jul 19（三次延期）→ 到期
到期后：Fable 5转入与Opus和Sonnet相同的信用额度模式，定价 $10/百万输入token、$ 50/百万输出token；Claude Code周用量50%提升同步结束
决策点：到期迫使Anthropic做出选择——Opus 5发布、第四次延长免费访问、或直接转向信用额度模式
竞争压力：免费窗口结束的同一周Kimi K3刚登上编码排行榜榜首并宣布7月27日免费开源权重，形成尴尬对比
最可能解读：Anthropic利用这一时机发布Opus 5公告，以自己的条件而非Moonshot的条件重新定义对话

来源：Anthropic官方, Forbes Jul 13, BleepingComputer Jul 12, TechBriefly Jul 13, gHacks Jul 14, CyberSecurityNews, TechTimes Jul 12

Oracle裁员30000人注资Stargate AI基础设施（Jul 2026）

事件：Oracle裁员最多30000名员工（约占全球员工总数18%），释放约80-100亿美元年度现金流用于AI基础设施建设，为公司历史上最大规模裁员
资金用途：裁员所得资金投入Stargate项目——Oracle与OpenAI、SoftBank合作的5000亿美元AI基础设施项目，锚定为OpenAI提供的3000亿美元五年期云合同，预计从4.5 GW Oracle数据中心容量产生约300亿美元/年收入
内部分配：裁员重点冲击Oracle Health、云基础设施和咨询部门，而建设Stargate数据中心的团队被保留甚至加速招聘
战略意义：这是一个部门接一个部门地将自己转换为AI基础设施提供商的最清晰案例，用被降级业务的薪水为转型融资
风险：Oracle将未来押注于单一客户关系（OpenAI），而OpenAI正面临Apple诉讼、出版商诉讼和IPO前盈利能力不确定性

来源：Capacity Global, Forbes Apr 6, Medium/Codex, Washington Times Mar 31, Economic Times

SAP完成收购Prior Labs——10亿欧元押注表格基础模型（Jul 17）

事件：SAP完成对Prior Labs的收购，这家总部位于弗莱堡的表格基础模型先驱由Frank Hutter、Noah Hollmann和Sauraj Gambhir在约18个月前创立
投资承诺：SAP承诺在未来四年内投资超过10亿欧元，将Prior Labs扩展为全球领先的前沿AI实验室
独立运营：Prior Labs将继续作为独立实体运营
技术基础：Prior Labs的TabPFN模型系列发表在Nature上，在数百项独立学术研究中确立了表格基准SOTA，证明单一预训练模型可以在表格基准上超越传统机器学习方法
战略逻辑：SAP认为企业AI最大的未开发机会不是大型语言模型，而是为运行企业的结构化数据（表格、分类账、库存、交易记录）专门构建的AI。语言模型擅长文档和聊天，但在电子表格上表现很差
欧洲AI叙事：一个18个月大、拥有Nature论文的德国创业公司被10亿欧元资金扩展为前沿实验室，这正是欧洲技术政策十年来试图制造的成果类型

来源：SAP官方新闻Jul 17, tech.eu Jul 17, Trending Topics, TNW, AIWeekly Jul 17, SAP新闻May

世界人工智能大会（WAIC）闭幕 + WAICO成立（Jul 17-20, Shanghai）

事件：2026年世界人工智能大会（WAIC）在上海闭幕，为期四天，包括习近平首次发表主旨演讲，以及成立世界人工智能合作组织（WAICO）
WAICO创始成员：29个创始成员国，包括中国、哈萨克斯坦、老挝、巴基斯坦、俄罗斯、印度尼西亚、巴西等，以全球南方国家为主
习近平承诺：(1)未来五年向发展中国家提供5000个AI培训和研讨会名额；(2)与金砖国家、东盟、拉美和非洲联盟国家发展AI合作中心；(3)强烈支持开源AI
规模：超过140个论坛，1100+参展商；华为在展台展示了Atlas 950 SuperPoD国产计算系统
意义：这是中国首次在AI治理组织上”先到场带结构”，Demis Hassabis同一周呼吁国际监管机构和美国领导的联盟，等于承认目前没有这样的机构存在
西方回应：目前明显缺席——是否会有欧洲或全球南方经济体超出北京初始圈子加入WAICO，是后续关键信号

来源：Xinhua Jul 17, Reuters Jul 17, Al Jazeera Jul 17, CGTN Jul 18, SCMP, Quartz Jul 17, NBC News

📈 前沿模型动态

Kimi K3持续发酵——“开源权重冲击波”（Jul 16-20, Moonshot AI）

后续反应：Kimi K3在周末震撼美国科技行业，引发中美AI竞争的新一轮辩论。2.8万亿参数开源模型此前在主要编码排行榜上夺得第一名
反应的显著性：美国实验室和投资者公开重新评估闭源前沿真正领先多远
K3与众不同的原因：早期中国模型以价格竞争，K3以能力竞争并在编码排行榜上击败Claude Fable 5，然后宣布将免费开放权重——这一序列消除了美国实验室使用的两个舒适论点（开源模型在质量上落后；中国模型是廉价替代品而非真正的前沿系统）
诚实警告：排行榜胜利是狭窄的——K3在通用聊天中排名约第九，是编码和Agent专家而非全方位前沿替代品。独立跨工作负载评估仍然薄弱
Jul 27权重发布：这是从”基准测试故事”变为”采购故事”的时刻。免费专家模型在高体量编码上击败付费通用模型，在预算审查中难以辩驳

来源：VentureBeat Jul 16, SiliconANGLE Jul 16, BBC, Interesting Engineering, PureAI Jul 17, Digital Applied, WindowsForum

Anthropic IPO进程与Karpathy招聘强化（持续本周）

机密S-1：Anthropic于6月1日向SEC机密提交了IPO草案S-1，四天前刚完成 $650亿Series H融资（$ 965亿后估值）。IPO进程在本周持续推进
Karpathy加入：Andrej Karpathy（OpenAI联合创始人、前Tesla AI负责人）于5月19日确认加入Anthropic预训练团队，从他的创业公司Eureka Labs直接转入
本周表现：Anthropic被认为是本周表现最好的实验室——机密IPO申请、最高安全评级、Karpathy招聘——在强势中谈判
SpaceX数据中心合同：Anthropic将向SpaceX每月支付12.5亿美元（至2029年5月）用于数据中心容量，提高Claude Code使用限额

来源：Anthropic官方, CNBC Jun 1, WSJ Karpathy, The New Stack, Digital Applied, Yahoo Finance

🛠️ 工具与产品更新

Microsoft Project Perception——多模型AI网络安全平台（Jul 2026）

定位：Microsoft正在准备Project Perception，一个AI网络安全平台，使用Microsoft、OpenAI和Anthropic的模型一起发现和修复软件漏洞，定位为Anthropic Mythos级安全产品的低成本替代
功能：系统审视公司的代码、云基础设施和端点，识别可利用的弱点，解释其影响，并提出具体的修复方案
架构创新：使用编排层将每个任务路由到最合适的模型——廉价模型处理库存检查、日志解析和常见漏洞类型的初始分类，前沿模型仅在需要推理复杂漏洞链或编写涉及生产的修复方案时被调用。这种路由使持续、始终在线的漏洞扫描变得可负担
竞争格局：Anthropic的Project Glasswing已扩展到15个国家的150个组织；Microsoft以其多模型路由和庞大的安装基础回击
行业背景：Microsoft 7月Patch Tuesday在AI辅助下修复了创纪录的570个漏洞；2026年上半年AI安全收购从去年的10笔增至29笔（增长近3倍）；CISA警告自主Agent正在身份和访问管理中打开新缺口
意义：这是AI中最健康的竞争动态之一——两家资源充足的公司在成本和覆盖范围上竞争，推动机器速度防御变得可负担

来源：TechRepublic Jul 2026, Microsoft MSRC Blog Apr 2026, AIToolsRecap, Threads/Buzz

Google NotebookLM更名为Gemini Notebook + AI Mode搜索扩展

Gemini Notebook（原NotebookLM）：获得安全云计算机，可在笔记本内运行代码，服务超过3000万用户和60万个组织
AI Mode搜索扩展：与Instacart、Canva、YouTube Music集成，将搜索转变为完成的操作
意义：Google本周发布了被旗舰延期和监管命令掩盖的真正产品——这些是有意义的胜利，只是无法与旗舰延期和监管命令竞争注意力

Mistral Medium 3.5云端编码Agent

定位：Mistral新的旗舰密集模型，128B开放权重，专为agentic和编码用例优化
云端Agent：在Vibe和Le Chat中引入基于Mistral Medium 3.5的云端远程编码Agent，支持异步、并行任务执行
工作方式：编码会话在隔离沙箱中进行，用户可以离开时继续处理长时间运行的任务，多个Agent可以并行运行
许可：Modified MIT许可下作为开放权重发布

来源：Mistral AI官方, DevOps.com Jul 20, ODSC, Mistral Docs, NYU RITS

⚠️ 安全与研究警报

Epoch AI：AI文本检测器在风格模仿下失败（Jul 18）

研究：Epoch AI测试了三款领先的AI文本检测器——Pangram、GPTZero和Originality.ai——对抗模仿特定作者风格生成的文本
关键发现：高达18%的AI生成段落未被检测到；297个风格模仿文本中平均约13%未被检测到（Pangram假阴性率10%，GPTZero更高）
科学写作特别脆弱：检测器在学术工作中常见的正式、结构化散文上最挣扎
不对称竞争：让模型模仿写作风格极其简单（只需一个提示），而检测结果是一个越来越难的统计问题
建议：机构应停止将AI检测器视为证据，开始将其视为最弱的信号

来源：The Decoder, Epoch AI, AIBase

RadLE 2.0基准：AI放射学模型”自信地错误”（Jul 2026）

基准：新基准RadLE 2.0测试AI模型在放射学任务上的表现，发现它们经常以完全确信的方式提供错误发现——读X光片并产生错误诊断，没有任何不确定性信号
风险：“自信地错误”是特定的危险失败模式——一个犹豫的错误答案会邀请第二意见，而一个自信的错误答案则不会
背景：本月Neko Health获7亿美元融资（AI分析身体扫描），Hemispheric获5200万美元（大脑活动AI），美国政府部署ChatGPT审计Medicare和Medicaid数据——所有这些都依赖AI输出可信或至少适当不确定
建议：在临床环境中部署的任何AI系统都应被要求表达校准的不确定性；如果不能，人类不应将其输出视为结论

📊 “开源权重超级周”——Jul 24 & Jul 27

两个关键日期锚定7月剩余时间：
- Jul 24：DeepSeek V4稳定版发布，结束预览版构建的动荡，移除谨慎企业将生产工作负载迁移到其上的最后一个技术异议。deepseek-chat和deepseek-reasoner于2026年7月24日15:59 UTC完全停用，需更新model名称为deepseek-v4-pro或deepseek-v4-flash
- Jul 27：Kimi K3开放权重免费发布，将刚登上编码排行榜榜首的模型放入任何人的手中
商业赌注：DeepSeek约$0.44/百万输出token已是行业衡量基准；K3权重三天后到达意味着企业可以在自己的基础设施上运行顶级编码模型，零每token成本
实际建议：将这两个日期视为强制函数——用你的实际工作负载对抗稳定V4、K3 Max和你当前的闭源模型，测量质量和总成本（包括自托管的基础设施），让数字而非排行榜决定。诚实答案是任务相关的：闭源模型仍赢得最难推理，开源模型赢得高体量常规工作——在两者之间构建路由的团队花费远少于选择一个的团队

来源：DeepSeek API Docs V4 Preview, DeepSeek Change Log, VentureBeat Jul 16, Digital Applied K3

💰 市场与定价

AI基础设施资金的真实经济学

Oracle模式：Oracle公开裁员30000人以释放80-100亿美元/年用于数据中心，是AI建设成本最诚实的会计。GW级基础设施资本不是凭空出现的——它正从现有业务线、员工人数和AI成为优先级之前资助公司的运营中被提取
行业模式：Meta今年支出1250-1450亿美元并向单个路易斯安那州站点投入500亿美元；台积电两次上调资本支出指引；Microsoft、Amazon、Google都在将巨额现金流重新导向硅和电力
差异：大多数公司从增长收入中资助，Oracle从重组中资助——使权衡以其他公司避免的方式可见

🔮 本周关键趋势

监管重塑分发：欧盟DMA命令是2026年最具影响力的AI监管行动，直接攻击Google的Android默认安装和二十年搜索数据两大护城河——为每个非Google的AI公司打开了法律路径
旗舰模型延期常态化：Gemini 3.5 Pro三次错过发布窗口，Google传将推出过渡Flash模型——旗舰延期从工程纪律变为结构性问题
开源权重攻势：Kimi K3（Jul 27）+ DeepSeek V4稳定版（Jul 24）构成”开源权重超级周”——如果两者按计划发布且企业开始迁移高体量工作负载，闭源前沿模型的定价权将以难以逆转的方式被侵蚀
AI安全两强竞争：Microsoft Project Perception vs Anthropic Mythos/Glasswing——AI驱动的漏洞检测正在巩固为真正的两强竞争，Microsoft以多模型路由攻击成本问题，Anthropic以安全专用模型领先
表格基础模型被认可：SAP 10亿欧元收购Prior Labs——最大的未开发企业AI机会不是聊天机器人，而是为运行企业的结构化数据专门构建的AI
AI治理组织竞赛：WAICO（29国）成立，中国首次在AI治理组织上”先到场带结构”——西方的Demis Hassabis呼吁暴露出目前没有对应的国际机构
免费窗口结束的竞争压力：Fable 5免费窗口到期与Kimi K3免费权重发布时机重叠——用户现在有真正有能力的免费替代方案
AI资本从重组中提取：Oracle裁员30000人注资Stargate是最诚实的AI资本会计——AI建设资本正从现有运营中被提取，而非全部来自新资金

Back to List