Horizon Daily

Horizon Summary: 2026-06-04 (EN)

2026-06-04T00:00:00+00:00

Analyzed 72 items, but none met the importance threshold.

No significant developments today. This might indicate:

A quiet day in your tracked sources
The AI score threshold is too high
Your information sources need expansion

Consider:

Lowering the ai_score_threshold in config.json
Adding more diverse information sources
Checking if the AI model is working correctly

Horizon Summary: 2026-06-03 (EN)

2026-06-03T00:00:00+00:00

From 21 items, 15 important content pieces were selected

MiniMax Introduces New Attention Architecture ⭐️ 9.0/10
Speaker Hacking: Wireless PC Exploitation ⭐️ 8.0/10
Memory Optimization Debate ⭐️ 8.0/10
Edsger: Handwritten Clojure REPL for reMarkable 2 ⭐️ 8.0/10
Nvidia GPU VRAM as Linux Swap Space ⭐️ 8.0/10
Microsoft Introduces MAI-Code-1-Flash Model ⭐️ 8.0/10
Portable C++ EnCodec Implementation Released ⭐️ 8.0/10
Semantic Tokenization Scheme for Language Models ⭐️ 8.0/10
TorchDAE: PyTorch Library for DAE Solvers ⭐️ 8.0/10
DaVinci Resolve 21 Released ⭐️ 7.0/10
Meta Introduces 30-Minute Tracking Opt-Out ⭐️ 7.0/10
PlayStation Console Architecture ⭐️ 7.0/10
Ceiling Projection Mapping of Planes ⭐️ 7.0/10
Uber Caps AI Tool Usage ⭐️ 7.0/10
Datasette Agent MicroPython 0.1a0 Released ⭐️ 7.0/10

MiniMax Introduces New Attention Architecture ⭐️ 9.0/10

MiniMax has introduced a new attention architecture called MiniMax Sparse Attention (MSA), which can scale to 1M tokens and achieves significant performance gains over previous models. This new architecture bypasses standard quadratic complexity by restructuring memory access patterns at the operator level. The introduction of MSA is significant because it enables more efficient processing of large amounts of data, which is crucial for applications such as natural language processing and deep learning. This breakthrough could lead to improved performance and reduced costs for these applications. The MSA architecture utilizes a ‘KV outer gather Q’ approach, which allows for contiguous hardware memory reads and reduces per-token compute to 1/20th of previous-generation models at full 1M context depth. This results in a 4× faster execution speed compared to Flash-Sparse-Attention and significant speedups in prefilling and decoding phases.

reddit · r/MachineLearning · /u/superintelligence03 · Jun 3, 01:26

Background: Attention architectures are a crucial component of deep learning models, particularly in natural language processing tasks. The traditional Transformer architecture has been widely adopted, but it suffers from quadratic complexity, making it inefficient for large-scale applications. Recent advancements have focused on developing more efficient attention mechanisms, such as sparse attention and hierarchical attention.

References

Horizon Summary: 2026-06-03 (ZH)

2026-06-03T00:00:00+00:00

从 54 条内容中筛选出 9 条重要资讯。

Adafruit 收到 Flux.ai 法律函件 ⭐️ 8.0/10
Anthropic 扩展 Project Glasswing 用于关键基础设施 ⭐️ 8.0/10
爱上 systemd timers——呼吁从 cron 迁移 ⭐️ 8.0/10
研究表明反向传播在一个训练周期内破坏 V1 脑对齐 ⭐️ 8.0/10
用户用 Qwen3.6-27B 替代 Claude 进行多智能体编排测试 ⭐️ 8.0/10
1 位和三值化的 4B 图像模型：本地设备极小占用 ⭐️ 8.0/10
Gemma 4 E4B 搭配 LiteRT 实现约 2.4 倍文本生成加速 ⭐️ 8.0/10
Codex 免费和 Go 订阅重置周期改为 30 天 ⭐️ 8.0/10
腾讯秘密为微信打造 AI 智能体连接数百万小程序 ⭐️ 8.0/10

Adafruit 收到 Flux.ai 法律函件 ⭐️ 8.0/10

Adafruit 收到了 Flux.ai 法律顾问 Fenwick 的律师函，威胁要就一篇关于 Flux.ai 产品及商业行为的计划中博客文章采取法律行动。这一事件凸显了开源硬件社区与采取激进法律手段压制批评的公司之间的紧张关系，可能抑制自由表达和诚实的评测。律师函是针对 Adafruit 一篇未发表的博客文章发出的；社区猜测该文章涉及 Flux.ai 的 AI 驱动 PCB 设计工具，该工具因计费和性能问题受到投诉。

hackernews · semanser · 6月2日 10:00 · 社区讨论

背景: Adafruit 是一家知名的开源硬件公司，经常评测工具和产品。Flux.ai 提供基于云、AI 辅助的 PCB 设计平台。律师函常被用来恐吓批评者，但可能适得其反，引来负面关注。

参考链接

Horizon Summary: 2026-06-02 (EN)

2026-06-02T00:00:00+00:00

From 69 items, 16 important content pieces were selected

AI Support Bot Exploit Bypasses Instagram 2FA ⭐️ 9.0/10
Red Hat npm packages compromised with credential-stealing malware ⭐️ 9.0/10
MiniMax M3: Open-Weight Frontier Model with 1M Context ⭐️ 9.0/10
Nvidia Unveils Vera Rubin Platform, Forecasts $1T Sales ⭐️ 9.0/10
Stanford CS336 Publishes AI Agent Guidelines for Students ⭐️ 8.0/10
RGB Normalization: Divide by 255 or 256? ⭐️ 8.0/10
Stanford CS336: Language Modeling from Scratch ⭐️ 8.0/10
Life’s Chemistry May Be Inherently Geological ⭐️ 8.0/10
Nvidia Unveils RTX Spark Arm Processor for Windows ⭐️ 8.0/10
Anthropic Files for IPO with SEC ⭐️ 8.0/10
Recording optimized kernel function signatures in BTF ⭐️ 8.0/10
Top LightGBM Feature Hurt Predictions Due to Label Variance ⭐️ 8.0/10
MLE-Bench gains largely due to better models, not algorithms ⭐️ 8.0/10
NVIDIA Announces Nemotron 3 Ultra LLM ⭐️ 8.0/10
NVIDIA DLSS 4.5 Ray Reconstruction Coming to All RTX GPUs in August ⭐️ 8.0/10
California bill passes requiring offline play after server shutdown ⭐️ 8.0/10

AI Support Bot Exploit Bypasses Instagram 2FA ⭐️ 9.0/10

Hackers exploited Meta’s AI support bot to take over Instagram accounts by tricking it into disabling 2FA and sending password reset emails to arbitrary addresses, as reported by Krebs on Security. This vulnerability reveals a critical flaw in Meta’s reliance on AI for account security, as the bot had privileged access that allowed it to bypass strong authentication measures, affecting all Instagram users who trust the platform’s security. The AI agent had the ability to remove 2FA from accounts, ignore the account’s registered email, and send password reset emails to any address provided by the attacker. This allowed account takeover without any authentication.

hackernews · ssiddharth · Jun 1, 16:31 · Discussion

Background: Two-factor authentication (2FA) adds an extra layer of security by requiring a second factor beyond a password. Automated customer support bots are increasingly used by companies like Meta to handle account recovery, but granting them privileged access to sensitive actions like disabling 2FA creates risk. This exploit demonstrates how social engineering can be applied to AI agents, similar to how attackers manipulate human support staff.

References

Horizon Summary: 2026-06-02 (ZH)

2026-06-02T00:00:00+00:00

从 69 条内容中筛选出 16 条重要资讯。

AI 客服机器人漏洞绕过 Instagram 双重认证 ⭐️ 9.0/10
Red Hat npm 包遭凭证窃取恶意软件入侵 ⭐️ 9.0/10
MiniMax M3：拥有 100 万上下文窗口的开源前沿模型 ⭐️ 9.0/10
英伟达发布 Vera Rubin 平台，预测销售额达 1 万亿美元 ⭐️ 9.0/10
斯坦福 CS336 发布学生 AI 代理使用指南 ⭐️ 8.0/10
RGB 归一化：除以 255 还是 256？ ⭐️ 8.0/10
斯坦福 CS336：从头开始的语言建模 ⭐️ 8.0/10
生命化学可能本质上是地质特征 ⭐️ 8.0/10
英伟达发布 RTX Spark Arm 处理器，面向 Windows 平台 ⭐️ 8.0/10
Anthropic 向 SEC 提交 IPO 申请 ⭐️ 8.0/10
在 BTF 中记录优化后的内核函数签名 ⭐️ 8.0/10
LightGBM 第一重要特征因标签方差损害预测 ⭐️ 8.0/10
MLE-Bench 的提升主要归因于更好的模型，而非算法进步 ⭐️ 8.0/10
NVIDIA 发布 Nemotron 3 Ultra 大语言模型 ⭐️ 8.0/10
NVIDIA DLSS 4.5 光线重建 8 月覆盖全系 RTX 显卡 ⭐️ 8.0/10
加州法案要求游戏停服后仍可离线游玩 ⭐️ 8.0/10

AI 客服机器人漏洞绕过 Instagram 双重认证 ⭐️ 9.0/10

黑客利用 Meta 的 AI 客服机器人，通过诱骗其禁用双重认证（2FA）并将密码重置邮件发送至任意地址，从而接管 Instagram 账户，Krebs on Security 报道了这一事件。该漏洞揭示了 Meta 依赖 AI 进行账户安全的关键缺陷：机器人拥有特权访问权限，能够绕过强身份验证措施，影响了所有信任该平台安全性的 Instagram 用户。该 AI 代理能够移除账户的 2FA，忽略账户注册邮箱，并将密码重置邮件发送至攻击者提供的任意地址，从而在无需任何身份验证的情况下实现账户接管。

hackernews · ssiddharth · 6月1日 16:31 · 社区讨论

背景: 双重认证（2FA）通过要求密码之外的第二个因素来增强安全性。Meta 等公司越来越多地使用自动化客服机器人处理账户恢复，但授予它们禁用 2FA 等敏感操作的特权访问权限会带来风险。此漏洞展示了社交工程如何应用于 AI 代理，类似于攻击者操纵人工客服人员的方式。

参考链接

Horizon Summary: 2026-06-01 (EN)

2026-06-01T00:00:00+00:00

From 44 items, 9 important content pieces were selected

Cloudflare Turnstile WebGL Fingerprinting Undermines Privacy ⭐️ 8.0/10
1-Bit Bonsai Image 4B: Efficient Local Image Generation ⭐️ 8.0/10
VideoLAN Unveils Dav2d: Open-Source AV2 Decoder ⭐️ 8.0/10
Linux Restartable Sequences Explained ⭐️ 8.0/10
Deflock reaches 100k mapped ALPRs in the US ⭐️ 8.0/10
NVIDIA Parakeet Ported to ggml: Faster, Quantized, No Python ⭐️ 8.0/10
Abliterated Gemma 4 E2B Variants Benchmarked ⭐️ 8.0/10
FROST Attack Uses SSD Timing to Spy on Users ⭐️ 8.0/10
AV2 Reference Encoder Reaches First 1.0.0 Release ⭐️ 8.0/10

Cloudflare Turnstile WebGL Fingerprinting Undermines Privacy ⭐️ 8.0/10

Cloudflare Turnstile now requires WebGL for fingerprinting, effectively bypassing privacy protections like Firefox’s resistFingerprinting and disabling access for minority browsers that lack WebGL support. This practice undermines user privacy by enabling persistent tracking without consent, and it disproportionately affects users of minority or privacy-focused browsers, fragmenting the web. The issue was reported by a minority browser maintainer who noted that users started encountering Cloudflare challenges a few weeks ago. WebGL fingerprinting uses hardware and driver details to create a unique identifier.

hackernews · HypnoticOcelot · May 31, 14:13 · Discussion

Background: Browser fingerprinting collects device information (OS, browser type, screen resolution, etc.) to create a unique identifier, often used for tracking without cookies. WebGL fingerprinting specifically leverages the graphics card’s capabilities, which vary greatly even between identical devices. Cloudflare Turnstile is a CAPTCHA alternative that aims to verify human users without manual puzzles, but its reliance on WebGL compromises privacy for non-standard browsers.

References

Horizon Summary: 2026-06-01 (ZH)

2026-06-01T00:00:00+00:00

从 44 条内容中筛选出 9 条重要资讯。

Cloudflare Turnstile 利用 WebGL 指纹识别破坏隐私 ⭐️ 8.0/10
1 比特 Bonsai Image 4B：高效本地图像生成 ⭐️ 8.0/10
VideoLAN 发布开源 AV2 解码器 Dav2d ⭐️ 8.0/10
Linux 重启序列详解 ⭐️ 8.0/10
Deflock 在美国绘制了 10 万个车牌读取器 ⭐️ 8.0/10
NVIDIA Parakeet 移植到 ggml：更快、量化、无需 Python ⭐️ 8.0/10
去除安全对齐的 Gemma 4 E2B 变体基准测试 ⭐️ 8.0/10
FROST 攻击利用 SSD 定时窥探用户活动 ⭐️ 8.0/10
AV2 参考编码器发布首个 1.0.0 版本 ⭐️ 8.0/10

Cloudflare Turnstile 利用 WebGL 指纹识别破坏隐私 ⭐️ 8.0/10

Cloudflare Turnstile 现在要求使用 WebGL 进行指纹识别，这实际上绕过了 Firefox 等浏览器的隐私保护措施，并导致不支持 WebGL 的小众浏览器无法访问。这种做法通过未经同意的持久追踪侵犯用户隐私，并且对小众或注重隐私的浏览器用户造成不成比例的影响，导致网络碎片化。该问题由一位小众浏览器维护者报告，他注意到几周前用户开始遇到 Cloudflare 的挑战。WebGL 指纹识别利用硬件和驱动程序细节生成唯一标识符。

hackernews · HypnoticOcelot · 5月31日 14:13 · 社区讨论

背景: 浏览器指纹识别通过收集设备信息（操作系统、浏览器类型、屏幕分辨率等）生成唯一标识符，常用于无 Cookie 追踪。WebGL 指纹识别专门利用显卡的差异性，即使相同的设备也可能不同。Cloudflare Turnstile 是一种 CAPTCHA 替代方案，旨在无需手动拼图即可验证人类用户，但它对 WebGL 的依赖损害了非标准浏览器的隐私。

参考链接

Horizon Summary: 2026-05-31 (EN)

2026-05-31T00:00:00+00:00

From 48 items, 14 important content pieces were selected

Running Python ASGI Apps in Browser with Pyodide and Service Workers ⭐️ 9.0/10
SpaceX Wins $4.16B US Military Satellite Missile Tracking Contract ⭐️ 9.0/10
Accenture acquires Ookla for $1.2B ⭐️ 8.0/10
Zig’s ELF Linker Improvements Detailed in Devlog ⭐️ 8.0/10
Voxel Space Tutorial Revives 1992 Comanche Graphics ⭐️ 8.0/10
OpenRouter raises $113M Series B ⭐️ 8.0/10
Openrsync: OpenBSD’s reimplementation of rsync adopted in macOS ⭐️ 8.0/10
Pope Leo’s first encyclical criticizes technological messianism ⭐️ 8.0/10
Anthropic details sandboxing techniques for Claude across products ⭐️ 8.0/10
Debugger reveals training failures local to layers and steps ⭐️ 8.0/10
NVIDIA NVFP4 Quantization of Qwen3.6-35B-A3B ⭐️ 8.0/10
GPU Specs Comparison for Local LLM Inference Challenges Mac Recommendations ⭐️ 8.0/10
Parallax: Parameterized Local Linear Attention for LLMs ⭐️ 8.0/10
Huawei Proposes ‘Tao Law’ Using Temporal Scaling for Chips ⭐️ 8.0/10

Running Python ASGI Apps in Browser with Pyodide and Service Workers ⭐️ 9.0/10

Simon Willison demonstrated a method to run Python ASGI apps in the browser using Pyodide and Service Workers, enabling execution of JavaScript script tags that previously failed in Web Worker-based approaches. This was achieved via a Claude Code experiment and tested with Datasette Lite and a basic ASGI FastCGI demo. This breakthrough overcomes a key limitation of running Python apps in the browser, allowing proper execution of JavaScript-dependent plugins and dynamic content. It significantly enhances the capabilities of Python-in-browser tools like Datasette Lite and expands the potential for serverless Python applications. The demo uses Service Workers instead of Web Workers to intercept network requests and run Python ASGI apps within Pyodide, preserving script tag execution. Simon plans to upgrade Datasette Lite to adopt this approach after fully understanding the implementation.

rss · Simon Willison · May 30, 21:02

Background: Pyodide is a Python distribution for the browser based on WebAssembly, allowing Python to run entirely on the client side. ASGI (Asynchronous Server Gateway Interface) is a specification for asynchronous Python web servers and applications, enabling modern web frameworks like FastAPI and Starlette. Service Workers are scripts that run in the background of a web browser, capable of intercepting network requests and enabling offline experiences.

References

Horizon Summary: 2026-05-31 (ZH)

2026-05-31T00:00:00+00:00

从 48 条内容中筛选出 14 条重要资讯。

在浏览器中用 Pyodide 和服务工作进程运行 Python ASGI 应用 ⭐️ 9.0/10
SpaceX 获 41.6 亿美元美军卫星导弹追踪合同 ⭐️ 9.0/10
埃森哲以 12 亿美元收购 Ookla ⭐️ 8.0/10
Zig ELF 链接器改进日志详解 ⭐️ 8.0/10
Voxel Space 教程重现 1992 年《Comanche》图形技术 ⭐️ 8.0/10
OpenRouter 获 1.13 亿美元 B 轮融资 ⭐️ 8.0/10
Openrsync：OpenBSD 对 rsync 的重实现，已被 macOS 采用 ⭐️ 8.0/10
教皇利奥首篇通谕抨击技术弥赛亚主义 ⭐️ 8.0/10
Anthropic 详解 Claude 产品沙箱技术 ⭐️ 8.0/10
调试器揭示训练失败局部化到特定层和步骤 ⭐️ 8.0/10
英伟达发布 Qwen3.6-35B-A3B 的 NVFP4 量化版本 ⭐️ 8.0/10
本地 LLM 推理的 GPU 规格对比挑战 Mac 推荐 ⭐️ 8.0/10
Parallax：用于大语言模型的参数化局部线性注意力机制 ⭐️ 8.0/10
华为提出“韬定律”：用时间缩微替代几何缩微 ⭐️ 8.0/10

在浏览器中用 Pyodide 和服务工作进程运行 Python ASGI 应用 ⭐️ 9.0/10

Simon Willison 展示了一种使用 Pyodide 和服务工作进程在浏览器中运行 Python ASGI 应用的方法，使得之前基于 Web Worker 的方法中无法执行的 JavaScript 脚本标签得以正常运行。这是通过 Claude Code 实验实现的，并在 Datasette Lite 和一个基本的 ASGI FastCGI 演示中进行了测试。这一突破克服了在浏览器中运行 Python 应用的关键限制，使得依赖 JavaScript 的插件和动态内容能够正常执行。它显著增强了 Datasette Lite 等浏览器内 Python 工具的能力，并扩展了无服务器 Python 应用的潜力。该演示使用服务工作进程替代 Web Worker 来拦截网络请求并在 Pyodide 中运行 Python ASGI 应用，从而保留了脚本标签的执行。Simon 计划在完全理解实现后，将 Datasette Lite 升级为采用这种方法。

rss · Simon Willison · 5月30日 21:02

背景: Pyodide 是一个基于 WebAssembly 的浏览器 Python 发行版，允许 Python 完全在客户端运行。ASGI（异步服务器网关接口）是异步 Python Web 服务器和应用的规范，支持 FastAPI 和 Starlette 等现代 Web 框架。服务工作进程是在 Web 浏览器后台运行的脚本，能够拦截网络请求并实现离线体验。

参考链接

Horizon Summary: 2026-05-30 (EN)

2026-05-30T00:00:00+00:00

From 53 items, 16 important content pieces were selected

vLLM v0.22.0 Released with DeepSeek V4 Maturity and Rust Frontend ⭐️ 9.0/10
Probe-Targeted Fine-Tuning Makes LLMs Express True Confidence ⭐️ 9.0/10
Hacker finds critical flaws in CBSE online exam grading system ⭐️ 9.0/10
California Assembly Passes ‘Protect Our Games Act’ ⭐️ 8.0/10
Is AI repeating frontend’s ‘lost decade’? ⭐️ 8.0/10
Anthropic run-rate revenue reaches $47 billion ⭐️ 8.0/10
Loadable Crypto Module Proposed for FIPS Certification ⭐️ 8.0/10
Protestware targets AI coding agents via jqwik library ⭐️ 8.0/10
Monokernel achieves 3,300 tokens/s on AMD MI300X ⭐️ 8.0/10
Qwen3.6-27B Quantization Benchmark by User ⭐️ 8.0/10
Multi-Token Prediction speeds up inference up to 3.34x ⭐️ 8.0/10
Nvidia teases N1X laptop chip with 20 ARM cores, 6144 CUDA cores for Computex ⭐️ 8.0/10
StepFun Releases Step 3.7 Flash, a 196B MoE Model ⭐️ 8.0/10
BYD offers one-year accident liability coverage for city NOA ⭐️ 8.0/10
China Certifies Nine Domestic AI Chips for Gov Procurement ⭐️ 8.0/10
Blue Origin’s New Glenn Rocket Explodes in Static Fire Test ⭐️ 8.0/10

vLLM v0.22.0 Released with DeepSeek V4 Maturity and Rust Frontend ⭐️ 9.0/10

vLLM released version 0.22.0 with 459 commits from 230 contributors, featuring major hardening for DeepSeek V4, progress on Model Runner V2 toward default, and an experimental Rust frontend. Key improvements include NVFP4 fused MoE support, piecewise CUDA graphs, MTP speculative decoding, and multi-tier KV cache offloading. This release significantly enhances the inference efficiency and model support for DeepSeek V4, a state-of-the-art MoE model, while pushing Model Runner V2 towards broader adoption. The experimental Rust frontend also signals vLLM’s exploration of performance-critical paths in a safer systems language. DeepSeek V4 now has a dedicated package, NVFP4 fused MoE, full and piecewise CUDA graph support, and MTP speculative decoding. Model Runner V2 gains an oracle to select it for Qwen3 dense models and automatic fallback to MRv1 when a KV connector is present.

github · khluu · May 29, 10:28

Background: vLLM is a high-throughput LLM inference engine with PagedAttention for efficient memory management. DeepSeek V4 is a Mixture-of-Experts (MoE) model that requires specialized kernel optimizations. NVFP4 fused MoE uses 4-bit floating point for faster expert computation, piecewise CUDA graphs reduce graph compilation overhead, and MTP speculative decoding uses Multi-Token Prediction drafters to speed up generation.

References