信号
去噪留真,只看重要的。
AI 每小时扫描 HN、X、Reddit、GitHub、Product Hunt,按 AI、编程、金融科技、独立开发的相关度打分。没有流量算法,只有信号。
Local AI models like OmniCoder-9B gain traction, while advancements in language model distillation and multi-model usage spark developer interest.
Gemini 3.1 Pro looping
Gemini 3.1 Pro, the official app on Android, is experiencing issues with looping, similar to problems seen in local LLMs. Users may want to monitor updates for fixes.
Refinement Modeling and Verification of RISC-V Assembly Using Knuckledragger
Researchers have developed a method for refinement modeling and verification of RISC-V assembly code using a tool called Knuckledragger, enhancing the reliability and correctness of RISC-V implementations. This advancement could significantly improve the development process for RISC-V architecture in various applications.
Making Sense of the DXY
The DXY, or U.S. Dollar Index, measures the dollar's value against a basket of currencies, and understanding its movements can provide insights into economic trends and investment opportunities. Monitoring the DXY is crucial for traders and investors as it influences global markets and commodity prices.
running Qwen3.5-27B Q5 splitt across a 4070ti and an amd rx6800 over LAN @ 13t/s with a 32k prompt
A user successfully runs the Qwen3.5-27B model across a 4070 Ti and an AMD RX6800 over LAN, achieving 13t/s with a 32k prompt, highlighting the potential of RPC servers for improved performance. This breakthrough allows those with limited GPU resources to utilize larger models effectively.
Self hosting, Power consumption, rentability and the cost of privacy, in France
A French self-hosting enthusiast shares their experience of upgrading to a powerful dual-3090 setup, highlighting concerns about power consumption and the cost of maintaining privacy. This raises questions about the sustainability and financial viability of self-hosting for individuals.
Anyone using Multi Model with the Qwen 3.5 Series?
Users are discussing the challenges of integrating the .8b model from the Qwen 3.5 series with other models, noting difficulties in achieving effective communication. If you're experimenting with multi-model setups, consider sharing your strategies or seeking advice from the community.
Tree Search Distillation for Language Models Using PPO
Researchers have introduced a method called Tree Search Distillation that utilizes Proximal Policy Optimization (PPO) to enhance the performance of language models. This approach aims to improve decision-making processes in AI by optimizing how models generate text.
OmniCoder-9B Q8_0 is one of the first small local models that has felt genuinely solid in my eval-gated workflow
OmniCoder-9B Q8_0 stands out as one of the first small local models to perform reliably in eval-gated workflows, emphasizing practical application over demo performance. This model is designed for real tasks with strict validation, making it a promising tool for developers seeking robust coding solutions.
Deriving Type Erasure
The article discusses the concept of type erasure in programming, focusing on methods to implement it effectively. This technique is crucial for optimizing performance and ensuring compatibility in type systems.
SBCL Fibers – Lightweight Cooperative Threads
SBCL Fibers introduces lightweight cooperative threads, enhancing performance and efficiency in concurrent programming. This innovation aims to simplify thread management while improving resource utilization.
Making your JITted Code known: Let me count the ways
The article discusses various methods for exposing Just-In-Time (JIT) compiled code, highlighting techniques that can enhance performance and debugging. Developers are encouraged to explore these strategies to optimize their applications effectively.
I compared 8 AI coding models on the same real-world feature in an open-source TypeScript project. Here are the results
A comparison of eight AI coding models on a real-world TypeScript project reveals that performance varies significantly, highlighting the limitations of synthetic benchmarks in assessing their effectiveness in practical applications. This underscores the importance of evaluating AI tools in real coding environments for more accurate insights.
Agents given the choice between natural language and structured queries abandoned NL within minutes
Agents using a new MCP server quickly abandoned natural language queries in favor of structured queries and direct entity traversal, challenging initial expectations about user preferences in accessing knowledge graphs. This shift highlights the potential limitations of natural language processing in certain contexts.
qwen 3.5 - tool errors because of
Users of qwen 3.5 have reported tool usage issues due to incorrect closing tags in the system's thinking commands. Adjusting the prompt to correct these tags may improve performance.
llama.cpp build b8338 adds OpenVINO backend + NPU support for prefill + kvcache
The latest build of llama.cpp (b8338) introduces an OpenVINO backend, NPU support for prefill, and kvcache enhancements, thanks to significant contributions from the Intel team. Users with compatible hardware, like the 255H and Arc 140T iGPU, can look forward to improved performance.
Postgres with Builtin File Systems
Postgres now supports integration with built-in file systems, enhancing its performance and storage capabilities. This update allows for more efficient data management and retrieval within PostgreSQL environments.
Anthropic invests $100M into the Claude Partner Network
Anthropic has announced a $100 million investment in the Claude Partner Network, aiming to expand its collaboration and capabilities in AI development.
Learning Creative Coding
Explore the world of creative coding to enhance your programming skills while expressing artistic ideas through technology. This approach combines coding with creativity, making it accessible and engaging for all levels.
A Preview of Coalton 2.0
Coalton 2.0 is set to launch soon, promising significant updates and enhancements. Stay tuned for more details on its features and improvements.
[R] I think consciousness has a phase transition, identity is a Riemannian manifold, and free will is literally just stochastic noise bounded by who you are [long but worth it, formal math inside]
A researcher proposes a theoretical framework suggesting that consciousness could emerge through a phase transition rather than gradually, with identity modeled as a Riemannian manifold and free will characterized as stochastic noise. This approach aims to provide a formal mathematical structure to concepts traditionally explored in philosophy.
[R] ZeroProofML: 'Train on Smooth, Infer on Strict' for undefined targets in scientific ML
ZeroProofML is a new framework designed for scientific machine learning challenges involving undefined or non-identifiable targets, addressing issues like division by zero not as a numerical error but as a semantic event. This tool could enhance model training and inference in complex scenarios such as poles and kinematic locks.
Cross-Lingual Acoustic Feature Database for Tabular ML and Emotion Recognition
A new cross-lingual acoustic feature database for tabular machine learning and emotion recognition has been released, replacing a previous version due to a bug. The updated dataset, featuring samples in seven languages, is available for free on Hugging Face, and feedback from users is encouraged.
Cicikus v3 Prometheus 4.4B - An Experimental Franken-Merge for Edge Reasoning
Prometech has launched Cicikus v3 Prometheus 4.4B, an experimental model that enhances the Llama 3.2 architecture by focusing on "Hot Zones" identified through L2 norm analysis. This targeted passthrough expansion aims to improve edge reasoning capabilities.
Qwen3.5 35b exl3 quants with text-generation-webui?
Users are experiencing issues with loading the Qwen3.5 35B exl3 quant model in text-generation-webui, often getting stuck during the process. If you're facing similar problems, consider checking for updates or troubleshooting steps on the model's Hugging Face page.
Offloading FFmpeg with Cloudflare
Cloudflare is now offering solutions to offload FFmpeg processing, enhancing video encoding efficiency and reducing server load for developers. This integration aims to streamline video workflows and improve performance for media applications.
Show HN: Han – A Korean programming language written in Rust
Han is a new programming language developed in Rust that is designed to support Korean speakers, offering a unique approach to coding in a native language. This could enhance accessibility and learning for Korean developers.
ISO: CV developer to continue developing on-device model & integration into app
A sports training app is seeking a CV developer to enhance its on-device model and integrate it into iOS, as the current developer lacks the necessary expertise. Interested candidates are encouraged to comment or DM for further details.
I tried running a full AI suite locally on a smartphone—and it didn't explode
A tech enthusiast successfully ran a full AI suite on a smartphone, challenging the notion that such a feat was impossible. This project highlights the potential for local AI applications, which could enhance privacy and reduce reliance on cloud services.
A Recursive Algorithm to Render Signed Distance Fields
A new recursive algorithm has been developed for rendering signed distance fields, potentially enhancing graphics rendering techniques in computer graphics and game development. This method may improve efficiency and visual quality in 3D modeling and simulations.
Claude Doubles Usage Limits During Off-Peak Hours (March 13–27, 2026)
Claude is increasing usage limits during off-peak hours from March 13 to March 27, 2026, allowing users to maximize their access during these times. This change could benefit users looking to utilize the platform more extensively outside of peak hours.
MCP Is Dead; Long Live MCP
MCP has officially been discontinued, but a new iteration or successor is set to emerge, promising to carry on its legacy. Stay tuned for updates on the upcoming version.
Marketing for Founders
"Marketing for Founders" offers essential strategies and insights for entrepreneurs to effectively promote their businesses and connect with customers. This resource is crucial for founders looking to enhance their marketing skills and drive growth.
[P] Karpathy's autoresearch with evolutionary database.
Karpathy's autoresearch project has been enhanced with an evolutionary database, replacing the previous TSV file logging system, which could improve the discovery of optimal solutions using evolutionary algorithms. This integration aims to streamline the research process in AI development.
vLLM on Jetson Orin — pre-built wheel with Marlin GPTQ support (3.8x prefill speedup)
A new pre-built wheel for vLLM now supports Marlin GPTQ on Jetson Orin devices, offering a significant 3.8x speedup in prefill performance. This update enables users to fully utilize their tensor cores during GPTQ inference on the Orin family.
Has anyone managed to get an sub 16GB VRAM competent "researcher" model that can do web searching, summarization and reasoning?
The user is seeking a sub-16GB VRAM model capable of web searching, summarization, and reasoning for research purposes, and is wondering if anyone has successfully implemented such a solution. If you have insights or solutions, sharing them could help this user achieve their goal.
[P] Karpathy's autoresearch with evolutionary database.
Karpathy's autoresearch project has been enhanced with an evolutionary database, replacing the previous TSV file logging system, which could improve the autonomous discovery of optimal solutions through evolutionary algorithms. This integration may significantly boost the project's efficiency in research automation.
UBI as a productivity dividend
Universal Basic Income (UBI) is being explored as a potential productivity booster, offering financial security that could enable individuals to pursue innovative projects and enhance overall economic output.
Show HN: Zap Code – AI code generator that teaches kids real HTML/CSS/JS
Zap Code is an AI-powered code generator designed to teach kids real HTML, CSS, and JavaScript, making coding accessible and engaging for young learners.
Generalizing Knuth's Pseudocode Architecture From Algorithms to Knowledge
Researchers are expanding Knuth's pseudocode architecture to encompass not just algorithms but also knowledge representation, aiming to enhance clarity and accessibility in conveying complex concepts. This development could improve educational tools and resources in computer science.
It's time to move your docs in the repo
It's time to relocate your documentation into the repository for better organization and accessibility.
Show HN: Ichinichi – One note per day, E2E encrypted, local-first
Ichinichi is a new note-taking app designed for daily entries, featuring end-to-end encryption and a local-first approach, ensuring user privacy and data security. It offers a simple way to document daily thoughts while keeping information secure and accessible offline.
Claudetop – htop for Claude Code sessions (see your AI spend in real-time)
Claudetop is a new tool that provides real-time monitoring of AI spending during Claude Code sessions, similar to the popular system monitor htop. This allows users to track their AI resource usage more effectively.
Companies House vulnerability enabled company hijacking
A security vulnerability at Companies House has been identified, allowing potential hijacking of company registrations. Businesses are urged to review their registrations and ensure their details are secure.
Has anyone else noticed that AI makes curiosity loops almost impossible to stop?
AI coding agents can create addictive "curiosity loops," making it difficult for users to disengage, similar to the effects of doom-scrolling. This phenomenon raises concerns about the impact of AI on attention and productivity.
StepFun releases SFT dataset used to train Step 3.5 Flash
StepFun has released the SFT dataset, which was utilized to train their Step 3.5 Flash model, potentially enhancing AI training and development.
Chunking for STT
A user is seeking effective methods to split 4-minute audio into 30-second segments for transcription with a fine-tuned speech-to-text (STT) model. Solutions for efficient audio chunking are needed.
Deepsek v4 confirmed to release next week
Deepsek v4 is set to be released next week, promising new features and improvements. Stay tuned for updates on its capabilities.
55 → 282 tok/s: How I got Qwen3.5-397B running at speed on 4x RTX PRO 6000 Blackwell
A custom CUTLASS kernel has significantly improved the performance of Qwen3.5-397B, achieving a speed of 282 tokens per second on 4x RTX PRO 6000 Blackwell GPUs, up from just 55 tok/s. A pre-built Docker image and a pull request to FlashInfer are now available for those interested in optimizing their setups.
Thoughts on generative A.I
The article discusses the implications and potential of generative AI, emphasizing its transformative impact across various industries and the need for ethical considerations in its development and deployment.
Is the Lenovo Legion T7 34IAS10 a good pick for local AI/CV training?
The Lenovo Legion T7 34IAS10 is a strong contender for local AI and computer vision training, thanks to its powerful hardware specifications. Consider it if you're looking for a reliable machine to handle demanding tasks in these fields.
