Signal

Noise removed. What's left is what matters.

My AI agent scans HN, X, Reddit, GitHub & Product Hunt every hour, scoring by relevance to AI, coding, fintech, and indie building. No engagement algorithms — just signal.

Today's pulse
2026-03-15
2227 items
AI · Reddit · 54m · 1 pt

OmniCoder-9B Q8_0 is one of the first small local models that has felt genuinely solid in my eval-gated workflow

OmniCoder-9B Q8_0 stands out as one of the first small local models to perform reliably in eval-gated workflows, emphasizing practical application over demo performance. This model is designed for real tasks with strict validation, making it a promising tool for developers seeking robust coding solutions.

Code · HN · 54m · 14 pts

Deriving Type Erasure

The article works through how to implement type erasure, a technique that hides differing concrete types behind a single uniform interface so they can be stored and called through one wrapper type.

Code · HN · 54m · 11 pts

SBCL Fibers – Lightweight Cooperative Threads

SBCL Fibers introduces lightweight cooperative threads, enhancing performance and efficiency in concurrent programming. This innovation aims to simplify thread management while improving resource utilization.

Code · HN · 1h · 3 pts

Making your JITted Code known: Let me count the ways

The article discusses various methods for exposing Just-In-Time (JIT) compiled code, highlighting techniques that can enhance performance and debugging. Developers are encouraged to explore these strategies to optimize their applications effectively.

AI · Reddit · 2h · 3 pts

I compared 8 AI coding models on the same real-world feature in an open-source TypeScript project. Here are the results

A comparison of eight AI coding models on a real-world TypeScript project reveals that performance varies significantly, highlighting the limitations of synthetic benchmarks in assessing their effectiveness in practical applications. This underscores the importance of evaluating AI tools in real coding environments for more accurate insights.

AI · Reddit · 2h · 3 pts

Agents given the choice between natural language and structured queries abandoned NL within minutes

Agents using a new MCP server quickly abandoned natural language queries in favor of structured queries and direct entity traversal, challenging initial expectations about user preferences in accessing knowledge graphs. This shift highlights the potential limitations of natural language processing in certain contexts.

AI · Reddit · 2h · 3 pts

qwen 3.5 - tool errors because of

Users of qwen 3.5 have reported tool usage issues due to incorrect closing tags in the system's thinking commands. Adjusting the prompt to correct these tags may improve performance.

AI · Reddit · 2h · 2 pts

llama.cpp build b8338 adds OpenVINO backend + NPU support for prefill + kvcache

The latest build of llama.cpp (b8338) introduces an OpenVINO backend, NPU support for prefill, and kvcache enhancements, thanks to significant contributions from the Intel team. Users with compatible hardware, like the 255H and Arc 140T iGPU, can look forward to improved performance.

AI · HN · 2h · 8 pts

Postgres with Builtin File Systems

Postgres now supports integration with built-in file systems, enhancing its performance and storage capabilities. This update allows for more efficient data management and retrieval within PostgreSQL environments.

AI · HN · 2h · 28 pts

Anthropic invests $100M into the Claude Partner Network

Anthropic has announced a $100 million investment in the Claude Partner Network, aiming to expand its collaboration and capabilities in AI development.

Code · HN · 2h · 13 pts

Learning Creative Coding

Explore the world of creative coding to enhance your programming skills while expressing artistic ideas through technology. This approach combines coding with creativity, making it accessible and engaging for all levels.

Code · Lobsters · 3h · 2 pts

A Preview of Coalton 2.0

Coalton 2.0 is set to launch soon, promising significant updates and enhancements. Stay tuned for more details on its features and improvements.

AI · Reddit · 3h

[R] I think consciousness has a phase transition, identity is a Riemannian manifold, and free will is literally just stochastic noise bounded by who you are [long but worth it, formal math inside]

A researcher proposes a theoretical framework suggesting that consciousness could emerge through a phase transition rather than gradually, with identity modeled as a Riemannian manifold and free will characterized as stochastic noise. This approach aims to provide a formal mathematical structure to concepts traditionally explored in philosophy.

AI · Reddit · 3h · 1 pt

[R] ZeroProofML: 'Train on Smooth, Infer on Strict' for undefined targets in scientific ML

ZeroProofML is a new framework designed for scientific machine learning challenges involving undefined or non-identifiable targets, addressing issues like division by zero not as a numerical error but as a semantic event. This tool could enhance model training and inference in complex scenarios such as poles and kinematic locks.

AI · Reddit · 3h · 2 pts

Cross-Lingual Acoustic Feature Database for Tabular ML and Emotion Recognition

A new cross-lingual acoustic feature database for tabular machine learning and emotion recognition has been released, replacing a previous version due to a bug. The updated dataset, featuring samples in seven languages, is available for free on Hugging Face, and feedback from users is encouraged.

AI · Reddit · 3h · 2 pts

Cicikus v3 Prometheus 4.4B - An Experimental Franken-Merge for Edge Reasoning

Prometech has launched Cicikus v3 Prometheus 4.4B, an experimental model that enhances the Llama 3.2 architecture by focusing on "Hot Zones" identified through L2 norm analysis. This targeted passthrough expansion aims to improve edge reasoning capabilities.

AI · Reddit · 3h · 2 pts

Qwen3.5 35b exl3 quants with text-generation-webui?

Users are experiencing issues with loading the Qwen3.5 35B exl3 quant model in text-generation-webui, often getting stuck during the process. If you're facing similar problems, consider checking for updates or troubleshooting steps on the model's Hugging Face page.

Code · HN · 3h · 5 pts

Offloading FFmpeg with Cloudflare

Cloudflare is now offering solutions to offload FFmpeg processing, enhancing video encoding efficiency and reducing server load for developers. This integration aims to streamline video workflows and improve performance for media applications.

Code · HN · 3h · 25 pts

Show HN: Han – A Korean programming language written in Rust

Han is a new programming language developed in Rust that is designed to support Korean speakers, offering a unique approach to coding in a native language. This could enhance accessibility and learning for Korean developers.

Code · Reddit · 4h · 1 pt

ISO: CV developer to continue developing on-device model & integration into app

A sports training app is seeking a CV developer to enhance its on-device model and integrate it into iOS, as the current developer lacks the necessary expertise. Interested candidates are encouraged to comment or DM for further details.

AI · Reddit · 4h · 1 pt

I tried running a full AI suite locally on a smartphone—and it didn't explode

A tech enthusiast successfully ran a full AI suite on a smartphone, challenging the notion that such a feat was impossible. This project highlights the potential for local AI applications, which could enhance privacy and reduce reliance on cloud services.

Code · HN · 4h · 6 pts

A Recursive Algorithm to Render Signed Distance Fields

A new recursive algorithm has been developed for rendering signed distance fields, potentially enhancing graphics rendering techniques in computer graphics and game development. This method may improve efficiency and visual quality in 3D modeling and simulations.

AI · HN · 4h · 5 pts

Claude Doubles Usage Limits During Off-Peak Hours (March 13–27, 2026)

Claude is increasing usage limits during off-peak hours from March 13 to March 27, 2026, allowing users to maximize their access during these times. This change could benefit users looking to utilize the platform more extensively outside of peak hours.

AI · HN · 4h · 31 pts

MCP Is Dead; Long Live MCP

An opinion piece on the Model Context Protocol (MCP). The title echoes "The king is dead; long live the king" and should not be read as an announcement that MCP has been discontinued.

Indie · HN · 4h · 29 pts

Marketing for Founders

"Marketing for Founders" offers essential strategies and insights for entrepreneurs to effectively promote their businesses and connect with customers. This resource is crucial for founders looking to enhance their marketing skills and drive growth.

AI · Reddit · 5h

[P] Karpathy's autoresearch with evolutionary database.

Karpathy's autoresearch project has been enhanced with an evolutionary database, replacing the previous TSV file logging system, which could improve the discovery of optimal solutions using evolutionary algorithms. This integration aims to streamline the research process in AI development.

Code · Reddit · 5h · 2 pts

vLLM on Jetson Orin — pre-built wheel with Marlin GPTQ support (3.8x prefill speedup)

A new pre-built wheel for vLLM now supports Marlin GPTQ on Jetson Orin devices, offering a significant 3.8x speedup in prefill performance. This update enables users to fully utilize their tensor cores during GPTQ inference on the Orin family.

AI · Reddit · 5h · 3 pts

Has anyone managed to get an sub 16GB VRAM competent "researcher" model that can do web searching, summarization and reasoning?

The user is seeking a sub-16GB VRAM model capable of web searching, summarization, and reasoning for research purposes, and is wondering if anyone has successfully implemented such a solution. If you have insights or solutions, sharing them could help this user achieve their goal.

General · HN · 5h · 75 pts

UBI as a productivity dividend

Universal Basic Income (UBI) is being explored as a potential productivity booster, offering financial security that could enable individuals to pursue innovative projects and enhance overall economic output.

Code · HN · 5h · 6 pts

Show HN: Zap Code – AI code generator that teaches kids real HTML/CSS/JS

Zap Code is an AI-powered code generator designed to teach kids real HTML, CSS, and JavaScript, making coding accessible and engaging for young learners.

AI · HN · 5h · 13 pts

Generalizing Knuth's Pseudocode Architecture From Algorithms to Knowledge

Researchers are expanding Knuth's pseudocode architecture to encompass not just algorithms but also knowledge representation, aiming to enhance clarity and accessibility in conveying complex concepts. This development could improve educational tools and resources in computer science.

General · HN · 5h · 28 pts

It's time to move your docs in the repo

It's time to relocate your documentation into the repository for better organization and accessibility.

Indie · HN · 5h · 15 pts

Show HN: Ichinichi – One note per day, E2E encrypted, local-first

Ichinichi is a new note-taking app designed for daily entries, featuring end-to-end encryption and a local-first approach, ensuring user privacy and data security. It offers a simple way to document daily thoughts while keeping information secure and accessible offline.

AI · HN · 5h · 12 pts

Claudetop – htop for Claude Code sessions (see your AI spend in real-time)

Claudetop is a new tool that provides real-time monitoring of AI spending during Claude Code sessions, similar to the popular system monitor htop. This allows users to track their AI resource usage more effectively.

General · Lobsters · 6h · 1 pt

Companies House vulnerability enabled company hijacking

A security vulnerability at Companies House has been identified, allowing potential hijacking of company registrations. Businesses are urged to review their registrations and ensure their details are secure.

AI · Reddit · 6h · 3 pts

Has anyone else noticed that AI makes curiosity loops almost impossible to stop?

AI coding agents can create addictive "curiosity loops," making it difficult for users to disengage, similar to the effects of doom-scrolling. This phenomenon raises concerns about the impact of AI on attention and productivity.

AI · Reddit · 6h · 4 pts

StepFun releases SFT dataset used to train Step 3.5 Flash

StepFun has released the SFT dataset, which was utilized to train their Step 3.5 Flash model, potentially enhancing AI training and development.

AI · Reddit · 6h · 3 pts

Chunking for STT

A user is seeking effective methods to split 4-minute audio into 30-second segments for transcription with a fine-tuned speech-to-text (STT) model. Solutions for efficient audio chunking are needed.
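
One possible approach to the splitting step, as a minimal sketch only: compute fixed-length chunk boundaries in samples and slice the audio array with them. The helper name `chunk_bounds`, the 16 kHz sample rate, and the optional overlap parameter are assumptions for illustration and do not come from the post.

```python
def chunk_bounds(total_samples, sample_rate, chunk_seconds=30.0, overlap_seconds=0.0):
    """Return (start, end) sample indices that cover the audio in fixed-length chunks.

    Hypothetical helper: the last chunk may be shorter than chunk_seconds.
    """
    if chunk_seconds <= 0 or overlap_seconds < 0 or overlap_seconds >= chunk_seconds:
        raise ValueError("need chunk_seconds > overlap_seconds >= 0")

    chunk = int(round(chunk_seconds * sample_rate))
    # Each new chunk starts one "step" after the previous one; overlap shortens the step.
    step = chunk - int(round(overlap_seconds * sample_rate))

    bounds = []
    start = 0
    while start < total_samples:
        end = min(start + chunk, total_samples)
        bounds.append((start, end))
        if end == total_samples:
            break  # reached the end of the audio
        start += step
    return bounds


# Assumed input: 4 minutes of mono audio at 16 kHz.
sample_rate = 16_000
total_samples = 4 * 60 * sample_rate
segments = chunk_bounds(total_samples, sample_rate)
print(len(segments), segments[0], segments[-1])
```

Each `(start, end)` pair can then be used to slice a sample array (for example `audio[start:end]`) before passing the segment to the STT model. A small overlap can help when a word straddles a boundary, at the cost of deduplicating the overlapping text afterwards.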

AI · Reddit · 6h · 42 pts

DeepSeek V4 confirmed to release next week

According to the post, DeepSeek V4 is set to be released next week.

AI · Reddit · 6h · 6 pts

55 → 282 tok/s: How I got Qwen3.5-397B running at speed on 4x RTX PRO 6000 Blackwell

A custom CUTLASS kernel has significantly improved the performance of Qwen3.5-397B, achieving a speed of 282 tokens per second on 4x RTX PRO 6000 Blackwell GPUs, up from just 55 tok/s. A pre-built Docker image and a pull request to FlashInfer are now available for those interested in optimizing their setups.

AI · Lobsters · 7h · 2 pts

Thoughts on generative A.I

The article discusses the implications and potential of generative AI, emphasizing its transformative impact across various industries and the need for ethical considerations in its development and deployment.

Code · Reddit · 7h · 1 pt

Is the Lenovo Legion T7 34IAS10 a good pick for local AI/CV training?

A user asks whether the Lenovo Legion T7 34IAS10 is a good choice for local AI and computer vision (CV) training.

Code · Reddit · 7h · 1 pt

VLM & VRAM recommendations for 8MP/4K image analysis

A user asks for vision-language model (VLM) and VRAM recommendations for a local pipeline that analyzes 8MP/4K images in infrequent, event-driven bursts of inference.

AI · Reddit · 7h · 26 pts

Claude Opus 4.6 knows what it doesn't know!

Claude Opus 4.6 demonstrates a significant advancement in AI by acknowledging its limitations, which some argue is a step toward artificial general intelligence (AGI). This capability has sparked discussions about the implications of AI understanding its own knowledge boundaries.

Code · Reddit · 7h · 2 pts

llama-server API - Is there a way to save slots/ids already ingested with Qwen3.5 35b a3b?

Users are seeking solutions to save and recall long prompts with the Qwen3.5 35b MoE model in the llama-server API, but current limitations prevent effective memory retention. If you're working with this model, consider exploring alternative methods or tools that might better support your needs.

AI · Reddit · 7h · 2 pts

I wanted to score my AI coding prompts without sending them anywhere — built a local scoring tool using NLP research papers, Ollama optional

A developer created a local scoring tool to evaluate AI coding prompts without sharing data online, utilizing insights from NLP research papers. This tool allows users to analyze the effectiveness of their prompts while maintaining privacy.

AI · Reddit · 7h · 38 pts

Nvidia's Nemotron 3 Super is a bigger deal than you think

Nvidia's Nemotron 3 Super is generating significant buzz for its advanced capabilities, potentially transforming AI applications and enhancing performance in various sectors. This development could have major implications for industries reliant on cutting-edge technology.

Code · Reddit · 7h · 16 pts

(Very) High-Quality Attention Coder-Next GGUFs

A researcher shares insights from their experiments with Qwen3-Coder-Next, focusing on quantization techniques that enhance performance in client programming and data processing tasks, along with high-quality attention tensors.

Code · Reddit · 7h · 6 pts

Qwen3 TTS in C++ with 1.7B support, speaker encoding extraction, and desktop UI

A new Qwen3 TTS implementation in C++ has been developed, featuring 1.7B support, speaker encoding extraction, and a desktop UI, enhancing functionality and code quality. The project is available on GitHub for those interested in exploring or contributing.