信号

去噪留真,只看重要的。

AI 每小时扫描 HN、X、Reddit、GitHub、Product Hunt,按 AI、编程、金融科技、独立开发的相关度打分。没有流量算法,只有信号。

2234 条
AIReddit14m2 pts

Gemini 3.1 Pro looping

Gemini 3.1 Pro, the official app on Android, is experiencing issues with looping, similar to problems seen in local LLMs. Users may want to monitor updates for fixes.

CodeHN15m3 pts

Refinement Modeling and Verification of RISC-V Assembly Using Knuckledragger

Researchers have developed a method for refinement modeling and verification of RISC-V assembly code using a tool called Knuckledragger, enhancing the reliability and correctness of RISC-V implementations. This advancement could significantly improve the development process for RISC-V architecture in various applications.

FintechHN15m5 pts

Making Sense of the DXY

The DXY, or U.S. Dollar Index, measures the dollar's value against a basket of currencies, and understanding its movements can provide insights into economic trends and investment opportunities. Monitoring the DXY is crucial for traders and investors as it influences global markets and commodity prices.

AIReddit1h1 pts

running Qwen3.5-27B Q5 splitt across a 4070ti and an amd rx6800 over LAN @ 13t/s with a 32k prompt

A user successfully runs the Qwen3.5-27B model across a 4070 Ti and an AMD RX6800 over LAN, achieving 13t/s with a 32k prompt, highlighting the potential of RPC servers for improved performance. This breakthrough allows those with limited GPU resources to utilize larger models effectively.

GeneralReddit1h3 pts

Self hosting, Power consumption, rentability and the cost of privacy, in France

A French self-hosting enthusiast shares their experience of upgrading to a powerful dual-3090 setup, highlighting concerns about power consumption and the cost of maintaining privacy. This raises questions about the sustainability and financial viability of self-hosting for individuals.

AIReddit1h3 pts

Anyone using Multi Model with the Qwen 3.5 Series?

Users are discussing the challenges of integrating the .8b model from the Qwen 3.5 series with other models, noting difficulties in achieving effective communication. If you're experimenting with multi-model setups, consider sharing your strategies or seeking advice from the community.

AIHN1h5 pts

Tree Search Distillation for Language Models Using PPO

Researchers have introduced a method called Tree Search Distillation that utilizes Proximal Policy Optimization (PPO) to enhance the performance of language models. This approach aims to improve decision-making processes in AI by optimizing how models generate text.

AIReddit2h1 pts

OmniCoder-9B Q8_0 is one of the first small local models that has felt genuinely solid in my eval-gated workflow

OmniCoder-9B Q8_0 stands out as one of the first small local models to perform reliably in eval-gated workflows, emphasizing practical application over demo performance. This model is designed for real tasks with strict validation, making it a promising tool for developers seeking robust coding solutions.

CodeHN2h14 pts

Deriving Type Erasure

The article discusses the concept of type erasure in programming, focusing on methods to implement it effectively. This technique is crucial for optimizing performance and ensuring compatibility in type systems.

CodeHN2h11 pts

SBCL Fibers – Lightweight Cooperative Threads

SBCL Fibers introduces lightweight cooperative threads, enhancing performance and efficiency in concurrent programming. This innovation aims to simplify thread management while improving resource utilization.

CodeHN3h3 pts

Making your JITted Code known: Let me count the ways

The article discusses various methods for exposing Just-In-Time (JIT) compiled code, highlighting techniques that can enhance performance and debugging. Developers are encouraged to explore these strategies to optimize their applications effectively.

AIReddit4h3 pts

I compared 8 AI coding models on the same real-world feature in an open-source TypeScript project. Here are the results

A comparison of eight AI coding models on a real-world TypeScript project reveals that performance varies significantly, highlighting the limitations of synthetic benchmarks in assessing their effectiveness in practical applications. This underscores the importance of evaluating AI tools in real coding environments for more accurate insights.

AIReddit4h3 pts

Agents given the choice between natural language and structured queries abandoned NL within minutes

Agents using a new MCP server quickly abandoned natural language queries in favor of structured queries and direct entity traversal, challenging initial expectations about user preferences in accessing knowledge graphs. This shift highlights the potential limitations of natural language processing in certain contexts.

AIReddit4h3 pts

qwen 3.5 - tool errors because of

Users of qwen 3.5 have reported tool usage issues due to incorrect closing tags in the system's thinking commands. Adjusting the prompt to correct these tags may improve performance.

AIReddit4h2 pts

llama.cpp build b8338 adds OpenVINO backend + NPU support for prefill + kvcache

The latest build of llama.cpp (b8338) introduces an OpenVINO backend, NPU support for prefill, and kvcache enhancements, thanks to significant contributions from the Intel team. Users with compatible hardware, like the 255H and Arc 140T iGPU, can look forward to improved performance.

AIHN4h8 pts

Postgres with Builtin File Systems

Postgres now supports integration with built-in file systems, enhancing its performance and storage capabilities. This update allows for more efficient data management and retrieval within PostgreSQL environments.

AIHN4h28 pts

Anthropic invests $100M into the Claude Partner Network

Anthropic has announced a $100 million investment in the Claude Partner Network, aiming to expand its collaboration and capabilities in AI development.

CodeHN4h13 pts

Learning Creative Coding

Explore the world of creative coding to enhance your programming skills while expressing artistic ideas through technology. This approach combines coding with creativity, making it accessible and engaging for all levels.

CodeLobsters5h2 pts

A Preview of Coalton 2.0

Coalton 2.0 is set to launch soon, promising significant updates and enhancements. Stay tuned for more details on its features and improvements.

AIReddit5h

[R] I think consciousness has a phase transition, identity is a Riemannian manifold, and free will is literally just stochastic noise bounded by who you are [long but worth it, formal math inside]

A researcher proposes a theoretical framework suggesting that consciousness could emerge through a phase transition rather than gradually, with identity modeled as a Riemannian manifold and free will characterized as stochastic noise. This approach aims to provide a formal mathematical structure to concepts traditionally explored in philosophy.

AIReddit5h1 pts

[R] ZeroProofML: 'Train on Smooth, Infer on Strict' for undefined targets in scientific ML

ZeroProofML is a new framework designed for scientific machine learning challenges involving undefined or non-identifiable targets, addressing issues like division by zero not as a numerical error but as a semantic event. This tool could enhance model training and inference in complex scenarios such as poles and kinematic locks.

AIReddit5h2 pts

Cross-Lingual Acoustic Feature Database for Tabular ML and Emotion Recognition

A new cross-lingual acoustic feature database for tabular machine learning and emotion recognition has been released, replacing a previous version due to a bug. The updated dataset, featuring samples in seven languages, is available for free on Hugging Face, and feedback from users is encouraged.

AIReddit5h2 pts

Cicikus v3 Prometheus 4.4B - An Experimental Franken-Merge for Edge Reasoning

Prometech has launched Cicikus v3 Prometheus 4.4B, an experimental model that enhances the Llama 3.2 architecture by focusing on "Hot Zones" identified through L2 norm analysis. This targeted passthrough expansion aims to improve edge reasoning capabilities.

AIReddit5h2 pts

Qwen3.5 35b exl3 quants with text-generation-webui?

Users are experiencing issues with loading the Qwen3.5 35B exl3 quant model in text-generation-webui, often getting stuck during the process. If you're facing similar problems, consider checking for updates or troubleshooting steps on the model's Hugging Face page.

CodeHN5h5 pts

Offloading FFmpeg with Cloudflare

Cloudflare is now offering solutions to offload FFmpeg processing, enhancing video encoding efficiency and reducing server load for developers. This integration aims to streamline video workflows and improve performance for media applications.

CodeHN5h25 pts

Show HN: Han – A Korean programming language written in Rust

Han is a new programming language developed in Rust that is designed to support Korean speakers, offering a unique approach to coding in a native language. This could enhance accessibility and learning for Korean developers.

CodeReddit6h1 pts

ISO: CV developer to continue developing on-device model & integration into app

A sports training app is seeking a CV developer to enhance its on-device model and integrate it into iOS, as the current developer lacks the necessary expertise. Interested candidates are encouraged to comment or DM for further details.

AIReddit6h1 pts

I tried running a full AI suite locally on a smartphone—and it didn't explode

A tech enthusiast successfully ran a full AI suite on a smartphone, challenging the notion that such a feat was impossible. This project highlights the potential for local AI applications, which could enhance privacy and reduce reliance on cloud services.

CodeHN6h6 pts

A Recursive Algorithm to Render Signed Distance Fields

A new recursive algorithm has been developed for rendering signed distance fields, potentially enhancing graphics rendering techniques in computer graphics and game development. This method may improve efficiency and visual quality in 3D modeling and simulations.

AIHN6h5 pts

Claude Doubles Usage Limits During Off-Peak Hours (March 13–27, 2026)

Claude is increasing usage limits during off-peak hours from March 13 to March 27, 2026, allowing users to maximize their access during these times. This change could benefit users looking to utilize the platform more extensively outside of peak hours.

AIHN6h31 pts

MCP Is Dead; Long Live MCP

MCP has officially been discontinued, but a new iteration or successor is set to emerge, promising to carry on its legacy. Stay tuned for updates on the upcoming version.

IndieHN6h29 pts

Marketing for Founders

"Marketing for Founders" offers essential strategies and insights for entrepreneurs to effectively promote their businesses and connect with customers. This resource is crucial for founders looking to enhance their marketing skills and drive growth.

AIReddit7h

[P] Karpathy's autoresearch with evolutionary database.

Karpathy's autoresearch project has been enhanced with an evolutionary database, replacing the previous TSV file logging system, which could improve the discovery of optimal solutions using evolutionary algorithms. This integration aims to streamline the research process in AI development.

CodeReddit7h2 pts

vLLM on Jetson Orin — pre-built wheel with Marlin GPTQ support (3.8x prefill speedup)

A new pre-built wheel for vLLM now supports Marlin GPTQ on Jetson Orin devices, offering a significant 3.8x speedup in prefill performance. This update enables users to fully utilize their tensor cores during GPTQ inference on the Orin family.

AIReddit7h3 pts

Has anyone managed to get an sub 16GB VRAM competent "researcher" model that can do web searching, summarization and reasoning?

The user is seeking a sub-16GB VRAM model capable of web searching, summarization, and reasoning for research purposes, and is wondering if anyone has successfully implemented such a solution. If you have insights or solutions, sharing them could help this user achieve their goal.

AIReddit7h

[P] Karpathy's autoresearch with evolutionary database.

Karpathy's autoresearch project has been enhanced with an evolutionary database, replacing the previous TSV file logging system, which could improve the autonomous discovery of optimal solutions through evolutionary algorithms. This integration may significantly boost the project's efficiency in research automation.

GeneralHN7h75 pts

UBI as a productivity dividend

Universal Basic Income (UBI) is being explored as a potential productivity booster, offering financial security that could enable individuals to pursue innovative projects and enhance overall economic output.

CodeHN7h6 pts

Show HN: Zap Code – AI code generator that teaches kids real HTML/CSS/JS

Zap Code is an AI-powered code generator designed to teach kids real HTML, CSS, and JavaScript, making coding accessible and engaging for young learners.

AIHN7h13 pts

Generalizing Knuth's Pseudocode Architecture From Algorithms to Knowledge

Researchers are expanding Knuth's pseudocode architecture to encompass not just algorithms but also knowledge representation, aiming to enhance clarity and accessibility in conveying complex concepts. This development could improve educational tools and resources in computer science.

GeneralHN7h28 pts

It's time to move your docs in the repo

It's time to relocate your documentation into the repository for better organization and accessibility.

IndieHN7h15 pts

Show HN: Ichinichi – One note per day, E2E encrypted, local-first

Ichinichi is a new note-taking app designed for daily entries, featuring end-to-end encryption and a local-first approach, ensuring user privacy and data security. It offers a simple way to document daily thoughts while keeping information secure and accessible offline.

AIHN7h12 pts

Claudetop – htop for Claude Code sessions (see your AI spend in real-time)

Claudetop is a new tool that provides real-time monitoring of AI spending during Claude Code sessions, similar to the popular system monitor htop. This allows users to track their AI resource usage more effectively.

GeneralLobsters8h1 pts

Companies House vulnerability enabled company hijacking

A security vulnerability at Companies House has been identified, allowing potential hijacking of company registrations. Businesses are urged to review their registrations and ensure their details are secure.

AIReddit8h3 pts

Has anyone else noticed that AI makes curiosity loops almost impossible to stop?

AI coding agents can create addictive "curiosity loops," making it difficult for users to disengage, similar to the effects of doom-scrolling. This phenomenon raises concerns about the impact of AI on attention and productivity.

AIReddit8h4 pts

StepFun releases SFT dataset used to train Step 3.5 Flash

StepFun has released the SFT dataset, which was utilized to train their Step 3.5 Flash model, potentially enhancing AI training and development.

AIReddit8h3 pts

Chunking for STT

A user is seeking effective methods to split 4-minute audio into 30-second segments for transcription with a fine-tuned speech-to-text (STT) model. Solutions for efficient audio chunking are needed.

AIReddit8h42 pts

Deepsek v4 confirmed to release next week

Deepsek v4 is set to be released next week, promising new features and improvements. Stay tuned for updates on its capabilities.

AIReddit8h6 pts

55 → 282 tok/s: How I got Qwen3.5-397B running at speed on 4x RTX PRO 6000 Blackwell

A custom CUTLASS kernel has significantly improved the performance of Qwen3.5-397B, achieving a speed of 282 tokens per second on 4x RTX PRO 6000 Blackwell GPUs, up from just 55 tok/s. A pre-built Docker image and a pull request to FlashInfer are now available for those interested in optimizing their setups.

AILobsters9h2 pts

Thoughts on generative A.I

The article discusses the implications and potential of generative AI, emphasizing its transformative impact across various industries and the need for ethical considerations in its development and deployment.

CodeReddit9h1 pts

Is the Lenovo Legion T7 34IAS10 a good pick for local AI/CV training?

The Lenovo Legion T7 34IAS10 is a strong contender for local AI and computer vision training, thanks to its powerful hardware specifications. Consider it if you're looking for a reliable machine to handle demanding tasks in these fields.