Articles

9709 articles
2026-03-18T01:06:50.000Z
Proposes System A/B/M architecture for autonomous AI learning — passive observation, active exploration, and meta-control switching — inspired by biological cognition.
2026-03-17T21:28:29.000Z
OpenAI releases GPT-5.4 mini (2x faster than 5 mini, 400K context, $0.75/1M input) and nano for high-volume workloads, designed for subagents, coding, and computer use.
2026-03-17T20:56:17.000Z
Research proposes distributed systems theory as a principled framework for LLM agent teams — drawing parallels between multi-agent AI and classical distributed computing.
2026-03-17T20:55:53.000Z
Shift from reviewing to verifying AI-generated code using property-based tests, mutation testing, and constraint enforcement — treating AI output like compiled code.
2026-03-17T18:10:37.000Z
Speed at the Cost of Quality: How Cursor AI Impacts Open Source Development
2026-03-17T18:10:25.000Z
Leanstral: Open-Source Foundation for Trustworthy AI Code Agents
2026-03-17T17:05:55.000Z
Open-source MCP server for real-time flight and satellite tracking with AI assistant integration.
2026-03-17T16:56:11.000Z
Hello everyone, I've been working on mlx-tune, an open-source library for fine-tuning LLMs natively on Apple Silicon using MLX. I built this because I use Unsloth daily on cloud GPUs, but wanted to p
2026-03-17T16:56:08.000Z
Been running autoresearch for about a week. ~100 experiments per night on an H100. The keep rate is around 15%. The problem isn't the keep/discard loop. That works. The problem is that some of those k
2026-03-17T03:00:05.000Z · ★ 100
Microsoft CTO Kevin Scott on how large language models and generative AI are transforming the knowledge economy — from GitHub Copilot to protein folding and beyond.
2026-03-17T02:59:55.000Z · ★ 100
SpeciesNet, Google's open-source AI model, identifies ~2,500 species in camera trap photos — used by conservation groups worldwide from Serengeti to Colombia to process millions of images.
2026-03-17T02:59:50.000Z · ★ 100
Gemini in Google Sheets achieves 70.48% on SpreadsheetBench — state-of-the-art for AI spreadsheet manipulation, nearing human expert ability.
2026-03-17T02:59:45.000Z · ★ 100
Google invests $1M AUD in AI-powered Population Health AI to identify hidden heart disease risks in rural Australia, where remote communities face 60% higher mortality rates.
2026-03-17T02:59:36.000Z · ★ 100
Google DeepMind launches Nano Banana 2, combining Pro-quality image generation with Flash speed — featuring subject consistency, text rendering, and real-time world knowledge.
2026-03-17T02:59:31.000Z · ★ 100
Gemini 3.1 Flash-Lite delivers 2.5x faster time to first token than 2.5 Flash at $0.25/1M input tokens, with state-of-the-art benchmark scores for its tier.
2026-03-17T02:59:25.000Z · ★ 100
March 10, 2026 · Research · Demis Hassabis · Ten years ago, our AI system AlphaGo became the first program to defeat a world champion at the complex game of Go - reaching a milestone in the field a decade be
2026-03-17T02:59:05.000Z · ★ 100
OpenAI reframes prompt injection defense as social engineering risk management — designing systems where manipulation impact is constrained architecturally, not just filtered at input.
2026-03-17T02:59:00.000Z · ★ 100
Rakuten achieves ~50% faster incident recovery using OpenAI Codex, integrating it across CI/CD, monitoring, and full-stack development workflows.
2026-03-17T02:58:55.000Z · ★ 100
For decades, static application security testing (...
2026-03-01T04:53:48.000Z · ★ 83
Practical lessons from treating Google AI Studio as a coding teammate — why setting boundaries with AI tools matters more than maximizing their output.