AI Doesn’t Reduce Work—It Intensifies It

I can get so much done, but after just an hour or two my mental energy for the day feels almost entirely depleted

ai

April 16, 2026 at 9:47:13 AM EDT

https://simonwillison.net/2026/Feb/9/ai-intensifies-work/

Opinion | The A.I. Disruption Has Arrived, and It Sure Is Fun - The New York Times

The simple truth is that I am less valuable than I used to be. It stings to be made obsolete, but it’s fun to code on the train, too. And if this technology keeps improving, then all of the people who tell me how hard it is to make a report, place an order, upgrade an app or update a record — they could get the software they deserve, too. That might be a good trade, long term.

ai · job

April 16, 2026 at 9:46:09 AM EDT

https://archive.is/N8Wqd#selection-925.0-925.379

Young People Are Falling Behind, but Not Because of AI - The Atlantic

If AI isn’t to blame for the terrible job market for young people, then what is? In nearly every sector of the economy, the pace of hiring has slowed to levels last seen shortly after the Great Recession. A job market with few hiring opportunities is especially punishing for young people entering the workforce, including those with a college degree.

job · ai

April 15, 2026 at 12:48:52 AM EDT

https://archive.is/IOdfz#selection-989.0-993.331

Significant raise of reports [LWN.net]

On the kernel security list we've seen a huge bump of reports. We were between 2 and 3 per week maybe two years ago, then reached probably 10 a week over the last year with the only difference being only AI slop, and now since the beginning of the year we're around 5-10 per day depending on the days (fridays and tuesdays seem the worst). Now most of these reports are correct, to the point that we had to bring in more maintainers to help us.

Overall I think we're going to see a much higher quality of software, ironically around the same level than before 2000 when the net became usable by everyone to download fixes. When the software had to be pressed to CDs or written to millions of floppies, it had to survive an amazing quantity of tests that are mostly neglected nowadays since updates are easy to distribute. But before this happens, we have to experience a huge mess that might last for a few years to come! Interesting times...

ai · programming

April 2, 2026 at 5:24:02 PM EDT

https://lwn.net/Articles/1065620/

First NetHack ascension, and insights into the AI capabilities it requires

each episode corresponds to a random combination of object generations, monster placements and different level variants, which in turn requires using different combinations of strategies at each episode

ai · eval

March 25, 2026 at 8:50:34 PM EDT

https://mikaelhenaff.substack.com/p/first-nethack-ascension-and-insights

Just Talk To It - the no-bs Way of Agentic Engineering | Peter Steinberger

Typical refactor work is using jscpd for code duplication, knip for dead code, running eslint’s react-compiler and deprecation plugins, checking if we introduced api routes that can be consolidated, maintaining my docs, breaking apart files that grew too large, adding tests and code comments for tricky parts, updating dependencies, tool upgrades, file restructuring, finding and rewriting slow tests, mentioning modern react patterns and rewriting code

ai

March 25, 2026 at 2:48:44 PM EDT

https://steipete.me/posts/just-talk-to-it

Attention-Residuals/Attention_Residuals.pdf at master · MoonshotAI/Attention-Residuals

ai · llm · paper · memory

March 16, 2026 at 11:05:29 AM EDT

https://github.com/MoonshotAI/Attention-Residuals/blob/master/Attention_Residuals.pdf

Amazon Admits Extensive AI Use Is Wreaking Havoc on Its Core Business

Meanwhile, management leans on programmers to heavily use AI tools, with employees previously telling the FT that the company set a target for 80 percent of developers to use AI for coding tasks at least once a week.

In sum: more coding with more AI with more human oversight, but fewer humans. We’ll see how that works out.

ai · ai_slop

March 13, 2026 at 6:08:07 PM EDT

https://futurism.com/artificial-intelligence/amazon-ai-tools-business

Yann LeCun's AMI Labs raises $1.03 billion to build world models | TechCrunch

Although AMI Labs has no plans to generate revenue for the time being, it still plans to engage with prospective customers early on

ai

March 10, 2026 at 10:52:44 AM EDT

https://techcrunch.com/2026/03/09/yann-lecuns-ami-labs-raises-1-03-billion-to-build-world-models/

Agentbase | Serverless Agent Platform for Developers

ai · agent

March 10, 2026 at 10:37:05 AM EDT

https://agentbase.sh/

[2602.10715] Locomo-Plus: Beyond-Factual Cognitive Memory Evaluation Framework for LLM Agents

Experiments across diverse backbone models, retrieval-based methods, and memory systems demonstrate that cognitive memory remains challenging and reveals failures not captured by existing benchmarks.

ai · llm · memory · agent · paper

March 8, 2026 at 4:16:04 PM EDT

https://arxiv.org/abs/2602.10715

[2603.04304v1] $V_1$: Unifying Generation and Self-Verification for Parallel Reasoners

Having generation and verification co-evolve on the same online rollouts is the fix, and the ablation (Figure 11) shows it matters — co-evolving consistently beats non-co-evolving by 4–6%.

ai · llm · rl · paper

March 7, 2026 at 9:14:49 PM EST

https://arxiv.org/abs/2603.04304v1

Vibe Coding Is the Future of Programming. How to Get Started.

Instead, he says, business leaders should prioritize creating a culture in which their employees feel empowered to experiment with vibe coding and share their best creations. “Seeing is believing,” says Schluntz, “and I think getting non-developers in every company to use these tools to bring their ideas to life is one of the most powerful things.”

According to Anthropic researcher Erik Schluntz, vibe coding makes it so that “people are limited only by their creativity, not by the skills that they have.” Think about Apple in the 1970s; Steve Jobs was the big ideas guy, and Steve Wozniak was the technical genius who translated Jobs’ ideas into a working product. Vibe coding essentially gives everyone their own personal Woz. “If you have an image of something in your mind, you can go create it,” adds Schluntz.

ai · vibecoding · agent

March 6, 2026 at 10:52:18 AM EST

https://archive.is/VVt1S#selection-1915.0-1915.352

Jido 2.0 is now available · Agent Jido

TypeScript agent frameworks felt like toys. Single-threaded event loops trying to juggle concurrent agents with promises and prayer. Python agents did a little better, but after a long time they couldn’t stay up. The BEAM was built for exactly this kind of work.

ai · elixir · agent

March 6, 2026 at 9:24:30 AM EST

https://jido.run/blog/jido-2-0-is-here

KARL: A Faster Agent for Enterprise Knowledge, powered by custom RL

While SFT distillation meaningfully improves overall performance over the base model, the gap between the two approaches is most apparent when combined with test-time compute. On in-distribution tasks, SFT benefits substantially from parallel sampling (69.1 → 75.3), yet on out-of-distribution tasks the gains are negligible (59.4 → 59.6). This suggests that distillation teaches the model to imitate task-specific expert behavior, which scales well within the training distribution but fails to generalize beyond it. In contrast, KARL benefits from test-time compute both in- and out-of-distribution, indicating that RL develops more general search capabilities rather than task-specific heuristics.
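
To make the size of that gap concrete, the gains quoted in the excerpt can be computed directly. This is only arithmetic on the four SFT scores given above; the variable and function names are illustrative, and no KARL figures appear because the excerpt does not give any.

```python
# SFT-distilled model scores quoted in the excerpt:
# (without parallel sampling, with parallel sampling)
sft_scores = {
    "in_distribution": (69.1, 75.3),
    "out_of_distribution": (59.4, 59.6),
}


def sampling_gain(before: float, after: float) -> float:
    """Absolute improvement from parallel sampling, in points."""
    # Round to one decimal to match the precision of the quoted scores.
    return round(after - before, 1)


gains = {split: sampling_gain(*pair) for split, pair in sft_scores.items()}

for split, gain in gains.items():
    print(f"{split}: +{gain} points")
```

The in-distribution gain is roughly thirty times the out-of-distribution one, which is the asymmetry the excerpt attributes to distillation.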

ai · rl · agent

March 5, 2026 at 10:10:20 PM EST

https://www.databricks.com/sites/default/files/2026-03/karl.pdf

symphony/elixir/README.md at main · openai/symphony

Why Elixir?

Elixir is built on Erlang/BEAM/OTP, which is great for supervising long-running processes. It has an active ecosystem of tools and libraries. It also supports hot code reloading without stopping actively running subagents, which is very useful during development.

elixir · ai · agent

March 5, 2026 at 10:07:33 PM EST

https://github.com/openai/symphony/blob/main/elixir/README.md

awni/mylm: Self-personalizing LM

The above command enters you into a chat loop. You can talk to the model and share information like your name. Every now and then /sleep the model to transition short-term memory to long-term memory

The /sleep command:

Generates Q&A pairs based on the context
LoRA fine-tunes the model on the new Q&A pairs plus any from previous sessions
Resets the KV cache

After the /sleep command the model should remember context from previous sessions even though that context is no longer in the KV cache.
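
The three /sleep steps described above can be sketched as a toy Python class. This is not the mylm implementation: `ToyChatSession`, its fields, and the string-matching Q&A generator are all hypothetical stand-ins, and a plain dictionary plays the role of the LoRA-tuned weights so the control flow can run without a model.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ToyChatSession:
    """Hypothetical stand-in for a chat model with short- and long-term memory."""

    kv_cache: list = field(default_factory=list)    # short-term: current context
    qa_dataset: list = field(default_factory=list)  # Q&A pairs kept across sessions
    adapter: dict = field(default_factory=dict)     # stand-in for LoRA weights

    def chat(self, message: str) -> None:
        self.kv_cache.append(message)

    def generate_qa_pairs(self) -> list:
        # Assumption: a real system would prompt the model for Q&A pairs.
        # Here we only parse statements of the form "<subject> is <value>."
        pairs = []
        for message in self.kv_cache:
            if " is " in message:
                subject, value = message.split(" is ", 1)
                pairs.append((f"What is {subject.lower()}?", value.rstrip(".")))
        return pairs

    def sleep(self) -> None:
        # 1. Generate Q&A pairs from the current context.
        self.qa_dataset.extend(self.generate_qa_pairs())
        # 2. "Fine-tune" on the new pairs plus those from previous sessions.
        self.adapter = dict(self.qa_dataset)
        # 3. Reset the KV cache.
        self.kv_cache.clear()

    def recall(self, question: str) -> Optional[str]:
        # Answers come from the adapter, not from the (now empty) context.
        return self.adapter.get(question)


session = ToyChatSession()
session.chat("My name is Ada.")
session.sleep()
print(session.kv_cache)
print(session.recall("What is my name?"))
```

After `sleep()` the context is empty, yet the fact is still retrievable, which mirrors the behavior the README describes for the real model.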

ai · llm · experiment

March 5, 2026 at 7:20:19 PM EST

https://github.com/awni/mylm

SWE-rebench Leaderboard

SWE-rebench: A Continuously Evolving and Decontaminated Benchmark for Software Engineering LLMs

ai

March 3, 2026 at 8:28:54 AM EST

https://swe-rebench.com/

Qwen3.5 - How to Run Locally Guide | Unsloth Documentation

Qwen3.5 Small models disable thinking by default. Use llama-server to enable it.

ai · llm · inference

March 2, 2026 at 11:57:34 AM EST

https://unsloth.ai/docs/models/qwen3.5#how-to-enable-or-disable-reasoning-and-thinking

Bcachefs creator claims his custom LLM is 'fully conscious' • The Register

It's not chatbot psychosis, it's 'math and engineering and neuroscience'

ai · psychosis

March 2, 2026 at 9:41:35 AM EST

https://www.theregister.com/2026/02/25/bcachefs_creator_ai/