Archive

Publication Timeline

18 Articles

2026

February 22

Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting

#AI #NLP #Novel Research

February 14

Agent-based Automated Claim Matching with Instruction-following LLMs

#AI #NLP #Novel Research

February 12

The 10,000x Explosion: Reproducing DeepSeek’s mHC at Scale

#AI #NLP #Technical

February 07

mHC: Manifold-Constrained Hyper-Connections

#AI #NLP #Novel Research

February 03

ConfTuner: Training Large Language Models to Express Their Confidence Verbally

#AI #NLP #Novel Research

January 29

The Zero Temperature Myth: Why "Greedy" Doesn't Always Mean "Same"

#AI #NLP #Technical

January 25

HALoGEN: Fantastic LLM Hallucinations and Where to Find Them

#AI #NLP #Novel Research

January 17

Why Diffusion Models Don’t Memorize: The Role of Implicit Dynamical Regularization in Training

#AI #Novel Research

January 11

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

#AI #NLP #Novel Research

January 04

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

#AI #NLP #Novel Research

2025

December 27

1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities

#AI #Novel Research

December 20

Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)

#AI #NLP #Novel Research

December 13

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

#AI #NLP #Novel Research

December 06

LightMem: Lightweight and Efficient Memory-Augmented Generation

#AI #NLP #Novel Research

December 01

The Cockpit of AI: A Beginner’s Guide to LLM Parameters

#AI #Technical

November 30

Inside vLLM: How This Amazing Engine Makes Large Models Lightning Fast

#AI #Technical

November 29

BriLLM: Brain-inspired Large Language Model

#AI #NLP #Novel Research

November 22

Revisiting Long-Context Modeling From Context Denoising Perspective

#AI #NLP #Novel Research