Skip to content
@mit-han-lab

MIT HAN Lab

Efficient AI Computing. PI: Song Han

Pinned Loading

  1. streaming-llm streaming-llm Public

    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks

    Python 7.2k 394

  2. llm-awq llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python 3.4k 289

  3. efficientvit efficientvit Public

    Efficient vision foundation models for high-resolution generation and perception.

    Python 3.2k 231

  4. bevfusion bevfusion Public archive

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python 3k 543

  5. temporal-shift-module temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    Python 2.2k 422

  6. once-for-all once-for-all Public

    [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

    Python 1.9k 345

Repositories

Showing 10 of 64 repositories
  • fouroversix Public

    Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling”

    mit-han-lab/fouroversix’s past year of commit activity
    Python 117 MIT 6 2 0 Updated Feb 2, 2026
  • vlash Public

    Real-Time VLAs via Future-state-aware Asynchronous Inference.

    mit-han-lab/vlash’s past year of commit activity
    Python 302 Apache-2.0 15 14 1 Updated Jan 30, 2026
  • Block-Sparse-Attention Public

    A sparse attention kernel supporting mix sparse patterns

    mit-han-lab/Block-Sparse-Attention’s past year of commit activity
    C++ 447 BSD-3-Clause 44 10 0 Updated Jan 18, 2026
  • fastrl Public

    [ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

    mit-han-lab/fastrl’s past year of commit activity
    Python 130 Apache-2.0 11 5 0 Updated Dec 6, 2025
  • flash-moba Public
    mit-han-lab/flash-moba’s past year of commit activity
    C++ 221 BSD-3-Clause 7 2 0 Updated Nov 20, 2025
  • radial-attention Public

    [NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation

    mit-han-lab/radial-attention’s past year of commit activity
    Python 579 Apache-2.0 32 17 1 Updated Nov 11, 2025
  • torchquantum Public

    A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.

    mit-han-lab/torchquantum’s past year of commit activity
    Jupyter Notebook 1,601 MIT 244 69 (4 issues need help) 25 Updated Oct 27, 2025
  • streaming-vlm Public

    StreamingVLM: Real-Time Understanding for Infinite Video Streams

    mit-han-lab/streaming-vlm’s past year of commit activity
    Python 863 MIT 57 22 0 Updated Oct 15, 2025
  • efficientvit Public

    Efficient vision foundation models for high-resolution generation and perception.

    mit-han-lab/efficientvit’s past year of commit activity
    Python 3,226 Apache-2.0 231 108 0 Updated Sep 5, 2025
  • llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    mit-han-lab/llm-awq’s past year of commit activity
    Python 3,428 MIT 289 169 10 Updated Jul 17, 2025