Skip to content
@stanford-futuredata

Future Data Systems

We are a CS research group building data-intensive systems

Popular repositories Loading

  1. ColBERT ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    Python 3.2k 399

  2. macrobase macrobase Public

    MacroBase: A Search Engine for Fast Data

    Java 663 126

  3. ARES ARES Public

    Automated Evaluation of RAG Systems

    Python 528 54

  4. noscope noscope Public

    Accelerating network inference over video

    Python 436 122

  5. sparser sparser Public

    Sparser: Raw Filtering for Faster Analytics over Raw Data

    C 432 55

  6. dawn-bench-entries dawn-bench-entries Public

    DAWNBench: An End-to-End Deep Learning Benchmark and Competition

    Python 261 74

Repositories

Showing 10 of 70 repositories
  • colbert-serve Public
    stanford-futuredata/colbert-serve’s past year of commit activity
    Python 2 0 0 0 Updated Jan 16, 2025
  • FrugalGPT Public

    FrugalGPT: better quality and lower cost for LLM applications

    stanford-futuredata/FrugalGPT’s past year of commit activity
    Jupyter Notebook 195 Apache-2.0 24 3 0 Updated Nov 30, 2024
  • ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    stanford-futuredata/ColBERT’s past year of commit activity
    Python 3,190 MIT 399 81 20 Updated Nov 18, 2024
  • ARES Public

    Automated Evaluation of RAG Systems

    stanford-futuredata/ARES’s past year of commit activity
    Python 528 Apache-2.0 54 12 1 Updated Nov 4, 2024
  • stk Public
    stanford-futuredata/stk’s past year of commit activity
    Python 96 Apache-2.0 20 2 0 Updated Aug 26, 2024
  • gavel Public

    Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020

    stanford-futuredata/gavel’s past year of commit activity
    Jupyter Notebook 126 MIT 32 8 2 Updated Jul 25, 2024
  • InQuest Public

    Accelerating Aggregation Queries on Unstructured Streams of Data

    stanford-futuredata/InQuest’s past year of commit activity
    Python 7 2 1 0 Updated Apr 18, 2024
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    stanford-futuredata/Megatron-LM’s past year of commit activity
    Python 34 2,554 0 2 Updated Jan 19, 2024
  • tasti Public

    Semantic Indexes for Machine Learning-based Queries over Unstructured Data (SIGMOD 2022)

    stanford-futuredata/tasti’s past year of commit activity
    Python 15 5 0 0 Updated Jan 17, 2024
  • omg Public
    stanford-futuredata/omg’s past year of commit activity
    Python 21 Apache-2.0 3 0 0 Updated Sep 20, 2023