    Repositories list

    • Sequence-level curriculum learning for addressing exposure bias in LLMs.
      Updated Apr 17, 2026
    • Repository associated with the paper "On the Comprehensibility of Multi-structured Financial Documents using LLMs and Pre-processing Tools"
      Python
      Updated Mar 31, 2026
    • Evaluate whether an LLM is fit for enterprise use cases
      Python
      Updated Mar 23, 2026
    • Benchmark comparing context management strategies for instruction-following in multi-journey LLM applications
      Apache License 2.0
      Updated Mar 19, 2026
    • Codebase for Manulife's AI research portal
      CSS
      Updated Mar 18, 2026
    • Python pipeline to generate 3000 BFSI evaluation prompts for LLM behavioral testing
      Python
      Updated Mar 3, 2026
    • benchkit (Public)
      A benchmarking toolkit for evaluating generative AI systems. Companion tool for "Benchmarking Workflow for Generative AI Systems".
      Vue
      MIT License
      Updated Feb 27, 2026