Skip to content
Change the repository type filter

All

    Repositories list

    • Outputs of the project "Generating textual resources to foster the development of language technologies for Mayan languages"
      Python
      0100Updated Feb 17, 2026Feb 17, 2026
    • LLM-KD

      Public
      Python
      0000Updated Feb 11, 2026Feb 11, 2026
    • url2lang

      Public
      url2lang infers the language of a document from its URL
      Python
      1000Updated Dec 22, 2025Dec 22, 2025
    • mayanv

      Public
      Hosts a number of bilingual Mayan-Spanish corpora
      JavaScript
      2700Updated Nov 19, 2025Nov 19, 2025
    • SMaTD

      Public
      Detection of machine translation
      Python
      0100Updated Nov 10, 2025Nov 10, 2025
    • Parallel URLs Classifier (PUC) infers the parallelness of a pair of documents from their URLs
      Python
      1000Updated Nov 4, 2025Nov 4, 2025
    • MaTiLDA

      Public
      Python
      0100Updated Oct 3, 2025Oct 3, 2025
    • Exploiting large pre-trained models for low-resource neural machine translation
      Shell
      0100Updated Oct 3, 2025Oct 3, 2025
    • PILAR

      Public
      0600Updated May 28, 2025May 28, 2025
    • Code to reproduce the experiments presented in the NAACL Findings 2025 paper "Beyond the Mode: Sequence-Level Distillation of Multilingual Translation Models fo…
      Shell
      0100Updated Apr 29, 2025Apr 29, 2025
    • Language identifier for Romance languages
      Python
      1200Updated Feb 5, 2025Feb 5, 2025
    • demint

      Public
      Repository for the project "DeMINT: Automated Language Debriefing for English Learners via AI Chatbot Analysis of Meeting Transcripts"
      Python
      0700Updated Oct 25, 2024Oct 25, 2024
    • JavaScript
      0000Updated Oct 8, 2024Oct 8, 2024
    • elrd

      Public
      Home of the English Learners Role-Playing Dialogue Dataset (ELRD).
      0000Updated Oct 7, 2024Oct 7, 2024
    • Markdown and static files for the Transducens research group's website.
      HTML
      0000Updated Jul 30, 2024Jul 30, 2024
    • Repository containing the test files for the WMT24 Shared Task: Translation into Low-Resource Languages of Spain
      0000Updated Jul 23, 2024Jul 23, 2024
    • Código fuente del libro "Diseño de compiladores"
      TeX
      0100Updated Apr 29, 2024Apr 29, 2024
    • nmt-maya

      Public
      Hosts code to train bilingual and multilingual NMT models of Mayan languages
      JavaScript
      0000Updated Mar 22, 2024Mar 22, 2024
    • Crawling engine that crawls a set of top-level domains looking for documents in a list of languages
      Python
      31123Updated Feb 6, 2024Feb 6, 2024
    • Python
      0000Updated Jan 11, 2024Jan 11, 2024
    • Code to reproduce the experiments presented in the EMNLP 2021 paper "Rethinking data augmentation for low-resource neural machine translation: a multi-task lear…
      Shell
      2410Updated Nov 28, 2023Nov 28, 2023
    • Code to reproduce the experiments reported in the paper "Cross-lingual neural fuzzy matching for exploiting target-language monolingual corpora in computer-aide…
      Java
      0100Updated Dec 9, 2022Dec 9, 2022
    • biwords

      Public
      Processing of word alignments for compressing parallel corpora
      C++
      0000Updated Oct 21, 2021Oct 21, 2021
    • Tool that allows to build a bilingual lexicon from a parallel corpus
      Shell
      0000Updated Aug 31, 2021Aug 31, 2021
    • The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
      C
      231000Updated Aug 11, 2021Aug 11, 2021
    • bayeseq

      Public
      Auto-encoding variational Bayesian inference for sequence generation models.
      Python
      10000Updated Jan 20, 2021Jan 20, 2021
    • Shell
      0200Updated Oct 12, 2020Oct 12, 2020
    • Java
      2010Updated May 4, 2020May 4, 2020
    • Developments of UA for the EU project GoURMET
      Python
      1100Updated Feb 3, 2020Feb 3, 2020
    • Script and instructions to produce a Bitextor-compatible parallel-data-extraction task from JSONL files as provided by BBC
      Python
      0100Updated Dec 20, 2019Dec 20, 2019