Lucía Ormaechea Grijalba

profile-pic

About Me.

Ph.D. Candidate, Computational Linguist & NLP Researcher

Welcome to my website 👋 My name is Lucía and I am a NLP Researcher and Ph.D. Candidate in Multilingual Information Processing at University of Geneva (FTI/TIM) and Grenoble Computer Science Laboratory (LIG/GETALP), as part of ANR/FNS PROPICTO project.

I hold a B.A. in Hispanic Philology from University of Navarre (Pamplona, Spain) and a M.Sc. in Natural Language Processing from Institut National des Langues et Civilisations Orientales (Paris, France). My main research focuses on Automatic Speech Recognition and Text Simplification systems.

Feel free to contact me for any further information 😀



Personal Information

  • NameLucía
  • Last NameOrmaechea Grijalba
  • FromPamplona, Spain
  • ResidenceGeneva, Switzerland

Research Interests

Speech Processing

Machine Translation

Text Simplification

Multimodal Systems

Resume.

Experience

  • Ph.D. Candidate (FNS Candoc Fellow)

    Swiss National Science Foundation (FNS) | Geneva, Switzerland

    As a Candoc grant-holder, I am pursuing a joint Ph.D. between the Department of Translation Technology (TIM) of University of Geneva and Grenoble Computer Science Laboratory (LIG).

    My work falls within the framework of PROPICTO, a research project that aims to create Speech-to-Pictograph translation systems.

    Present 12.2020
  • NLP Research Intern

    Grenoble Computer Science Lab. | Grenoble, France

    Participation within the BabelDr project: development of an Automatic Speech Recognition (ASR) system for medical-related applications.


    Main tasks:
    • Created resources for grammar-based language models.
    • Containerized ASR-related tools using Docker.
    • Trained HMM-DNN-based acoustic models.
    • Developed a Kaldi web-server API.
    • Conducted a prototype testing and evaluation.

    09.2020 02.2020
  • Translation Intern

    New York Habitat | Remote Working

    ENG > ESP translation of commercial texts:
    • Video transcriptions.
    • Travel articles.
    • Apartment reviews.
    • Client testimonials.

    09.2018 07.2018
  • Undergraduate Research Assistant

    University of Navarre | Pamplona, Spain

    Main tasks:
    • Development of educational materials.
    • Document classification.
    • Proofreading.
    • One-on-one tutorial assessment on academic writing.
    • Targeted classes to non-native Spanish speakers.

    06.2017 09.2016

Education

  • Ph.D. – Multilingual Information Processing

    University of Geneva | Geneva, Switzerland

    My main research aims to explore neural-based approaches for Automatic Text Simplification (ATS), focusing on a spoken modality as an input.


    Keywords: Automatic Speech Recognition, Text Simplification, Machine Translation.

    Present 12.2020
  • Master of Science – Natural Language Processing

    Inalco | Paris, France

    Graduated with High Honors.


    Coursework:
    • Programming in Python, Bash, Perl, C+, Java.
    • Text Mining.
    • Statistical Methods for corpus exploitation.
    • Neural Networks for Speech Recognition.
    • Corpus Linguistics.

    09.2020 09.2018
  • Bachelor of Arts – Hispanic Philology

    University of Navarre | Pamplona, Spain

    Extraordinary End-of-Degree Award Nominee.


    Coursework:
    • Phonetics and Phonology.
    • Lexicology and Semantics.
    • Sociolinguistics and Dialectal Variation.
    • Discourse Analysis.
    • Morphology and Syntax.

    06.2018 09.2014

Languages

Spanish

100%

English

95%

French

95%

Italian

22%

Skills

Programming

  • Python
  • Bash
  • Perl
  • C++
  • Java
  • SQL

Libraries

  • OpenFST
  • Keras
  • Pandas
  • NLTK
  • SpaCy

Web dev

  • HTML
  • CSS
  • Jekyll
  • Flask
  • XML
  • XSLT

Tools

  • Kaldi
  • Git
  • LaTeX
  • Docker
  • Praat
  • SRILM

Publications.



Conference Papers

Une chaîne de traitements pour la simplification automatique de la parole et sa traduction automatique vers des pictogrammes

Cécile Macaire, Lucía Ormaechea Grijalba and Adrien Pupier
In: 29e Conférence sur le Traitement Automatique des Langues Naturelles (TALN), Avignon (France).
June 2022



Presentations

A Tool for Easily Integrating Grammars as Language Models into the Kaldi Speech Recognition Toolkit

Lucía Ormaechea Grijalba, Benjamin Lecouteux, Pierrette Bouillon and Didier Schwab
In: Bridges and Gaps between Formal and Computational Linguistics (ESSLLI 2022 workshop), Galway (Ireland).
August 2022



Reconnaissance vocale du discours spontané pour le domaine médical

Lucía Ormaechea Grijalba, Pierrette Bouillon, Johanna Gerlach, Benjamin Lecouteux, Didier Schwab and Hervé Spechbach
In: Journée Commune AFIA/TLH: Technologies du Langage Humain et Santé (Remote Event).
February 2021



Posters

Integrating Grammar-Based Language Models into Domain-Specific Speech Recognition Systems

Lucía Ormaechea Grijalba
In: Second Advanced Language Processing School (ALPS), co-organized by Univ. Grenoble-Alpes and Naver Labs Europe (Remote Event).
January 2022



Master's Thesis

Mise en place d'un système robuste de reconnaissance automatique de la parole appliqué au domaine médical

Lucía Ormaechea Grijalba
September 2020


Terminal.