Siri - International Computational Linguist (Turkish)

Barcelona, Barcelona, Spain
Software and Services


Posted: 7 Aug 2019
Weekly Hours: 40
Role Number: 200055884
As a linguist, you will design, analyze and improve our linguistic resources and processes to support projects related to natural language processing, such as dictionaries for speech recognition and synthesis, machine translation, text normalization, part of speech tagging, tokenization, lemmatization, inflection, transliteration and context-free grammars for data validation and information extraction. The linguist team is responsible for crafting tasks and methods for collecting and evaluating natural language data. The team builds guidelines for phonetic, phonological, discourse, semantic, syntactic and morphological projects. Do you have a broad knowledge of, and experience in, natural language understanding, phonology, phonetics, syntax, semantics, ontology, corpus linguistics, data acquisition, or any combination thereof? And do you possess multilingual expertise? Join our linguist team!

Key Qualifications

  • Interest and experience in various areas of linguistics, including pragmatics, semantics, syntax, morphology, phonology, phonetics, discourse analysis, sociolinguistics, psycholinguistics and computational linguistics.
  • Experience with theoretical, applied and computational linguistics, and with a variety of linguistic and/or ontological representations (e.g. grammar, syntax, semantics, discourse, pragmatics and inference).
  • Experience with basic programming techniques and familiarity with a mainstream programming language, such as Python, Java or C/C++ (for scripting, corpus analysis, etc.).
  • Native speaker fluency (or comparable) in Turkish.
  • Excellent oral and written communication skills in English.


Your responsibilities will include:

  • Design, analyze and improve linguistic resources (modeling assets) and processes to support internationalization projects related to natural language processing.
  • Design tasks and methods for collecting and evaluating annotated, natural language data.
  • Build guidelines and set standards for phonetic, phonological, discourse, semantic, syntactic and morphological projects.
  • Provide linguistic and operational mentorship and support to language teams.
  • Evaluate, analyze and monitor data quality and improve linguistic resources, e.g. use case templates and expansions, non-terminals and dictionaries for speech recognition and speech synthesis.
  • Contribute to natural language processing tasks assigned to language teams.
  • Identify errors and regressions and propose solutions to improve accuracy.
  • Build and develop tooling for linguistic analysis.
  • Identify needs for linguistic data, corpora and tools.

Education & Experience

- Master's degree in Linguistics, Computational Linguistics, Language Technologies, or a related field.

Additional Requirements

Desired (but not required) experience/skills:

  • Experience in crafting and implementing data-based research experiments.
  • Experience working with large quantities of natural language data, lexical resources, corpora, NLP algorithms and tools.
  • Excellent problem solving, critical thinking, and communication skills.
  • Knowledge of one or more foreign languages.

A motivation/cover letter is required in order to be considered for this position.