Vacancy caducado!
Title: Data Linguistic Analyst Location: Palo Alto, CA - remote Duration: 9 months
Responsibilities:- Manage and process large amounts of linguistic data.
- Design quality control metrics and methodology.
- Collaborate with Language Engineers and Applied Scientists in defining metrics, guidelines, and workflows.
- Use existing language processing tools to bootstrap and process data.
- Develop scripts and tools to crawl and process data.
- Oversee the quality of linguistic annotation.
- Train and assign work to vendor language consultants.
- Write data collection guidelines.
- Big Data - Training data for NLP tasks
- Working with Data Manipulation / Data Collection
- Working with 3rd parties for language guidelines
- Providing guidance in collection process
- Previous experience in linguistic data
- Python Scripting, Data processing, Python, Perl, C
- Strong communication and debugging skills
- Masters degree in Linguistics, Computational Linguistics, or a related field.
- Experience with one (or more) functional programming languages such as Python, Perl, C.
- Experience with language annotation and other forms of data markup.
- Experience working with data manipulation and processing using pandas, nltk, scikit or similar toolkits.
- Experience working with TTS, ASR or NLU systems.
- Strong communication skills.
- PhD in Linguistics, Computational Linguistics, or a relevant field.
- +2 years of experience working with Natural Language Processing, Semantics, Translation or Computational Linguistics in general.
- Practical knowledge of version control systems such as GitHub.
- Experience working with more than one language family
Vacancy caducado!