ML Data Linguist (FTC), Amazon Comprehend

25 Nov 2024

Vacancy expired!

DescriptionJob summaryAmazon Web Services (AWS) is looking for a data associate to help with annotations and data analysis.As part of the AiData Team at AWS you will responsible for delivering high-quality training data to ensure the best performance of the AWS machine learning systems. Our goal is to produce the highest quality training data in the industry and to delight our customers by improving human language understanding and natural language processing. This is a Fixed Term Contractor role. Initial Term is 12 months with a possibility of extension once on-boarded.Responsibilities:· Helps define requirements (e.g., tools, training, data collection protocols, etc.) for multiple projects at a given time.· Building a thorough understanding of annotation conventions and train junior Data Associates on applying these in annotation and transcription tasks.· Annotate text data, identifying linguistic categories based on detailed annotation guidelines.· Collect and organize text data from online sources.· Collaborate in defining data quality metrics and verifies metric (task, system performance, etc.) accuracy and makes sure delivery is quality compliant.· Perform error trend analysis and create action plans to improve data quality· Provide feedback to Language Engineers and Scientists on tool improvements and annotation processes.· Diving deep into issues and implement solutions independently· Contribute to process improvements to reduce handling time and improve resource output.About UsInclusive Team CultureHere at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 14 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.Work/Life BalanceOur team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.Mentorship & Career GrowthOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded engineer and enable them to take on more complex tasks in the future.BASIC QUALIFICATIONS· Bachelor's degree in a relevant field, such as Linguistics, Communications, a foreign language, or other language or data-related disciplines.· Native or near-native English speaker.· Experience with data annotation, linguistic annotation and other forms of data markup.· Experience identifying linguistic ambiguity and annotation inaccuracies in data.PREFERRED QUALIFICATIONS· Master's degree in a relevant field, such as Linguistics, Communications, a foreign language, or other language or data-related disciplines.· Proficient in Spanish, French, German, Hindi, Japanese, Mandarin, Korean, Portuguese, Dutch or another foreign language.· Depth and breadth of knowledge in linguistic theory and/or applied linguistics.· Familiarity with common text processing tools.· Familiarity with json, yaml, xml or other forms of text markup.· Ability to work in different operating systems (Windows, MacOS, or Linux).· Ability to navigate a Unix terminal and use common command line tools.· Ability to strictly adhere to annotation guidelines, think abstractly about language, and identify basic parts of speech.· Excellent communication and organizational skills.· Ability to work collaboratively with other data associates on a team.· Ability to deliver high quality results under tight deadlines.· Comfortable working in a fast paced, collaborative work environment.· Passion for language, linguistics, human language technology and AI.Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.

Full-time
  • ID: #23485981
  • State: California Santaclara 95050 Santaclara USA
  • City: Santaclara
  • Salary: USD TBD TBD
  • Showed: 2021-11-25
  • Deadline: 2022-01-24
  • Category: Et cetera