
**Experienced Full Stack Data Linguist – AI Data Preparation and NLP Expert**

Remote, USA Full-time Posted 2025-11-24

At arenaflex, we're on a mission to revolutionize the way we interact with technology, and we're looking for a talented Data Linguist to join our team. As a key member of our computer-based intelligence Information Group, you'll play a crucial role in preparing and refining the data that powers our AI systems, including Large Language Models (LLMs). Your expertise in natural language processing (NLP) and data analysis will help us create the highest-quality training data, driving improvements in human language understanding and natural language processing.

**About arenaflex**

arenaflex is a leading provider of cloud-based services, and our AI-powered language solutions empower businesses to unlock new insights and drive positive outcomes. Our commitment to innovation and customer satisfaction has earned us a reputation as a trusted partner in the industry. As a Data Linguist at arenaflex, you'll be part of a dynamic team that's passionate about pushing the boundaries of what's possible with AI.

**Work/Life Balance**

At arenaflex, we believe that work-life balance is essential to long-term happiness and fulfillment. That's why we offer flexible working hours and a supportive environment that encourages you to find your own balance between work and personal life. Our team is dedicated to helping you achieve your goals, both professionally and personally.

**Mentorship and Career Development**

We're committed to supporting the growth and development of our team members. Our senior colleagues enjoy one-on-one mentoring and comprehensive code reviews, and we're building a culture that celebrates knowledge sharing and mentorship. Whether you're just starting your career or looking to take on new challenges, we'll help you develop the skills and expertise you need to succeed.

**Key Responsibilities**

As a Data Linguist at arenaflex, your key responsibilities will include:

* Developing a deep understanding of data collection and annotation rules, as well as various annotation tools
* Annotating natural language data accurately and on schedule, in line with the guidelines
* Participating in data creation, collection, and quality assurance initiatives
* Conducting in-depth analysis of data to identify subjective error patterns
* Handling special data collection and analysis requests for various NLP/NLU applications
* Collaborating with other ML Data Linguists to resolve data ambiguities and annotation conflicts
* Providing input to Language Designers on annotation rules, tooling, and processes to drive improvements
* Identifying and implementing solutions to complex problems independently
* Contributing to process improvements to reduce handling time and increase asset yield
* Developing a variety of language resources essential for model improvement, such as training and evaluation datasets

**About the Team**

The computer-based intelligence Information Group at arenaflex is responsible for delivering high-quality annotated data and a range of language resources to ensure the best performance of various AWS AI language services. These ML-based language services enable customers to quickly add intelligence to their business operations and AI applications, driving positive outcomes.

**Essential Capabilities**

To succeed as a Data Linguist at arenaflex, you'll need:

* A bachelor's degree in Linguistics, Communication, Cognitive Science, or a related field, with a foundation in phonetics, semantics, pragmatics, discourse analysis, and/or speech analysis
* At least six months of experience in NLP annotation and various types of data markup
* Native or near-native proficiency in English (US) (CEFR C1 or above)
* Excellent communication, strong organizational skills, and a keen eye for detail
* Ability to work in a fast-paced, highly collaborative, and dynamic environment
* Capacity to handle multiple tasks simultaneously and adapt to changing priorities

**Preferred Capabilities**

While not required, the following skills and experiences will be highly valued:

* Ability to quickly learn new rules, technical concepts, and programming tools
* Experience with command-line interfaces and basic Unix commands
* Knowledge of common text processing tools
* Working knowledge of various file formats and markup languages (e.g., JSON, XML, HTML)
* Familiarity with clinical terminology or experience working in a clinical or healthcare setting
* Enthusiasm for language, semantics, human language technology, and AI
* Basic to intermediate skills in at least one common programming or markup language (e.g., Python, JavaScript, HTML)

**arenaflex is an Equal Opportunity Employer**

arenaflex is committed to creating a diverse and inclusive work environment. We're an equal opportunity employer and do not discriminate based on race, ethnicity, national origin, gender, sexual orientation, gender identity, protected veteran status, disability, age, or other legally protected status. If you require accommodations due to a disability, please let us know.

**Apply Now**

If you're passionate about NLP, data analysis, and AI, and you're looking for a challenging and rewarding career opportunity, we encourage you to apply.
Join our team at arenaflex and help us shape the future of language technology.
