Statistics Specialist

4 semanas atrás


Fortaleza, Brasil Innodata Inc. Tempo inteiro

The Statistics Specialist role at Innodata Inc. focuses on developing and fine‑tuning large‑language models (LLMs) by applying advanced statistical methods, data analysis, and model evaluation techniques. Responsibilities Design and implement statistical frameworks for preparation, analysis, and quality assessment of large‑scale datasets used in training LLMs. Perform exploratory data analysis (EDA) to detect biases, data inconsistencies, and anomalies in training corpora. Collaborate closely with AI researchers and engineers to develop data annotation guidelines and evaluation protocols. Apply advanced statistical methods to assess model outputs and guide iterative training improvements. Develop and maintain automated tools for data preprocessing, labeling, sampling, and quality control in LLM pipelines. Conduct rigorous hypothesis testing, A/B testing, and performance benchmarking of LLMs across various tasks. Interpret model behaviors using statistical insights and provide actionable recommendations for model fine‑tuning. Document methodologies, maintain reproducible workflows, and contribute to technical publications. Qualifications Master’s or PhD in Statistics, Applied Mathematics, Data Science, Computer Science (strong statistics focus), or related quantitative field. Strong knowledge of statistical modeling, probability theory, and experimental design. Hands‑on experience with large‑scale datasets, data cleaning, feature engineering, and statistical analysis tools. Proficiency in programming languages such as Python, R, or Julia, including libraries like pandas, NumPy, SciPy, scikit‑learn, or statsmodels. Familiarity with machine learning frameworks (TensorFlow, PyTorch) and understanding of LLM architectures (transformers, attention mechanisms). Experience with annotation tools and data labeling processes is a plus. Strong analytical thinking, attention to detail, and problem‑solving skills. Excellent written and verbal communication skills in English. As part of the project, you are required to complete the English language assessment. Seniority level Entry level Employment type Full‑time Job function Research, Analyst, Information Technology #J-18808-Ljbffr


  • Statistics Specialist

    2 semanas atrás


    Fortaleza, Brasil Innodata Inc. Tempo inteiro

    Innodata (NASDAQ: INOD) is a leading data engineering company.With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider of choice for 4 out of 5 of the world's biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine.By...

  • Statistics Specialist

    2 semanas atrás


    Fortaleza, Ceará, Brasil Innodata Inc. Tempo inteiro R$104.000 - R$156.000 por ano

    Innodata (NASDAQ: INOD) is a leading data engineering company.With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider of choice for 4 out of 5 of the world's biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine.By...


  • Fortaleza, Brasil beBeeDataEngineering Tempo inteiro

    About Data Engineering Role We believe in and live a culture based on collaboration. Work Environment Description A healthy work environment is a place where all voices are heard, respected, and valued. Data Engineering Job Summary Data Engineer Responsibilities Develop, implement, and maintain scalable data pipelines and workflows using Apache Airflow....

  • Data Analytics Lead

    Há 4 dias


    Fortaleza, Brasil Bairesdev Tempo inteiro

    Join to apply for the Data Analytics Lead - Remote Work | REF#****** role at BairesDevJoin to apply for the Data Analytics Lead - Remote Work | REF#****** role at BairesDevAt BairesDev, we've been leading the way in technology projects for over 15 years.We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon...