Statistics Specialist

2 semanas atrás


Salvador, Brasil Innodata Inc. Tempo inteiro

About the Role At Innodata, we’re working with the world's largest technology companies on the next generation of generative AI and large language models (LLMs). Key Responsibilities Design and implement statistical frameworks for the preparation, analysis, and quality assessment of large-scale datasets used in training LLMs. Perform exploratory data analysis (EDA) to detect biases, data inconsistencies, and anomalies in training corpora. Collaborate closely with AI researchers and engineers to develop data annotation guidelines and evaluation protocols. Apply advanced statistical methods to assess model outputs and guide iterative training improvements. Develop and maintain automated tools for data preprocessing, labeling, sampling, and quality control in LLM pipelines. Conduct rigorous hypothesis testing, A/B testing, and performance benchmarking of LLMs across various tasks. Interpret model behaviors using statistical insights and provide actionable recommendations for model fine-tuning. Document methodologies, maintain reproducible workflows, and contribute to technical publications. Key Qualifications Master’s or PhD in Statistics, Applied Mathematics, Data Science, Computer Science (with strong statistics focus), or related quantitative field. Strong knowledge of statistical modeling, probability theory, and experimental design. Hands‑on experience with large‑scale datasets, data cleaning, feature engineering, and statistical analysis tools. Proficiency in programming languages such as Python, R, or Julia, including libraries like pandas, NumPy, SciPy, scikit‑learn, or statsmodels. Familiarity with machine learning frameworks (TensorFlow, PyTorch) and understanding of LLM architectures (transformers, attention mechanisms). Experience with annotation tools and data labeling processes is a plus. Strong analytical thinking, attention to detail, and problem‑solving skills. Excellent written and verbal communication skills in English. As part of the project, you are required to complete the English language assessment. Assessment The assessment is mandatory & non‑billable. If interested, kindly share your updated resume at: ****** #J-18808-Ljbffr



  • Salvador, Brasil Innodata Inc. Tempo inteiro

    About the RoleAt Innodata, we're working with the world's largest technology companies on the next generation of generative AI and large language models (LLMs).Key ResponsibilitiesDesign and implement statistical frameworks for the preparation, analysis, and quality assessment of large-scale datasets used in training LLMs.Perform exploratory data analysis...


  • Salvador, Brasil Bebeeanalysis Tempo inteiro

    The ideal candidate will leverage big data and analytics expertise to drive business intelligence across various areas.They will conduct both recurring and ad-hoc analysis for stakeholders, understanding day-to-day challenges that can be better addressed with data.Key ResponsibilitiesCompile and analyze data related to business issues.Develop clear...


  • Salvador, Brasil Bebeedatadriven Tempo inteiro

    Product Analyst PositionThe ideal candidate for this position will be responsible for analyzing data and identifying opportunities for revenue growth.This includes examining customer behavior, market trends, and product performance to inform strategic decisions.This role involves working closely with cross-functional teams to ensure alignment on goals and...


  • Salvador, Brasil Trilogy Tempo inteiro

    OverviewJoin to apply for the Math Subject Matter Expert, 2 Hour Learning (Remote) - $100,000/year USD role at Trilogy.This range is provided by Trilogy.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.Base pay range$*****/hr - $*****/hrCan you see the patterns in numbers that others miss?Imagine...