Statistical Data Analyst

1 day ago


Ituiutaba, Brazil Innodata Inc. Full-time

Innodata (NASDAQ: INOD) is a leading data engineering company.
With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider of choice for 4 out of 5 of the world's biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine.
By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we're helping usher in the promise of AI.
Innodata offers a powerful combination of both digital data solutions and easy-to-use, high-quality platforms.
Our global workforce includes over 5,000 employees in the United States, Canada, the United Kingdom, the Philippines, India, Sri Lanka, Israel, and Germany.
At Innodata, we're working with the world's largest technology companies on the next generation of generative AI and large language models (LLMs).
We are seeking a highly skilled Data Scientist with a Master's or PhD in Statistics, Applied Mathematics, or a related quantitative field to join our cutting-edge AI team.
In this role, you will contribute to the development, training, and fine-tuning of Large Language Models (LLMs) by applying statistical methods, data analysis, and model evaluation techniques.
Your expertise will directly impact the accuracy, performance, and robustness of advanced AI systems deployed for real-world applications.
Design and implement statistical frameworks for the preparation, analysis, and quality assessment of large-scale datasets used in training LLMs.
Perform exploratory data analysis (EDA) to detect biases, data inconsistencies, and anomalies in training corpora.
Collaborate closely with AI researchers and engineers to develop data annotation guidelines and evaluation protocols.
Apply advanced statistical methods to assess model outputs and guide iterative training improvements.
Develop and maintain automated tools for data preprocessing, labeling, sampling, and quality control in LLM pipelines.
Conduct rigorous hypothesis testing, A/B testing, and performance benchmarking of LLMs across various tasks.
Master's or PhD in Statistics, Applied Mathematics, Data Science, Computer Science (with strong statistics focus), or related quantitative field.
Hands-on experience with large-scale datasets, data cleaning, feature engineering, and statistical analysis tools.
Proficiency in programming languages such as Python, R, or Julia, including libraries like pandas, NumPy, SciPy, scikit-learn, or statsmodels.
Familiarity with machine learning frameworks (TensorFlow, PyTorch) and understanding of LLM architectures (transformers, attention mechanisms).
Experience with annotation tools and data labeling processes is a plus.
Excellent written and verbal communication skills in English.
As part of the project, you are required to complete the English language assessment.



  • Ituiutaba, Brazil Bebeestatistical Full-time

    We are seeking a highly skilled professional to contribute to the development and review of high-quality content related to statistical analysis. Job Title: Statistical Expert. Develop, review, and validate statistical models and data-driven solutions. Ensure accuracy, clarity, and adherence to academic and industry standards. Collaborate with cross-functional...

  • Data Scientist

    3 days ago


    Ituiutaba, Brazil Applaudo Full-time

    About You: You are someone who wants to influence your own development. You're looking for a company where you have the opportunity to pursue your interests and grow professionally. You are a data-driven professional passionate about turning complex data into actionable insights. You thrive on using advanced analytics, statistical modeling, and...


  • Ituiutaba, Brazil Bebeedatascientist Full-time

    Job Title: Quantitative Modeling Specialist. About the Role: We are seeking a highly skilled Quantitative Modeling Specialist to join our cutting-edge AI team. Key Responsibilities: Contribute to the development, training, and fine-tuning of Large Language Models (LLMs) by applying statistical methods, data analysis, and model evaluation techniques. Design and...


  • Ituiutaba, Brazil Bebeevisualization Full-time

    Job Title: Data Visualization Specialist. We are seeking a highly skilled and experienced Data Visualization Specialist to join our team. As a key member of our organization, you will be responsible for designing and implementing data visualization solutions that meet the needs of our stakeholders. About the Role: The successful candidate will have a strong...


  • Ituiutaba, Brazil Improvado Full-time

    Improvado is an AI-powered, unified platform designed for marketing teams in medium to large-scale enterprises and agencies that want to automate complex marketing intelligence and reporting to make decisions with ease. Improvado gathers, organizes, and untangles marketing data to deliver instant insights through BI and AI, helping to eliminate...


  • Ituiutaba, Brazil Bebeefraud Full-time

    Fraud Business Analyst Intern. Role Overview: The organization seeks a detail-oriented individual to enhance fraud defenses by supporting process audits, rule tuning, and data-driven investigations. Key responsibilities include: Auditing and validating existing fraud rules. Quality assurance reviews of robotic process automations. Analyzing triggers and evaluating...

  • Data Pipeline Expert

    1 day ago


    Ituiutaba, Brazil Bebeedata Full-time

    Expert Data Pipeline Architect. We are seeking a seasoned expert to craft and maintain scalable data pipelines on AWS, ensuring optimal performance, quality, and security. Key Responsibilities: Develop and optimize Extract-Transform-Load (ETL) pipelines with AWS Glue. Collaborate with data scientists and analysts to integrate data from multiple sources and...


  • Ituiutaba, Brazil Bebeedatascience Full-time

    Job Overview: We are seeking a skilled professional to join our team as a Data Insights Specialist. Cultivate and refine large datasets from multiple sources to guarantee accuracy and reliability. Develop and implement sophisticated statistical models and machine learning algorithms to generate valuable insights and forecasts. Design and maintain interactive data...


  • Ituiutaba, Brazil Bebeedataengineering Full-time

    Data Engineering Specialist. Our organization is seeking a skilled and experienced Data Engineer to join our Threat Research team. The primary responsibility of this role will be to design, develop, and maintain data pipelines for threat intelligence ingestion, validation, and export automation flows. Responsibilities: Design, develop, and maintain data pipelines...


  • Ituiutaba, Brazil Bebeedatadriven Full-time

    Transform your career into a rewarding journey with data-driven success. We are seeking talented Data Engineers to join our team, where you will have the opportunity to expand your skills and contribute to our growth. We believe that collaboration and learning are key to achieving excellence, which is why we foster a culture of unlimited potential and mutual...