
Unlocking Insights: Data Scientist for Large Language Models
2 semanas atrás
This role involves contributing to the development, training, and fine-tuning of large language models (LLMs) by applying statistical methods, data analysis, and model evaluation techniques.
Key Responsibilities:- Design and implement statistical frameworks for large-scale datasets used in LLMs.
- Perform exploratory data analysis (EDA) to detect biases, data inconsistencies, and anomalies in training corpora.
- Collaborate with AI researchers and engineers to develop data annotation guidelines and evaluation protocols.
- Apply advanced statistical methods to assess model outputs and guide iterative training improvements.
- Develop automated tools for data preprocessing, labeling, sampling, and quality control in LLM pipelines.
- Conduct rigorous hypothesis testing, A/B testing, and performance benchmarking of LLMs across tasks.
- Interpret model behaviors using statistical insights and provide recommendations for model fine-tuning.
- Document methodologies and maintain reproducible workflows.
- Master's or PhD in Statistics, Applied Mathematics, Data Science, Computer Science, or related quantitative field.
- Strong knowledge of statistical modeling, probability theory, and experimental design.
- Experience with large-scale datasets, data cleaning, feature engineering, and statistical analysis tools.
- Proficiency in programming languages such as Python, R, or Julia, including libraries like pandas, NumPy, SciPy, scikit-learn, or statsmodels.
- Familiarity with ML frameworks (TensorFlow, PyTorch) and understanding of LLM architectures.
- Strong analytical thinking, attention to detail, and problem-solving skills.
- Excellent written and verbal communication skills.
As part of this project, an English language assessment is required. Please share your updated resume at tsingh3@innodata.com if interested.
Seniority level:- Associate
- Part-time
- Data Science and Analytics
- Technology and IT Services
-
Senior Data Scientist
3 semanas atrás
Brasil Copoly Tempo inteiroWe're seeking a Senior Data Scientist to lead the development of advanced AI solutions that power deep research, scientific reasoning, and strategic decision-making. You'll work with large language models (LLMs), retrieval-augmented generation (RAG) systems, and Graph-RAG architectures to create intelligent tools that extract and synthesize insights from...
-
Data scientist
Há 3 dias
Brasil AGM Tech Solutions - A Woman And Latina-owned IT Staffing Firm-an Inc. 5000 Company Tempo inteiroPosition Title: Data Scientist Position Summary: Programming proficiency in Python and R, with experience in other languages (e.g., SQL) for data manipulation and retrieval Proficient in data wrangling and preprocessing with tools like Pandas, Num Py, and experience in managing both structured and unstructured data Hands-on experience with large language...
-
Data Scientist
2 semanas atrás
Brasil AGM Tech Solutions - A Woman and Latina-owned IT Staffing Firm-an Inc. 5000 company Tempo inteiroResponsibilitiesProgramming proficiency in Python and R, with experience in other languages (e.g., SQL) for data manipulation and retrievalProficient in data wrangling and preprocessing with tools like Pandas, NumPy, and experience in managing both structured and unstructured dataHands-on experience with large language models (LLMs) and understanding of...
-
Data Scientist
Há 7 dias
Brasil AGM Tech Solutions - A Woman and Latina-owned IT Staffing Firm-an Inc. 5000 company Tempo inteiroResponsibilitiesProgramming proficiency in Python and R, with experience in other languages (e.g., SQL) for data manipulation and retrieval Proficient in data wrangling and preprocessing with tools like Pandas, NumPy, and experience in managing both structured and unstructured data Hands-on experience with large language models (LLMs) and understanding of...
-
Data Scientist
2 semanas atrás
Brasil AGM Tech Solutions - A Woman and Latina-owned IT Staffing Firm-an Inc. 5000 company Tempo inteiroPosition Title: Data Scientist Position Summary: Programming proficiency in Python and R, with experience in other languages (e.G., SQL) for data manipulation and retrieval Proficient in data wrangling and preprocessing with tools like Pandas, NumPy, and experience in managing both structured and unstructured data Hands-on experience with large language...
-
Data Scientist
Há 3 dias
Vitória Brasil Agm Tech Solutions - A Woman And Latina-Owned It Staffing Firm-An Inc. 5000 Company Tempo inteiroPosition Title: Data Scientist Position Summary: Programming proficiency in Python and R, with experience in other languages (e.g., SQL) for data manipulation and retrieval Proficient in data wrangling and preprocessing with tools like Pandas, Num Py, and experience in managing both structured and unstructured data Hands-on experience with large language...
-
Data Scientist
Há 2 dias
Brasil AGM Tech Solutions - A Woman and Latina-owned IT Staffing Firm-an Inc. 5000 company Tempo inteiroPosition Title: Data Scientist Position Summary: Programming proficiency in Python and R, with experience in other languages (e.g., SQL) for data manipulation and retrieval Proficient in data wrangling and preprocessing with tools like Pandas, NumPy, and experience in managing both structured and unstructured data Hands-on experience with large...
-
Data Scientist
2 semanas atrás
Brasil AGM Tech Solutions - A Woman and Latina-owned IT Staffing Firm-an Inc. 5000 company Tempo inteiroAt AGM Tech Solutions, we partner with high impact enterprise and mid size clients to solve complex challenges through top-tier IT Talent. Our Direct client is hiring: Position Title: Data Scientist Location: Remote Contract Position Candidates must need to be incorporated as a PJ in Brazil Position Summary: Programming proficiency in Python and R, with...
-
Data Scientist
2 semanas atrás
Brasil AGM Tech Solutions - A Woman and Latina-owned IT Staffing Firm-an Inc. 5000 company Tempo inteiroAt AGM Tech Solutions, we partner with high impact enterprise and mid size clients to solve complex challenges through top-tier IT Talent. Our Direct client is hiring:Position Title: Data ScientistLocation: RemoteContract PositionCandidates must need to be incorporated as a PJ in BrazilPosition Summary:Programming proficiency in Python and R, with experience...
-
Data Scientist
2 semanas atrás
Brasil AGM Tech Solutions - A Woman and Latina-owned IT Staffing Firm-an Inc. 5000 company Tempo inteiroAt AGM Tech Solutions, we partner with high impact enterprise and mid size clients to solve complex challenges through top-tier IT Talent. Our Direct client is hiring:OverviewPosition Title: Data ScientistLocation: RemoteContract PositionCandidates must be incorporated as a PJ in BrazilResponsibilitiesProgramming proficiency in Python and R, with experience...