Job Summary:
We are looking for a driven and curious Junior Data Scientist to join our growing AI & Data
Science team. The ideal candidate will have a foundational understanding of data analysis and
machine learning, with a strong interest in emerging AI paradigms such as Retrieval-Augmented
Generation (RAG) and agentic AI systems. This role provides an exciting opportunity to work on
real-world applications of LLMs and support the development of intelligent, task-oriented agents.
Key Responsibilities:
● Clean, transform, and analyze data (structured and unstructured) from diverse sources.
● Perform exploratory data analysis to identify actionable insights.
● Support the development and evaluation of traditional and deep learning models.
● Assist in integrating LLMs into workflows using RAG pipelines.
● Collaborate with senior team members to build and refine agentic AI systems for task
automation.
● Contribute to the design and testing of knowledge retrieval systems, including vector stores
and embedding models.
● Develop dashboards, visualizations, and documentation to effectively communicate insights
and findings.
Required Skills & Qualifications:
● Bachelor’s degree in data science, Computer Science, Statistics, Engineering, or a related
discipline.
● Strong programming proficiency in Python, including experience with libraries like pandas,
numpy, scikit-learn, and either LangChain or LlamaIndex.
● Fundamental understanding of Natural Language Processing (NLP) and large language
models (LLMs).
● Hands-on experience with SQL and database interactions.
● A strong passion for advancements in Artificial Intelligence and a commitment to ongoing
learning.
● Exposure to Retrieval-Augmented Generation (RAG) architectures, including vector
databases like FAISS, Chroma, or Pinecone.
● Familiarity with LLM frameworks, such as the OpenAI API and Hugging Face Transformers.
● Knowledge of agentic AI systems, including task planning and autonomous agents
● Experience with cloud platforms (AWS, GCP, Azure) and version control systems (Git).