Internship: TechGuru/ Data Science with CAST Logs
Duration: 6-month
Keywords: Application Logs/Clustering/ LLM/NLP Summarization/Visualization & Reporting
About CAST
CAST is the market leader in Software Intelligence. Its unique technology thoroughly examines the structure of complex software systems, with MRI-like precision. It delivers accurate, actionable, and automated views of software architecture, critical flaws, quality grades, sizing metrics, open-source usage, and cloud readiness levels. Hundreds of companies rely on CAST for greater objectivity in crucial business decisions, faster application modernization for Cloud, raising the quality and security of their custom software.
Founded more than 25 years ago and backed by nearly $200 million in R&D, CAST’s rapid analysis technology and its advanced ‘MRI for Software’ drive IT transformation and enable automation at the world’s largest systems integrators, hundreds of Global 2000 enterprises, and government agencies, in North America, Europe, India and in China.
CAST provides two sets of products of different technologies, implementation models, and usage:
- CAST Highlight is a SaaS product capable of performing rapid application portfolio analysis. It analyzes source code of applications to measure their cloud readiness, to analyze their composition, resiliency, and technical debt.
- CAST Imaging is an on-premises product (also cloud version since 2025) that can run for reverse-engineers all database structures, code components, and interdependencies in custom-built applications. It provides interactive and accurate architecture blueprints, data-call graphs, and end-to-end transaction flows.
We work at the intersection of Software Intelligence and Artificial Intelligence, helping organizations understand and improve their applications. Our mission is to transform runtime data (logs) and static code analysis results (from CAST analyzers) into actionable insights that guide decisions on performance, maintainability, and risk reduction.
The project aims to design and implement an analytics framework that uses Data Science and AI techniques to extract valuable insights from:
- Application Logs: millions of runtime events such as performance metrics, errors, and user transactions.
- CAST Analyzer Outputs: detailed structural analysis of code, dependencies, and technologies.
By correlating these two data sources, the goal is to identify patterns linking code structure to runtime issues, uncover hidden frameworks or technologies, and provide recommendations to improve software quality and performance.
Proposed work
This project invites you to explore the hidden intelligence within software systems by bridging AI, data science, and software engineering. Your milestones will include:
- Exploring and preparing large datasets from runtime logs and CAST analyzers.
- Designing features and analytical approaches to uncover key insights.
- Applying and comparing ML or AI techniques (anomaly detection, clustering, LLM-based summarization).
- Investigating correlations between code, performance, and maintainability.
- Prototyping visualizations or models to communicate discoveries.
Team:
Interns will be joining the CAST R&D team, a dynamic and innovative group of professionals specializing in software research and development. The team consists of experienced software engineers, data scientists, and industry experts who are passionate about pushing the boundaries of software technology. Working in this team offers:
- Exposure to cutting-edge research and development in software technology.
- Opportunities for mentorship and learning from seasoned professionals.
- A collaborative environment where creativity and innovation are encouraged.
- Involvement in projects that have a tangible impact on the industry.
Required Skills
To be successful in this internship, candidates should possess:
- A Bachelor’s degree (or currently pursuing) in Computer Science, Data Science, Engineering, or a related field.
- Strong analytical skills with the ability to interpret complex data.
- Knowledge of Python and common data science libraries (pandas, numpy, scikit-learn, matplotlib).
- Familiarity with machine learning and/or NLP techniques.
- Interest in software analytics, software quality, or AI for code.
- Ability to interpret technical data and communicate insights effectively.
- Excellent problem-solving abilities and a keen attention to detail.
- Strong communication skills, both written and verbal, to effectively collaborate with the team and present findings.
Main Technology
The internship will involve working with various technologies, including but not limited to:
- Data analysis tools: Python, Jupyter, Pandas, NumPy.
- Machine learning libraries: scikit-learn, TensorFlow, PyTorch.
- Data visualization: Matplotlib, Seaborn, Plotly, or Streamlit dashboards.
- NLP and LLM frameworks: OpenAI API, Hugging Face Transformers.
- Version control & collaboration: Git, GitHub.
- Optional integrations: SQL databases, CAST Analyzer outputs, REST APIs.
When: Flexible
Where:
Position is located at CAST’s France office at Meudon, ïle-de-France: 3 Rue Marcel Allégot, 92190 – Meudon
What we offer you
Lunch - Each employee benefits from a Swile card and access to FoodChéri
Remote - the possibility of remote work up to 3 days a week.
An exceptional working environment - we are well settled in a former mansion ideally located in Meudon with a beautiful garden (10 min by train from Montparnasse).
Feedback-Friendly Culture - In CAST we believe in effective feedback, we have been since day one normalizing feedback by incorporating it in our routine and creating safe space for employees to debate about what is and isn’t working.
Career prospects - In addition to our internal mobility policy that encourages employees to move between teams and subsidiaries.
At CAST, employees are encouraged to take on more and more responsibilities during their journey.
We are always looking for talented people who want to grow together with us. Would you like to join a truly entrepreneurial company and to be a part of our exciting journey? Apply today!