-
NTT DATA, Inc.

Develop a Data Lake Infrastructure, from zero to hero (6 months internship)

NTT DATA, Inc.
Luxembourg · Full-time · Entry

Make an impact with NTT DATA

Join a company that is pushing the boundaries of what is possible. We are renowned for our technical excellence and leading innovations, and for making a difference to our clients and society. Our workplace embraces diversity and inclusion – it’s a place where you can grow, belong and thrive.

Your day at NTT DATA

The primary objective of this internship is to design, implement, and exploit a data lake for storing and analyzing heterogeneous data, particularly data from monitoring and log systems. Initially, the intern will be tasked with designing a data lake infrastructure to centralize these data sources, and in a second phase, to exploit the collected data to generate reports that can be used by business teams.

What You'll Be Doing

Key Roles and Responsibilities:

Study and understanding of key concepts: Grasp the fundamentals of monitoring and logging, as well as their role in collecting infrastructure and application data.

Data lake design:

Define the architecture of the data lake from the scratch, taking into account the specificities of heterogeneous data.

Design a metadata management system to catalog and structure the stored information.

Select the appropriate technologies for the creation of the data lake (e.g., Hadoop, Spark, AWS S3, etc.).

Build the infrastructure (ex: Spark, Kafka...) for the Data Lake.

Technical implementation:

Deploy the data lake infrastructure.

Design and implement the associated metadata model.

Data ingestion into the lake:

  • Collect and load raw primary data (monitoring, logs) into the lake.
  • Enrich the lake with documentation and descriptions of the data to facilitate future exploitation.

Data exploitation and valorization:

  • Implement reporting tools to generate standardized reports usable by business teams.
  • Automate data workflows and report generation if possible.

Required Skills:

Last year of Masters degree in Big Data/Data Science or any relevant degree

Familiarity with Linux

Familiarity with virtualization

Familiarity with building and implementing infrastructure (Spark, Kafka...)

Familiarity with large-scale data processing (Big Data) and related technologies (Hadoop, Spark, etc.).

Knowledge of databases (SQL/NoSQL).

Experience or interest in data lakes and metadata technologies.

Ability to work with large volumes of heterogeneous data (logs, monitoring, etc.).

Preferred Skills:

Devops competencies

An experience in containerization with, for example, Docker

Regular use of a Homelab

Familiarity with data visualization and reporting tools (Tableau, Power BI, etc.).

Workplace type:

Hybrid Working

About NTT DATA

NTT DATA is a $30+ billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long-term success. We invest over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure, and connectivity. We are also one of the leading providers of digital and AI infrastructure in the world. NTT DATA is part of NTT Group and headquartered in Tokyo.

Equal Opportunity Employer

NTT DATA is proud to be an Equal Opportunity Employer with a global culture that embraces diversity. We are committed to providing an environment free of unfair discrimination and harassment. We do not discriminate based on age, race, colour, gender, sexual orientation, religion, nationality, disability, pregnancy, marital status, veteran status, or any other protected category. Join our growing global team and accelerate your career with us. Apply today.

Key Skills

Ranked by relevance

spark hadoop ai s3 aws sql kafka tableau containerization
Login to Apply
Posted
Oct 07, 2024
Type
Full-time
Level
Entry
Location
Canton of Capellen

Industries

IT Services IT Consulting

Categories

Information Technology

Related Jobs

3 roles aligned with this opportunity

View all jobs
View Job Details
NTT DATA, Inc.
Related

Network Engineer

2026-05-08

Full-time
Not Applicable
Italy
IT Services
Information Technology
View Job Details
Egov Select
Related

Network and Systems Engineer

2026-05-28

Full-time
Not Applicable
Belgium
IT Services
Information Technology
View Job Details
NTT DATA, Inc.
Related

Networking Services Engineer

2026-05-07

Full-time
Not Applicable
Spain
IT Services
Engineering