Big data infrastructure internship | Adaltas

Work description

Major Data and distributed computing are at the core of Adaltas. We accompagny our partners in the deployment, upkeep, and optimization of some of the premier clusters in France. Considering the fact that not too long ago we also offer guidance for working day-day operations.

As a great defender and energetic contributor of open resource, we are at the forefront of the knowledge system initiative TDP (TOSIT Facts System).

Through this internship, you will contribute to the advancement of TDP, its industrialization, and the integration of new open source elements and new functionalities. You will be accompanied by the Alliage specialist group in charge of TDP editor support.

You will also operate with the Kubernetes ecosystem and the automation of datalab deployments Onyxia, which we want to make offered to our buyers as well as to learners as element of our instructing modules (devops, massive facts, and so forth.).

Your qualifications will assist to expand the providers of Alliage’s open source guidance providing. Supported open resource elements involve TDP, Onyxia, ScyllaDB, … For people who would like to do some world-wide-web do the job in addition to big facts, we by now have a really useful intranet (ticket administration, time management, highly developed look for, mentions and connected posts, …) but other pleasant features are anticipated.

You will apply GitOps release chains and compose articles.

You will do the job in a group with senior advisors as mentor.

Firm presentation

Adaltas is a consulting company led by a crew of open up resource specialists focusing on facts management. We deploy and operate the storage and computing infrastructures in collaboration with our customers.

Companion with Cloudera and Databricks, we are also open resource contributors. We invite you to browse our website and our quite a few technological publications to learn much more about the enterprise.

Expertise essential and to be acquired

Automating the deployment of the Onyxia datalab calls for expertise of Kubernetes and Cloud native. You should be comfy with the Kubernetes ecosystem, the Hadoop ecosystem, and the distributed computing product. You will master how the standard components (HDFS, YARN, object storage, Kerberos, OAuth, and so on.) function alongside one another to satisfy the employs of major details.

A great information of using Linux and the command line is required.

All through the internship, you will study:

  • The Kubernetes/Hadoop ecosystem in order to contribute to the TDP challenge
  • Securing clusters with Kerberos and SSL/TLS certificates
  • Significant availability (HA) of services
  • The distribution of sources and workloads
  • Supervision of products and services and hosted applications
  • Fault tolerant Hadoop cluster with recoverability of dropped information on infrastructure failure
  • Infrastructure as Code (IaC) by means of DevOps instruments these kinds of as Ansible and [Vagrant](/en/tag/hashicorp- vagrant/)
  • Be snug with the architecture and procedure of a information lakehouse
  • Code collaboration with Git, Gitlab and Github

Tasks

  • Turn into acquainted with the architecture and configuration approaches of the TDP distribution
  • Deploy and take a look at secure and highly offered TDP clusters
  • Contribute to the TDP understanding foundation with troubleshooting guides, FAQs and article content
  • Actively add strategies and code to make iterative enhancements to the TDP ecosystem
  • Exploration and evaluate the dissimilarities in between the principal Hadoop distributions
  • Update Adaltas Cloud utilizing Nikita
  • Contribute to the advancement of a software to collect consumer logs and metrics on TDP and ScyllaDB
  • Actively lead suggestions to acquire our support resolution

More info

  • Location: Boulogne Billancourt, France
  • Languages: French or English
  • Commencing date: March 2023
  • Length: 6 months

Much of the digital earth operates on Open Supply software program and the Big Information industry is booming. This internship is an opportunity to obtain valuable experience in equally domains. TDP is now the only actually Open up Resource Hadoop distribution. This is a fantastic momentum. As part of the TDP staff, you will have the possibility to understand one particular of the core huge info processing models and take part in the enhancement and the foreseeable future roadmap of TDP. We feel that this is an thrilling prospect and that on completion of the internship, you will be all set for a effective occupation in Major Details.

Products accessible

A notebook with the pursuing attributes:

  • 32GB RAM
  • 1TB SSD
  • 8c/16t CPU

A cluster created up of:

  • 3x 28c/56t Intel Xeon Scalable Gold 6132
  • 3x 192TB RAM DDR4 ECC 2666MHz
  • 3x 14 SSD 480GB SATA Intel S4500 6Gbps

A Kubernetes cluster and a Hadoop cluster.

Remuneration

  • Wage 1200 € / month
  • Restaurant tickets
  • Transportation move
  • Participation in one particular global convention

In the earlier, the conferences which we attended involve the KubeCon arranged by the CNCF foundation, the Open up Source Summit from the Linux Basis and the Fosdem.

For any ask for for more facts and to submit your application, be sure to call David Worms: