Beschreibung
Für unseren Kunden suchen wir einenData Engineer (m/w)
für den Bereich Predictive Maintenance.
Aufgaben:
- Test driven development with Scala and the Apache Spark framework
- Creating data pipelines to create the data preparation layer within the project
- Translating existing complex data preparation SQL queries to Scala
- Create content for new analytics modules whilst communicating with other teams and developers
- Be a part of an international team
Must have:
- Experience with TDD, GIT
- Experienced in Scala and/or Python and Unix/Linux environment
- Proficient with Microsoft Azure and Hortonworks stack
- Experience with the Spark framework
- Big data technologies such as Hadoop, HBase, Hive , ETL frameworks
- Ability to event stream pipelines (Storm, Kafka, Kinesis, ...)
- Good understanding of the Data Science lifecycle
- Good knowledge of infrastructure automation software/tools such as Chef, Terraform and/or Docker
- Fluent in English
Nice to have:
- A history of Machine Learning
- Administrating SQL and NoSQL databases
- Scala and Python