Responsibilities
- Develop industry-leading ICT big data solution that seamlessly integrates the current system and data asset with the new strategy for data analytics.
- Responsible for designing, testing, deploying, and documenting Big Data platform and analytics procedures and their outputs
- Design and Develop scalable and distributed applications using Hadoop Technology Stack such as Apache Pig, Apache Hive and HDFS.
- Provide/Benchmark efficient solutions based on project demands.
- Responsible for implementing Map/Reduce jobs, UDF's and Performance Tuning of Hadoop jobs
- Partner with data scientists, analysts, marketing, product management to provide summary results of data analysis, which will be used to make decisions regarding how to measure business rules and quality of the data.
- Document at a functional level how the procedures work within the data quality applications
- Research all available technologies, determine suitability and provide guidance on the best solution for the project at hand
Requirements
- Good knowledge of the entire data management landscape
- Deep understanding of core Big Data design patterns and the associated challenges involved with data analytics, analysis, certification, modeling, quality improvement and data management implementation projects
- Deep experience using one or more of the following technologies: Hadoop, Spark, Amazon Web Services, Google, Microsoft Azure
- Bachelor’s Degree (or equivalent) in Computer Science, Engineering, Math or a related field
- Hands on in Java programming skills
- Understanding of Map-Reduce
- Expertise in Apache Hadoop, Spark, Apache Hive, HBASE
- Hands-on experience working with low-latency real-time application requirements
Nice to Have
- Exposure to Networking, NMS/EMS