Senior Big Data Engineer (Hadoop, Spark, Python) - Location: Stow, MA
100,000 - 200,000
Job Description:
- We are looking for a Senior Big Data Engineer to join our Big Data Analytics Platform team. In this role, you will be responsible for designing, developing, deploying and supporting the data ingestion pipeline from our Connected Products.
- You will partner with our data science community by supplying highly performant data sets for advanced analytics, and you will provide leadership and experience in the data engineering space.
- The candidate must be results driven, customer focused, technologically savvy, and skilled at working in an agile development environment.
Job Responsibilities:
- Design and develop the data ingestion pipelines into the Big Data & Analytics Platform for a variety of big data use cases
- Deploy and support highly optimized solutions with a focus on automation
- Participate in the end-to-end delivery of business use cases, including data architecture, to deliver results
- Deliver as part of an agile scrum team on the highest business value use cases that will drive our business strategy
- Partner with IT architects to define analytic architecture that best leverages the Big Data Platform to enable advanced analytics capabilities
- Provide leadership to the ongoing maturity of the development process, coach/mentor the development team on best practices and methodologies for enhanced solution development.
- Stay up to date on relevant technologies, plug into user groups, and understand trends and opportunities
Required Demonstrable Skills:
- Deep expertise in working with data of all kinds: clean, dirty, unstructured, and semi-structured
- Strong expertise in Apache Spark (batch and Spark Streaming)
- Experience in real-time and batch data processing and associated technologies
- Able to demonstrate strong skills in programming/scripting languages such as Python, Scala, Java
- Ability to design, develop, and deploy end-to-end data pipelines that meet business requirements and use cases
- Experience with all Cloudera components, with a focus on Impala, Hive, HBase, Sqoop, and Hue
- Experience with automation technologies such as Oozie
- Strong experience with large cloud-compute infrastructure solutions such as Amazon Web Services, Google Cloud, and Azure
- Knowledge of UNIX/Linux
- Experience with Text Analytics
- Strong knowledge of SQL
- Experience in Hadoop Platform Security and Hadoop Data Governance topics
- Experience triaging production issues to understand and resolve them
- Experience in technical computing (optimization, statistics and machine learning)
- Experience with analytics visualization software such as Tableau
- Experience leading development teams, defining development processes, evaluating new technologies and practices to enhance solution delivery
Qualifications:
- Minimum of BS in Computer Science or similar field required
- Must have at least 5-8 years' experience in information management and application development
- Must have a minimum of 3-5 years working hands-on with Big Data technologies