The role is for a self-motivated individual with software engineering skills and expertise in Big Data and cloud technologies. The candidate will be extensively involved in hands-on activities including POCs, design, documentation, development, and testing of new functionality. The candidate must be agile and flexible with changing priorities based on the team's needs.
Qualifications & Experience:
- CS fundamentals: You have earned at least a B.S. or M.S. in Computer Science or a related field, and you have a strong ethos of continuous learning.
- Software engineering & Architecture: At least 6 years of professional software development experience with languages and systems such as Python/Java, REST APIs, PySpark, Apache Beam, and version control (git), with good analytical & debugging skills.
- Big data: You have extensive experience with data analytics and working knowledge of big data infrastructure such as Google Cloud, BigQuery, Dataflow, AWS, the Hadoop ecosystem, HDFS, and Spark. You've routinely built data pipelines with gigabytes/terabytes of data and understand the challenges of manipulating such large datasets.
- Data Science/MLOps: Experience operationalizing Data Science projects (MLOps) using at least one of the popular frameworks available.
- Data Modeling: Flair for data, schemas, and data models, including PL/SQL and star & snowflake schemas; knows how to design data models for efficient analytical querying; understands the importance of TDD and develops data validation techniques.
- Real Time Systems: Understands the evolution of databases toward in-memory, NoSQL & indexing technologies, along with experience with real-time & stream processing systems like Google Cloud Pub/Sub, GCP technologies, Kafka, AWS/Azure streaming technologies, Storm, and Spark Streaming.
- Strong design skills: A proven track record of success on large, highly complex projects, preferably in enterprise applications and integration.
- Project management: You demonstrate excellent project and time management skills and have exposure to Scrum or other agile practices, using Jira.
- Excellent verbal and written communication skills: Must be able to communicate effectively and work with fellow team members and other functional teams to coordinate & meet deliverables.
Must Have
Software engineering & Architecture
Python, version control (git), analytical & debugging skills
Big Data / DWH
Google Cloud Platform, BigQuery, Dataflow, Composer/Airflow, Cloud Functions, Stackdriver
Data Modeling
Data modeling, SQL, in-memory database, data catalog
Real Time Systems
Google Cloud Pub/Sub, GCP technologies