Primary Skills: Kafka, RabbitMQ, NoSQL, Artificial Intelligence, Cassandra, MySQL
Secondary Skills: Mahout, MongoDB, Spark, HBase
Job Location:
Bangalore/Bengaluru
Posted Date:
385 days ago
Job Description
Roles and Responsibilities
Mandatory Skill (2-3): Big data, Hadoop, Hive, Spark, Kafka, MySQL
Preferred Skill: Knowledge of AI/ML and online advertising systems (e.g. oCPX, CTR, CVR)
Strong hands-on experience with big data analysis techniques, statistical models, and various data analysis tools
Strong applied statistics skills, such as distributions, statistical testing, and regression. Mathematical background in linear algebra.
Excellent understanding of Hadoop.
Excellent scripting and programming skills, preferably in Python 3
Experience building data pipelines for batch and stream processing systems
Experience with Spark and with messaging systems such as Kafka or RabbitMQ
Experience with SQL and NoSQL databases such as HBase, Cassandra or MongoDB
Experience with classification techniques
Experience with big data ML toolkits such as Mahout and Spark ML
Good knowledge of big data querying tools such as Pig and Hive
Proficiency with HiveQL, including the ability to analyze, develop, and debug Hive scripts independently
Candidate should be able to perform data processing (ETL techniques) using Hive scripts
Candidate's experience MUST NOT be limited to data migration from a legacy DB to a Hadoop cluster
Proficient with partitioning, analytical aggregation, and handling large tables