Senior Data Scientist
SatsyilCorporation
Posted: October 15, 2018
Interested in this position?
Create a free account to apply with AI-powered matching
Quick Summary
We are looking for a Senior Data Scientist to join our team, with a strong background in data science and a passion for building and optimizing features and data analytics. The ideal candidate will have experience with data science tools such as elastic search and Apache zeppelin, and be able to work collaboratively with cross-functional teams. The successful candidate will be responsible for selecting features, building and optimizing, and driving data science projects from concept to delivery.
Required Skills
Job Description
Satsyil is not a typical software solution company. Talented, hardworking, practiced professionals and principled people who are passionate about customers and software, brought together because they want to provide very affordable, simple and amazing solutions.
Looking for Candidates with the following skills in priority order, also we are expecting the person to do both development and data science analytics work, most likely 50% development and 50% data science work:
• http
• elastic search
• Apache zeppelin
• data science
Responsibilities
• Selecting features, building and optimizing classifiers using machine learning techniques.
• Ability to work with Hadoop/Hive/HDFS/Spark environments to be able to experiment and write the scalable programs.
• Ability to work with structured and unstructured datasets.
• Ability to work with large datasets.
• Enhancing data collection procedures to include information that is relevant for building analytic systems.
• Processing, cleansing, and verifying the integrity of data used for analysis.
• Doing ad-hoc analysis and presenting results in a clear manner.
• Creating automated anomaly detection systems and constant tracking of its performance.
• Excellent understanding of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, etc.
• Strong mathematical academic background.
• Experience with common data science tool-kits, such as NumPy, SciPy, NLTK, matplotlib, pandas, xlrd.
• Experience with data visualization tools like Kibana/Grafana.
• Experience in using Zeppelin type of tools for quick scripting.
• Proficiency in using query languages such as Hive.
• Extensive experience in using HDFS/HIVE/Spark environments.
• Experience with NoSQL databases.
• Good applied statistics skills, such as distributions, statistical testing, regression, etc.
• Good scripting and programming skills (Python, Java – especially using Spark).
• Data-oriented personality.
All your information will be kept confidential according to EEO guidelines.