Skills - 10+ years industry experience in enterprise-scale products in Java, J2EE, RDBMS, NoSQL, Web services, REST, Docker Data analytics and machine learning - 2+ years in Python, numpy, pandas. Data manipulation and cleaning techniques using pandas DataFrame NLP - Text mining and manipulation basics using nltk framework, text classification, topic modelling. Prediction algos - Supervised approaches for creating predictive models using scikit-learn. Linear and Logistic Regression, more advanced techniques, such as ensembles like Random Forests, Gradient Boosting. Understanding cross validation, overfitting etc. As a volunteer with Datakind Bangalore Chapter, worked with an education NGO for betterment of public education. Project was getting statistical analysis on survey data of public primary education. Technology used - python pandas, jupyter notebook
No volunteer work yet.