Sr Data Scientist (GenAI)

3 years

0.0 Lacs P.A.

Mumbai, Maharashtra, India

Posted:2 weeks ago| Platform: Linkedin logo

Apply Now

Skills Required

dataiotefficiencydriveanalysispoweranalyticscuttingextractionretrievaldesignmodelscalabilitydevelopmentdatabaselearningalgorithmstrainingmonitoringstrategiescollaborationcommunicationtuningpythonsqldockergitstatisticsopenaiapielasticsearchsolrawsazureprocessingpresentation

Work Mode

On-site

Job Type

Full Time

Job Description

Job Title: Sr Data Scientist Organization: Living Things Pvt. Ltd Location: IIT Bombay, Powai, Mumbai Job Type: Full-Time Experience Level: Mid-Level (3+ years experience) About Us: Living Things is a pioneering IoT platform by iCapotech Pvt Ltd, dedicated to accelerating the net zero journey towards a sustainable future. We bring mindfulness in energy usage by our platform. Our solution seamlessly integrates with existing air conditioners, empowering businesses & organisations to optimise & reduce energy usage, enhance operational efficiency, reduce carbon footprints, and drive sustainable practices. Analysis of Electricity consumption across all locations from Electricity Bills. By harnessing the power of real-time data analytics and intelligent insights, our energy saving algorithm helps in saving a minimum of 15% on Air Conditioner’s energy consumption. About the Role: We are seeking a highly skilled and passionate Data Scientist to join our team who will play a pivotal role in developing and deploying cutting-edge solutions, particularly in the domain of document extraction using State of the Art Large Language Models (LLMs) and Retrieval-augmented generation (RAG). Job Responsibilities: Document Extraction using LLMs: Design and develop robust document entity extraction models using State of the Art LLMs. Fine-tune LLMs for specific document extraction tasks. Evaluate model performance and optimize for accuracy, efficiency, and scalability. RAG Development: Development of RAG pipelines for Question answering based on documents Agentic RAG pipelines for communicating with the database Apply advanced machine learning and deep learning algorithms to solve complex data-driven problems. LLM Ops: Implement and maintain robust LLM Ops pipelines for finetuning training and monitoring. Develop and implement strategies for continuous model improvement and retraining. Collaboration & Communication: Effectively communicate technical concepts to both technical and non-technical audiences. Collaborate with cross-functional teams to ensure successful project delivery. Skills and Qualifications: Essential: 3+ years of hands-on experience in developing and deploying machine learning models. Experience with fine-tuning and deploying LLMs. 3+ years of experience with building RAG pipelines. Strong proficiency in Python and SQL. Deep understanding of machine learning and deep learning concepts. Experience with Docker and Git. Preferred: Master's degree or PhD in Computer Science, Data Science, Statistics, or a related field. Proficiency with using open state-of-the-art technologies for RAG components, including: Efficient vector databases: Faiss, Milvus, Qdrant, Weaviate Dense retrieval methods: DPR (Dense Passage Retriever), ColBERT, ANCE Knowledge graph embeddings: RDF2Vec, TransE, RotatE LLM integrations: Hugging Face Transformers, OpenAI API Search engines: Elasticsearch, Solr Experience with cloud computing platforms (e.g., Google Cloud Platform, AWS, Azure). Strong understanding of natural language processing (NLP) techniques. Excellent communication and presentation skills. Show more Show less

Living Things
Living Things
Not specified
No locations

RecommendedJobs for You