Who are we?Indegene is a global consultancy at the forefront of driving innovation in the Pharmaceutical and Life Sciences industry, through combining medical and commercial expertise with innovative digital and AI technologies.We enable global healthcare organizations address complex challenges and drive better health and business outcomes by seamlessly integrating analytics, technology, operations, and medical expertise. Find out more at indegene.comWho are you?We are looking for experienced AI Specialists toDevelop and train Generative AI models.Perform data analysis and prepare data for AI model training.Integrate AI models with Snowflake, AWS and other systems.Required knowledge:Good knowledge in machine learning and Generative AI, especially content generation using AWS Bedrock and OpenAI based models.Strong experience in building scalable (Gen) AI applications on AWS.Unstructured Data Processing & ExtractionDocument Parsing: Experience with PDF, Word, and HTML parsing using tools like PyMuPDF, Apache Tika, or Textract.Optical Character Recognition (OCR): Familiarity with Tesseract OCR, AWS Textract, or Azure Form Recognizer for extracting text from scanned documents.Natural Language Processing (NLP): Ability to clean, preprocess, and structure text using spaCy, NLTK, or Hugging Face Transformers, familiarity with NER Named Entity Recognition Vector Database & EmbeddingsVector Databases: Expertise in FAISS, Qdrant, Pinecone, Weaviate, or ChromaDB for semantic search and retrieval.Embedding Models: Understanding of OpenAI, Cohere, or Sentence-BERT embeddings for document similarity and retrieval.Chunking & Indexing: Experience in splitting large documents into meaningful chunks for efficient retrieval keeping in mind document structure and form and maintaining metadataStrong background and understanding of vector databases.Experience in building (Gen) AI solutions on Snowflake is a plus. Experience with Graph databases is a plusExperience with Agentic AI is a plusGood knowledge in Python for data science, as well as streamlit for rapid deployment of prototypes. Good knowledge in Git, ideally Azure DevOps.Experience to work in an agile and international environment.Experience in the setup and usage of CI/CD pipelines as well as writing software in a test-driven fashion.Good documentation and coaching practiceEQUAL OPPORTUNITYIndegene is proud to be an Equal Employment Employer and is committed to the culture of Inclusion and Diversity. We do not discriminate on the basis of race, religion, sex, colour, age, national origin, pregnancy, sexual orientation, physical ability, or any other characteristics. All employment decisions, from hiring to separation, will be based on business requirements, the candidate’s merit and qualification.We are an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, colour, religion, sex, national origin, gender identity, sexual orientation, disability status, protected veteran status, or any other characteristics.