Job Title: AI/ML Engineer
Mode: Full Time – Remote
Experience Level: 2- 4 Years
About the Role:
We are seeking an experienced AI/ML Engineer with expertise in designing, developing, and deploying AI-driven solutions. The ideal candidate should have hands-on experience with machine learning, NLP, computer vision, fine-tuning LLMs, and AI-based automation. You will work on diverse AI applications involving structured and unstructured data processing, intelligent automation, and generative AI.
Key Responsibilities:
- Develop and fine-tune AI/ML models for NLP, text analytics, document processing, and chatbot applications.
- Implement Retrieval-Augmented Generation (RAG) for intelligent Q&A and chatbot solutions.
- Work with vector databases for efficient document retrieval and AI-based search.
- Design and optimize text-to-SQL models for natural language query execution.
- Develop intelligent document processing pipelines using OCR, NLP, and embeddings.
- Implement AI-driven automation for ITSM, RFP generation, and CAD diagram analysis.
- Work on image generation using GANs, Stable Diffusion, or transformer-based models.
- Deploy and optimize AI models on on-premise infrastructure for high-performance applications.
- Collaborate with developers to integrate AI functionalities into enterprise systems.
- Ensure scalability, performance optimization, and security of AI solutions.
Required Tech Stack & Tools:
Machine Learning & Deep Learning:
- Frameworks: PyTorch, TensorFlow, Hugging Face Transformers
- Fine-tuning & LLMs: OpenAI models, LLaMA, Mistral, Falcon, GPT-based models
- Text-to-SQL: LLM-based SQL generation, SQLCoder, Spider dataset-based training
- RAG & Search: LangChain, LlamaIndex, FAISS, Weaviate, Pinecone, ChromaDB
- Embeddings & Vector Search: SentenceTransformers, OpenAI/Anthropic embeddings
- Conversational AI – LangChain, LlamaIndex, Rasa
NLP & Document Processing:
- OCR: Tesseract, AWS Textract, Google Vision, Azure OCR
- LLM-based Document Processing: LayoutLM, Donut, Document AI
- Text Extraction & Processing: spaCy, NLTK, Transformers, BERT, T5
Computer Vision & Image Generation:
- CV Models: OpenCV, YOLO, Detectron2, Vision Transformers
- Generative AI: Stable Diffusion, DALL·E, ControlNet, GANs
MLOps & Model Deployment:
- Containerization & Deployment: Docker, Kubernetes, FastAPI, Flask
- On-Premise AI Serving: Triton Inference Server, TorchServe, ONNX Runtime
- Cloud/Hybrid MLOps: MLflow, DVC, Kubeflow (if applicable)
Preferred Qualifications:
- Experience in developing AI-powered automation for ITSM and business workflows.
- Knowledge of fine-tuning open-source LLMs for domain-specific tasks.
- Experience with retrieval-augmented generation (RAG) architectures.
- Ability to work with structured and unstructured data pipelines.
- Strong problem-solving skills and ability to optimize AI solutions for performance.
Contact mail: srividyap@hexalytics.com