Generative AI Engineer

Job Description

We are seeking Multiple Generative AI Engineers to drive the development of state-of-the art AI Technologies for our AI platform. Spearhead into the development of cutting-edge AI models, Platform & Architecture. You will be at the forefront of innovation, driving the evolution of our AI systems to enhance performance, scalability, and user experience.

Job Responsibilities

Responsibilities:

Develop and optimize RAG systems: Design, develop, and deploy Retrieval-Augmented Generation (RAG) systems utilizing dense (DPR) and sparse (BM25) retrieval mechanisms, embedding techniques, and Generative AI frameworks (LangChain, LlamaIndex).

NLP problem-solving and model performance: Solve complex NLP problems (text classification, entity tagging, question answering) and continually monitor model performance, conducting evaluations, and implementing improvements based on data analysis.

Data management and optimization: Ensure data quality and relevance through augmentation, noise removal, and optimization of retrieval efficiency. Evaluate query understanding techniques and implement context-aware retrieval strategies.

Evaluation and metrics: Define and implement evaluation metrics (relevance, factual accuracy, coherence) to assess RAG application effectiveness and monitor performance.

Model development and fine-tuning: Fine-tune and pre-train large language models (LLMs) using techniques like LoRA, QLORA, and prompt tuning. Integrate and optimize AI frameworks for seamless functionality and user experience.

Prompt engineering and research: Champion innovative prompt engineering to unlock zero-shot and few-shot learning capabilities. Lead research initiatives to explore new advancements in generative AI technology.

Collaboration and integration: Collaborate with cross-functional teams to integrate models into the platform and align with user needs.

Job Requirements

We're looking for someone with:

Educational Background:

B.Tech/M.Tech in Computer Science, AI, or a related field.

1+ years of experience (strong advantage for peeps with Machine Learning, NLP, LLM hands-on experience)

Experience (1 to 2+ years ):

1+ Years of hands-on experience in Generative AI Projects.

Hands-on experience with Pre-Training/FineTuning, consuming foundational AI models via APIs.

Hands-on experience with RAG frameworks like (Langchain, LlamaIndex)

Hands on experience with Data Chunking, embedding approaches and developing Retrieval Augmented Generation solutions.

Hands on experience with LLMs context store development, vector search, prompt engineering

Hands-on experience with Prompt Engineering, In-Context Learning (zero-shot/few-shot learning)

Expert level understanding of Python & Hands-on experience with advanced AI/ML frameworks/Libraries (e.g., PyTorch, TensorFlow, Streamlit).

Hands-on experience with NLP, LLMs (Falcon, LLama, GPT series)

Hands-on experience with Vector Databases (Pinecone, Waviate, Faiss).

Hands on experience with building GenAI Web API REST services, FastAPI, Flask etc

Proficiency is backend service building and handling code at scale

Strong understanding of transformer architectures and model compression techniques.

Strong understanding of data preprocessing, feature engineering, and dataset curation for AI training

Bonus points if you have:

Participation in the AI and open-source communities is appreciated

Experience with model compression techniques, quantization, and efficient inference

Experience in fine tuning using techniques like PEFT, QLORA etc.

Knowledge of reinforcement learning and RLHF (Reinforcement Learning from Human Feedback) & Advance Prompt Engineering

A knack for data preprocessing, feature engineering, and dataset curation for AI training

Participation in AI and open-source communities.

A commitment to continuous learning and skill enhancement in AI technology.

Experience with cloud-based development and familiarity with AI-related cloud services (e.g, AWS, Azure, GCP).

Understanding of containerization and orchestration technologies (e.g, Docker, Kubernetes) for deploying AI services.

Essential Skills:

Exceptional problem-solving abilities and innovative thinking in AI research and development

Ability to balance cutting-edge research with practical implementation and user needs

Take ownership of your work in a high-ownership, high-commitment startup environment.

Interview Process

Qualification (Telephonic, Non-Technical, 10 Mins)

Technical Interview (45 Mins, Google Meet Video)

Coding Round (assignment code submission in 24 hours)

Final - Culture & Technical interview (60min, in-person)

docker

Apply Now

Job Type

Full time contract

Location

Onsite

Other Jobs at UltraGenius

Fast Apply

UltraGenius

Sr. FullStack Engineer

Onsite New Delhi

Master Full-stack Developer with experience in React/Typescript based stack. People describe you as “extremely productive”.

You have experience in areas such as algorithms, databases, UX implementation, full-stack performance measurement and optimization.

You're flexible, trustworthy, energetic, and a great communicator.

NodeJS exp required

🔨What you’ll do | Responsibilities

Take full ownership of Irona Labs’s Full-stack Web Platform, all the way from login, dashboard for data, benchmark interactions, to interacting with deployed models.

Help design the initial design & UI → from web interfaces to dashboard and more.

Work together with DL Researchers to coordinate AI and LLM process pipelines to work with our web application.

Be exposed to every level of the company, working closely with the CEO to meet business goals and working off customer feedback.

💰 Benefits

Competitive pay and equity compensation

Unlimited vacation/sick leave

Zero commutes. Work wherever you are, globally.

Modern productivity software: Notion, Linear, ClickUp, Airtable, Miro,

reactjs node-js

View Details

Fast Apply

UltraGenius

Backend Developer

We are seeking a highly skilled and passionate Backend Developer (Nest.js/Python) to join our dynamic GenAI Product Team. You will play a pivotal role in building high-performance, user-centric web applications that deliver exceptional digital experiences. As a key member of our development team, you will be responsible for designing, developing, and maintaining complex Backend applications using Nest.js (Python) and other modern web technologies. you will collaborate closely with designers, product managers, and other developers to create exceptional digital experiences.

python nestjs

View Details

UltraGenius

India

⭐⭐⭐⭐⭐

ultraGenius: a fresh start

Ultragenius has given me a new start in my life. I always wanted to start my gig and work as a freelancer for good clients, and these guys have helped me accomplish my long term goal in only 3rd year of my career. They are highly supportive and helpful. I am grateful that I got a chance to work with them.

Mahesh Inder, Reactjs Dev

⭐⭐⭐⭐⭐

Excellent guys

Shailesh is excellent person to work with if your startup or organisation is finding talent for your web3 product and he is also helping developers and builders in web3 to find great team to work remotely. so he is creating win-win situation for both.

Mitul Gajera, web3