avaamo logo

Avaamo Interviews

1 experience5 reads
Avaamo | Interview (All rounds) | Senior MLE | Bengaluru
avaamo logo
Avaamo
Senior Machine Learning EngineerBengaluru3.5 years
June 24, 20255 reads

Summary

I interviewed at Avaamo in December 2024 for a Senior Machine Learning Engineer role, with three rounds focused on RAG, BERT, LLMs, and vector databases. The first round was the most challenging, testing foundational knowledge in transformer-based models.

Full Experience

I gave interview at Avaamo in Dec 2024 and these were the questions around which the interviews revolved. I have 3.5 years of experience as a data scientist, the role was Senior Machine Learning Engineer.

Question List for the 3 rounds

  1. Explain the basics of RAG architecture and its components.
  2. What are different parsing and chunking strategies?
  3. How does chunking impact the quality of retrieval in RAG?
  4. What is BERT pretraining using MLM and NSP?
  5. How do bi-encoders and cross-encoders differ in architecture and use cases?
  6. What are the pros and cons of bi-encoders vs cross-encoders?
  7. How do you fine-tune embedding models? What loss functions are used (e.g., triplet loss)?
  8. What are the basics of LLMs and their fine-tuning approaches (PEFT, LoRA, instruction tuning, adapters)?
  9. What are decoding strategies in LLMs (temperature, top-k, top-p, beam search)?
  10. What are some techniques for evaluating RAG systems?
  11. How would you optimize latency in a RAG pipeline?
  12. What are some ANN (Approximate Nearest Neighbor) algorithms used in vector databases?
  13. What is the difference between using pretrained models vs fine-tuned models in RAG?
  14. What are some vector database fundamentals and retrieval configurations?
  15. What is semantic caching and how is it useful?
  16. How do encoders function in LLM generation tasks?
  17. What prompting techniques are used in real-world applications?
  18. How would you optimize or improve the performance of a GenAI classification system?

The rounds were similar with the first one being the most challenging testing the basics and experience with transformer based models. Since there work is around developing framework for chatbot building. They use their own proprietary frameworks to solve the problems hence fundamentals are essential.

Interview Questions (18)

Q1
Explain RAG Architecture Basics
Other

Explain the basics of RAG architecture and its components.

Q2
Parsing and Chunking Strategies
Other

What are different parsing and chunking strategies?

Q3
Impact of Chunking on RAG Retrieval Quality
Other

How does chunking impact the quality of retrieval in RAG?

Q4
BERT Pretraining (MLM & NSP)
Other

What is BERT pretraining using MLM and NSP?

Q5
Bi-encoders vs Cross-encoders
Other

How do bi-encoders and cross-encoders differ in architecture and use cases?

Q6
Pros and Cons of Bi-encoders vs Cross-encoders
Other

What are the pros and cons of bi-encoders vs cross-encoders?

Q7
Fine-tuning Embedding Models and Loss Functions
Other

How do you fine-tune embedding models? What loss functions are used (e.g., triplet loss)?

Q8
LLM Basics and Fine-tuning Approaches
Other

What are the basics of LLMs and their fine-tuning approaches (PEFT, LoRA, instruction tuning, adapters)?

Q9
LLM Decoding Strategies
Other

What are decoding strategies in LLMs (temperature, top-k, top-p, beam search)?

Q10
Techniques for Evaluating RAG Systems
Other

What are some techniques for evaluating RAG systems?

Q11
Optimize Latency in RAG Pipeline
System Design

How would you optimize latency in a RAG pipeline?

Q12
ANN Algorithms in Vector Databases
Other

What are some ANN (Approximate Nearest Neighbor) algorithms used in vector databases?

Q13
Pretrained vs Fine-tuned Models in RAG
Other

What is the difference between using pretrained models vs fine-tuned models in RAG?

Q14
Vector Database Fundamentals and Retrieval Configurations
Other

What are some vector database fundamentals and retrieval configurations?

Q15
Semantic Caching and Its Utility
System Design

What is semantic caching and how is it useful?

Q16
Encoder Function in LLM Generation
Other

How do encoders function in LLM generation tasks?

Q17
Prompting Techniques in Real-world Applications
Other

What prompting techniques are used in real-world applications?

Q18
Optimize GenAI Classification System Performance
System Design

How would you optimize or improve the performance of a GenAI classification system?

Have a Avaamo Interview Experience to Share?

Help other candidates by sharing your interview experience. Your insights could make the difference for someone preparing for their dream job at Avaamo.