
Closed
Posted
Paid on delivery
Hypothesis: A constraint-aware, evidence-linked RAG architecture specifically optimized for Small Language Models (SLMs) can achieve legal reasoning accuracy and hallucination reduction comparable to or better than larger LLM-based RAG systems, while operating at significantly lower computational cost. So, basically you can create a new RAG architecture than performs much better than a currently available Plain SLM. Steps: 1. Check these SLMs first to benchmark on Retrieval/QA tasks: Inlegal‑Sbert (Retrieval) / InLegalBERT embeddings (Retrieval) / InLegalLlama (QA) 2. Dataset in consideration: Dataset: a) IPC Sections with conditions, exceptions, constraints, relations b) Supreme court judgement copies c) IPC_and_case_related_QA_dataset 3. Evaluate Inlegal‑Sbert and InLegalBERT embeddings on retrieval task and evaluate InLegalLlama on QA task from the above dataset 4. Evaluation Metrics for Retrieval: nDCG@10 Recall@5 / Recall@10 Mean Reciprocal Rank (MRR) Precision@k Metrics for QA Evaluation: ROUGE-L BERTScore Faithfulness (are all claims backed by retrieved text?) Citation Accuracy Hallucination Rate 5. Now build constraint-aware, evidence-linked RAG architecture and add it with SLM to see if it can beat the previous metrics evaluated from the SLM alone. 6. For building the constraint-aware, evidence-linked RAG architecture, refer following, Core components: =============== 1. Corpus & Index — statute chunks with hierarchy metadata (section/subsection/illustration/exception). 2. Embedding retriever — InLegal-SBERT / InLegalBERT sentence embeddings + FAISS index. 3. Reranker — cross-encoder reranker (InLegalBERT fine-tuned) to re-rank top-k. 4. Constraint module — set of lightweight rule engines/filters enforcing legal constraints before generation. 5. Evidence formatter — organizes retrieved snippets + provenance into generator context. 6. Generator (SLM) — small seq2seq (FLAN-T5 small/TinyLlama / Phi-4) fine-tuned with LoRA/QLoRA to output multi-field evidence-linked answers. 7. Hallucination detector / verifier — checks claims against retrieved evidence; optionally runs a secondary verification model.
Project ID: 40176887
5 proposals
Remote project
Active 3 mos ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
5 freelancers are bidding on average ₹53,008 INR for this job

Having worked on a variety of challenging projects, I am accustomed to solving intricate problems. My proficiency in Machine Learning would be particularly valuable for building the constraint-aware, evidence-linked RAG architecture with small seq2seq models like FLAN-T5 small/TinyLlama / Phi-4 fine-tuned with LoRA/QLoRA. I am well versed in using effective evaluation metrics (such as nDCG@10/Recall@5/MRR/Precision@k/ROUGE-L/BERTScore/Faithfulness/Citation Accuracy/Hallucination Rate) which will be crucial for determining the performance of the new architecture. Moreover, my strong belief in the iterative process of development aligns perfectly with your project's empirical approach. I am thorough and detail-oriented in my working style without losing sight of the bigger picture. This makes me capable of not only understanding but also fulfilling your requirements effectively. Trusting me with this project means putting it into capable hands; hands that are skilled at both dealing with complex code as well as creating innovative solutions. Let's make your vision a reality with a formidable RAG architecture!
₹56,250 INR in 10 days
4.4
4.4

Siddipet, India
Member since Nov 4, 2015
₹12500-37500 INR
$250-750 USD
€8-100 EUR / hour
₹12500-37500 INR
$30-250 AUD
₹1250-2500 INR / hour
$250-750 USD
$30-250 USD
$250-750 USD
₹12500-37500 INR
₹600-1500 INR
$30-250 AUD
₹1250-2500 INR / hour
₹150000-250000 INR
₹12500-37500 INR
$250-750 SGD
$250-750 USD
₹37500-75000 INR
₹750-1250 INR / hour
$30-250 AUD