
Closed
Posted
Paid on delivery
I have a clear, formal definition of both “organizational hierarchy” and “toxicity” that I will share as soon as the project starts. Using that reference, I need a Large Language Model fine-tuned to recognise: • whether a piece of English text expresses toxicity, • weather the text refer to someone junior from senior. Rate the toxicity of the next based on certain parameters. The data you will receive arrives in CSV files—each row contains a single text sample plus a label column I already prepared for validation. If you would like to prototype on plain text or JSON first, that is fine, but the final pipeline must ingest the CSV format directly so I can drop new files in without extra preprocessing. What I’m expecting from you • A reproducible training script (Python + preferred deep-learning library) that loads the CSVs, applies any necessary cleaning, and fine-tunes the base model. • The trained model weights and an inference script or notebook that returns two outputs per line of text: toxicity score/label and the detected organisational level. • A brief README that explains environment setup, command-line usage, and how to extend the label set should we add additional hierarchy levels later. • Evaluation results on the held-out set I provide (precision, recall, F1 for each task). Acceptance criteria 1. Macro-F1 ≥ 0.80 on both toxicity and hierarchy detection tasks when tested on my hidden validation CSV. 2. Inference completes at ≥ 150 samples/second on an A100 or comparable GPU. 3. All code runs in a clean virtual environment using only the libraries listed in the README. If you have questions about the definitions or would like a small sample of the data before bidding, just let me know.
Project ID: 40387441
13 proposals
Remote project
Active 5 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
13 freelancers are bidding on average ₹5,538 INR for this job

Hi there, I can definitely build this dual-task classifier for you. Fine-tuning LLMs for multi-label classification is something I've done before, and your pipeline requirements are clear. 5+ years full-stack experience with Python, PyTorch, and transformer models I'll use a lightweight base model (like DistilBERT or a smaller BERT variant) to hit the 150 samples/sec target on A100 while still achieving 0.80+ macro-F1 The CSV ingestion, cleaning, and training loop will be fully automated so you can drop in new files I'll deliver both the training pipeline and a fast inference script with your two output predictions per sample Clear README with env setup, CLI usage, and extension guides for adding hierarchy levels One thing I'd like to clarify: roughly how many hierarchy levels should the model distinguish beyond just junior/senior? And do you have an estimate of total training samples in your CSV? Looking forward to working on this with you. Thanks
₹5,000 INR in 7 days
6.2
6.2

Hey there, we are a team of developers and we can do this project in no time. Please, send me a message to discuss the work. Thanks Ashish Kumar.
₹7,000 INR in 7 days
5.5
5.5

Hi, I’m a seasoned Applied ML Engineer (6+ yoe) specialized in building practical NLP pipelines for classification & decision support & I can deliver a high-performance system that meets your >150 samples/sec requirement using a specialized modeling strategy My Approach: -Data Audit:Review CSV labels,class balance & text noise to identify risks in toxicity & hierarchy definitions -Baseline Development:Establish TF-IDF & transformer baselines to verify learnability before complex modeling -Multi-Task Fine-Tuning:Train an encoder-based transformer for joint toxicity & hierarchy prediction with specialized scoring heads -Performance Optimization:Use class weighting & threshold tuning to maximize Macro-F1 while ensuring high inference speeds -Production:Deliver a reproducible Python pipeline for direct CSV processing & easy future label extension -Strategy Note:I recommend a fine-tuned encoder model over generative LLMs to ensure the stability & low-latency performance required for your high-throughput needs Relevant experience: -built production ML pipelines for classification & anomaly-detection style tasks where label quality,class imbalance & evaluation rigor mattered -worked on NLP/LLM systems involving structured extraction,semantic classification & reproducible training/inference workflows -built applied data science solutions in churn prediction,anomaly detection & decision-support analytics where the real work was turning noisy raw data into reliable model outputs
₹8,000 INR in 3 days
4.3
4.3

Hi, This is a great fit for my experience building LLM-powered classification systems and data pipelines. I can fine-tune a transformer model (e.g., BERT/DeBERTa) to jointly predict: • Toxicity (multi-class or score-based) • Organizational hierarchy (senior → junior detection) **Approach:** • Load CSV directly (no extra preprocessing needed from your side) • Clean and normalize text (light preprocessing to preserve semantics) • Fine-tune using HuggingFace Transformers with multi-task learning • Optimize for both accuracy (Macro-F1 ≥ 0.80) and speed (≥150 samples/sec on A100 via batching + FP16) **Deliverables:** • Reproducible training script (Python + PyTorch) • Fine-tuned model weights • Fast inference script (batch + GPU optimized) • Outputs: toxicity score/label + hierarchy classification per text • Evaluation report (Precision, Recall, F1 per task) • Clear README (setup, usage, extending labels) I’ve built similar pipelines for large-scale text classification and AI enrichment systems, so I’ll ensure clean code, reproducibility, and production-ready performance. Happy to review your definitions/sample data before starting. Thanks, Akshay
₹7,000 INR in 7 days
4.7
4.7

Dual-task transformer fine-tune — toxicity scoring + hierarchy detection — delivered as a reproducible Python pipeline with ONNX-optimized inference hitting ≥150 samples/sec. That's exactly what I'll build. Here's the approach: Base model: deberta-v3-base (strong on nuanced text classification) with two output heads — one for toxicity rating, one for senior/junior hierarchy detection Training: HuggingFace Transformers + PyTorch, reading your CSVs directly, no preprocessing step on your end Optimization: ONNX Runtime quantization to push inference speed well past your 150 samples/sec threshold on A100 — I've done this before and cut latency by 50% on a production NER model Evaluation: Macro-F1 reported per task on your held-out CSV, targeting ≥0.80 on both heads I'm an AI engineer currently building production NLP pipelines — most recently a privacy/PII classifier (Qwen-3B-Instruct) and a multi-label code classification system achieving 93% accuracy, both CSV-driven with F1-based evaluation. This project is the same pattern, just with your domain definitions plugged in. I'd like to see 20–30 sample rows and your toxicity/hierarchy definitions before we start — not to qualify the project, but to confirm label distribution and finalize the base model choice so there are no surprises at validation time. All four deliverables (training script, weights + inference notebook, README, eval report) ready for handoff.
₹5,000 INR in 7 days
4.1
4.1

Formal definitions do most of the work here. If hierarchy and toxicity aren't operationalized precisely before prompting, you get inconsistent labels regardless of model choice. My approach: structured output schema aligned to your definitions, 20-30 few-shot examples drawn from your corpus, and a baseline run with Qwen or Llama before touching any paid API. You also get an eval set you label once and reuse to track accuracy as prompts evolve. Deliverable is a Python library with CLI so classification runs on new batches without touching internals. M1: Definitions + schema + few-shot set, INR 3,150, 3d. M2: Model integration + eval loop, INR 3,150, 4d. M3: CLI library + accuracy report, INR 3,200, 3d. What does your corpus look like, and do you have any labeled examples already?
₹9,500 INR in 10 days
3.0
3.0

Hi, I can easily DO your work IN 24 HOURS, DM me now to get started, PRICE NEGOTIABLE 100% Work satisfaction is provided
₹4,000 INR in 1 day
2.5
2.5

Hi, You need a text classifier that respects your formal definitions of organizational hierarchy and toxicity—not generic, out-of-the-box models, but something calibrated to how you actually define these concepts. That's the hard part: most off-the-shelf LLMs won't distinguish subtle hierarchy signals your definition captures, and toxicity varies by context and domain. I'd build this using Python with a fine-tuned transformer (DistilBERT or similar) on your labeled data, then expose it via FastAPI for inference. Your formal definitions become the ground truth in the training split; the model learns to recognize patterns *you* care about. This approach stays within budget, runs fast, and avoids per-request cloud API costs. First step: send me your definitions and a few text samples you'd classify, and I'll sketch out the data pipeline and model architecture in 24 hours. Does that scope feel right, or do you need the classifier to handle streaming data or integrate with a specific backend? Best regards, Val
₹1,500 INR in 7 days
1.8
1.8

Greetings I can surely help you for LLM for Hierarchy & Toxicity I am in the IT industry since more than a decade and serve so many clients in building and rebuilding websites, software, and applications I have strong hands-on different cms like webflow, Wordpress, shopify, squarespace, wix and on different programming languages like PHP, Laravel, React, Node.js, HTML, CSS, And I did the migration from HTML to click funnels. I have made so many websites (E-commerce, WordPress, Classified admin, WooCommerce, etc.), bots, softwares, and Mobile applications (Android, IOS, and Huawei Play store) in my entire career. I have strong hands on both the front end and back end. Currently, I am part of the team who are dealing with miscellaneous tasks in dubizzle and Mzad Qatar including design and layouts and they both have more than 1 million users. I believe that you are looking for a web designer and for sure you will get your end desire result with plagiarism-free work and with better quality as I am assuring you this. Package deals can also be done for long-term collaboration as per the client's requirement. Kindly do come on chat so that we can discuss project details further more.
₹1,500 INR in 2 days
0.0
0.0

Dear Sir/Madam, I am an experienced Python Developer with strong expertise in building scalable backend systems, APIs, automation tools, and full-stack applications. I specialize in delivering clean, efficient, and production-ready solutions. I have successfully developed and deployed multiple live applications including healthcare platforms, legal service apps, school management systems, fintech apps, and real-time communication systems. My Core Python Expertise ✔ Django & Django REST Framework ✔ FastAPI (High-performance APIs) ✔ Flask ✔ SQLModel / SQLAlchemy ✔ PostgreSQL / MySQL / MongoDB ✔ Supabase Integration ✔ Authentication (JWT, OAuth) ✔ Payment Gateway Integration (PhonePe, Razorpay, Stripe) ✔ Web Scraping (BeautifulSoup, Selenium) ✔ Automation Scripts ✔ WebSocket & Real-time Systems ✔ Docker Deployment ✔ AWS / VPS Deployment ✔ REST API Design & Optimization What I Can Build For You Secure REST APIs SaaS backend architecture Admin dashboards Real-time chat systems Payment systems Data processing systems Microservices architecture AI/ML API integration Custom business logic systems Recent Project Experience Healthcare booking & wallet system Legal consultation backend platform School ERP & management API Fintech wallet & transaction management Real-time chat application (WebSocket + MQTT) Location-based services & geo APIs
₹8,000 INR in 10 days
0.0
0.0

Lucknow, India
Member since Oct 17, 2016
₹4000-5000 INR
$250-750 USD
$8-15 USD / hour
$30 USD
$250-750 USD
$45 USD
₹600-1500 INR
$8-15 USD / hour
$30-250 CAD
$250-750 USD
$1500-3000 USD
₹75000-150000 INR
$250-750 USD
₹1500-12500 INR
$30-250 AUD
$250-750 USD
$30-250 USD
$250-750 USD
$30-250 USD
$10-30 USD
₹400-750 INR / hour