
Closed
Posted
Paid on delivery
I am managing a podcast production pipeline that uses NotebookLM to generate AI-hosted audio episodes in **PT-PT (European Portuguese — not Brazilian)**. I need a specialist to replace the default NotebookLM voices with ultra-realistic cloned voices, and optionally extend the output into full videocasts featuring talking avatars with accurate lip-sync. ## What You Will Be Doing ### Core Deliverable — Voice Cloning (Required) - Take a 2-minute NotebookLM-generated podcast clip (two AI hosts: male + female, in PT-PT) - Clone the male host voice from a provided voice sample (~200MB clean audio file) - Replace the male voice in the episode with the cloned voice; female voice stays untouched - Output must sound natural, consistent, and broadcast-ready — not robotic or mismatched ### Extended Deliverable — Talking Avatar / Videocast (Strong Plus) - Generate a talking avatar of the male host using provided reference photos - Sync the avatar video to the cloned audio with accurate lip-sync - Final output should be suitable for publishing as a professional videocast --- ## Hard Requirements - **PT-PT accuracy is non-negotiable.** European Portuguese has distinct phonetics, rhythm, vowel reduction, and intonation compared to Brazilian Portuguese. You must demonstrate this distinction in your proposal or demo. - Efficient, repeatable pipeline — minimal manual steps per episode - Deliverables must be production-quality, not rough demos --- ## Tools & Approach I am open to your recommended stack. Common tools in this space include ElevenLabs, RVC, HeyGen, D-ID, and SadTalker — but I care about the result, not the tool. Integrated pipelines are preferred over heavy manual post-processing per episode. Please describe in your proposal: - Your approach to PT-PT voice cloning specifically - Whether your solution is integrated or requires post-production steps - Which tools/platforms you plan to use - Estimated cost per episode (tokens, API credits, or flat fee) once the pipeline is set up ## What to Include in Your Proposal 1. Your experience with voice cloning, specifically in non-English or European Portuguese contexts 2. Whether you accept the base demo only, or base + avatar bonus 3. Your estimated demo delivery time 4. Preferred method for receiving source files (Google Drive, WeTransfer, etc.) 5. Your quote for the full project (pilot + per-episode ongoing rate) 6. Links to previous work — especially talking avatar examples with good lip-sync --- ## Ideal Candidate - Proven experience in high-quality voice cloning - Familiarity with NotebookLM workflows is a plus - Ability to build an efficient, low-manual-intervention pipeline - Available for rapid iteration during the pilot phase Looking forward to your proposals. Candidates who demonstrate PT-PT fluency awareness and avatar capability will be given priority.
Project ID: 40451636
16 proposals
Remote project
Active 6 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
16 freelancers are bidding on average $144 USD for this job

With enthusiasm, I wholeheartedly support your PT-PT Podcast Voice Cloning and Videocast Production project. As an audio producer with over 7 years of experience under my belt, I've relevant skills and an asset. Fluent in European Portuguese, I grasp its distinct phonetics, rhythm, vowel reduction, and intonation which provides incredible clarity to my work. My proficiency will ensure that the outputs are not only natural but also convey the intended message with precision. In terms of voice cloning, I've received great accolades for my work in non-English languages as well as in European Portuguese. Moreover, utilizing a sophisticated – yet streamlined – pipeline is inherent to my workflow. Reducing manual errands is necessary for increased efficiency and faster turnarounds. Although open to recommended tools like ElevenLabs, RVC, D-ID, or SadTalker, my focus truly lies on delivering a high-quality product rather than getting wrapped up in any particular toolset. To get started efficiently, I'd be delighted to receive the source files via Google Drive but remain open to other options as per your preference. My quote for the pilot phase + per-episode ongoing rate should demonstrate you my commitment — excellence derived from my extensive expertise combined with a fair pricing model. Let’s transform your PT-PT Podcast into fascinating videocasts with captivating avatars!
$30 USD in 1 day
5.9
5.9

Hi, This seems related to PT‑PT phonetics and cloning fidelity more than pure model choice — getting vowel quality, rhythm, and intonation right is the hard part. I’ll clone the male host from your 200MB sample, replace the male channel in the 2‑minute NotebookLM clip, and deliver a broadcast-ready audio file plus a short before/after comparison showing PT‑PT correctness. I’ll first run a targeted phonetic check on the sample and a quick test render (same phrasing) to confirm PT‑PT prosody. Pipeline: integrated cloning + render with automated quality checks; optional avatar/video with lip‑sync as an add‑on. I accept the base demo and the avatar bonus. Demo time: 48–72 hours. Send source via Google Drive or WeTransfer. Estimated pilot cost: flat quote after I see the sample; ongoing per-episode rate included in that quote. Ready to start when you share the files. --Smith
$140 USD in 7 days
5.1
5.1

Hi, this is really a speech pipeline problem more than a one-off edit, and the requirement that the result stay PT-PT rather than drift toward BR-PT is the part that matters. The real engineering risk is consistency across episodes: voice identity, PT-PT phonetics, and clean speaker replacement without artifacts at turn boundaries. I usually structure these systems so cloning, dialogue replacement, and final mastering are separated, which keeps the workflow repeatable instead of turning each episode into manual cleanup. The closest match in my past work is TikTok AI Livestream Setup, where I built an integrated TTS + avatar + lipsync pipeline with end-to-end audio routing and sync behavior. AI-Driven Marketing Suite Development -- 2 is also relevant because it was designed as a recurring AI media pipeline, not a rough prototype. For your case, I’d validate the male voice clone first on short PT-PT lines that expose vowel reduction, rhythm, and intonation, then lock the replacement flow before touching the avatar layer. If you want the videocast extension, I’d keep avatar generation downstream from approved audio so lip-sync issues don’t contaminate voice QA. This should be built for repeatable production use, with clear acceptance checks per episode. If useful, I can sketch the episode pipeline and identify where PT-PT QA and speaker-isolation checks need to sit. Thanks, Hercules
$140 USD in 7 days
3.4
3.4

With my rich and diverse background in AI video production, audio editing, and voice talent, I am unequivocally qualified to undertake the demanding project of PT-PT Podcast Voice Cloning & Videocast Production. Having successfully manipulated voices in various languages including non-English and European Portuguese contexts, I possess a deep understanding and respect for the subtle phonetic nuances, intonations, and rhythm that differentiate European Portuguese from its Brazilian counterpart. My portfolio is a testament to my ability to produce ultra-realistic audio episodes that exhibit the required linguistic distinctions while maintaining a natural and coherent flow. Furthermore, as someone who prides themselves on efficiency and precision, I comprehend the value of a streamlined production pipeline and fully integrated tools. In consideration of this, I plan to utilize D-ID and SadTalker for the cloning process along with HeyGen for the talking avatar functionality. My approach will be centered around maximizing automation so as to reduce manual intervention per episode. This strategy not only ensures consistency but also facilitates rapid iteration during the pilot phase—I'm ever-ready to undertake your project with the utmost diligence.
$250 USD in 7 days
2.9
2.9

Hi, I will deliver ultra-realistic cloned voices for your AI-hosted podcast episodes in European Portuguese. With extensive experience in high-quality voice cloning, particularly in non-English contexts, I’ll ensure the male host's voice replicates the nuances of PT-PT with precision. My approach utilizes advanced tools like ElevenLabs for voice cloning and D-ID for avatar generation, allowing for an integrated pipeline that minimizes manual intervention. I understand the importance of a production-ready output, so the cloned voice will sound natural and consistent. For the optional talking avatar, I’ll sync it accurately to the cloned audio, ensuring a professional-quality videocast. I can provide a demo quickly, and once the pipeline is set up, my ongoing rate will be competitive, tailored to your needs. I recommend using Google Drive for file transfers for efficiency. I look forward to discussing this further and am ready for rapid iteration during the pilot phase. Thank you.
$30 USD in 7 days
0.0
0.0

⭐⭐⭐⭐⭐ ✅Hi there, hope you are doing well! I recently completed a project cloning regional voices for podcast AI hosts, delivering natural, broadcast-quality audio that preserved linguistic nuances effortlessly. The key to success in this project is capturing the unique PT-PT phonetics with ultra-realistic voice cloning to avoid robotic or mismatched output. Approach: ⭕ I will use advanced voice cloning models specializing in European Portuguese phonetics. ⭕ Clone the male host voice from your clean sample and replace it seamlessly in your NotebookLM episodes. ⭕ Integrate tools like RVC for voice cloning and HeyGen or D-ID for talking avatar generation with precise lip-sync. ⭕ Develop an automated pipeline minimizing manual intervention for efficient episode processing. ⭕ Deliver production-ready audio and optionally a talking avatar videocast matching your professional standards. ❓ Could you share the clean voice sample via which platform you prefer (Google Drive, WeTransfer)? ❓ Do you have a preferred tool or platform, or do you want me to recommend based on quality? ❓ What is the expected frequency of episodes for ongoing pipeline scaling? I am confident I can deliver a highly natural, consistent PT-PT voice clone and an engaging talking avatar with perfect lip-sync, ensuring production-quality results and smooth workflow. Best regards, Nam
$200 USD in 3 days
0.0
0.0

Hello! I’ve successfully built a similar podcast production pipeline that involved high-quality voice cloning for European Portuguese. This project resulted in a 30% reduction in production time and significantly improved audio quality. I can share examples in chat if you’re interested. For your project, I would leverage a blend of ElevenLabs for voice cloning and D-ID for the talking avatars, ensuring a seamless integration that minimizes manual steps. How do you envision the ideal workflow for episode generation? If you’re open, I can share my previous work and we can discuss how to tailor this to your specific needs. Looking forward to your thoughts!
$140 USD in 7 days
0.0
0.0

Hi, this fits my AI audio/video pipeline work. I’d build a repeatable flow to isolate the male NotebookLM voice, clone with ElevenLabs/RVC, then QC specifically for PT-PT vowel reduction, rhythm, and intonation—not PT-BR. I’ve solved mismatched AI voice output by separating speakers first, then replacing only the target track. Biggest risk is accent drift, so I’d start with a short pilot and tune before batching. Google Drive/WeTransfer works. Base + avatar is fine. Thanks!
$250 USD in 7 days
0.0
0.0

i’ve done very similar recently with multilingual AI voice pipelines using ElevenLabs, RVC and HeyGen, including non-English phonetics where accent drift becomes the real problem. PT-PT needs different vowel compression and cadence handling than BR-PT, so I would not use raw TTS replacement without voice conditioning and cleanup. I’d suggest keeping the female track isolated and processing only the male stem. That keeps timing stable and avoids dialogue sync artifacts. I’d also build the pipeline around reusable inference presets so future NotebookLM episodes need almost zero manual correction. First I’ll extract and clean the male voice sample, train/test the clone, then replace the NotebookLM track and rebalance mastering for broadcast consistency. After that I can extend it into HeyGen or SadTalker avatars with proper lip-sync timing. Do you already have PT-PT reference transcripts for pronunciation validation? Also, do you want fully automated batch episode generation or semi-supervised QC before publishing? Best, Meena S.
$140 USD in 7 days
0.0
0.0

I understand the PT-PT requirement clearly — European Portuguese voice rhythm, vowel reduction, and pronunciation are very different from Brazilian Portuguese, so maintaining authentic PT-PT phonetics is critical for believable output. The workflow can be structured to stay scalable and low-touch after the pilot phase. I can deliver: • Base voice-cloning demo • Optional avatar/video version • Production-ready export pipeline • Per-episode workflow documentation Demo delivery: 1–2 days File transfer: Google Drive or WeTransfer both work. Available for rapid iteration during the pilot phase.
$140 USD in 7 days
0.0
0.0

Drawing on over 9+ years in web and mobile development, I can bring a unique perspective to your PT-PT Podcast Voice Cloning and Videocast Production project. Though my specialties are in E-commerce and CMS based websites, I've always been fascinated by AI and how it can enhance human experiences, as is the case here. The distinctive demands of PT-PT voices, including phonetics, rhythm, vowel reduction, and intonation hold no secrets for me. While I am not currently working with the exact tools mentioned in your project description, my motto always has been "results over tools." With that said, I'm confident in adapting to any necessary pipeline quickly – even during the pilot phase – to ensure our solution provides you with an efficient, repeatable process with minimal manual steps per episode. In regards to your extended deliverables of creating a talking avatar/videocast that syncs perfectly with voicecloned audio, I'm eager to take on this challenge. These tasks are often what make experienced developers such as myself rise above. I also have a wealth of past works that could be relevant examples for your project for your peace of mind.
$140 USD in 7 days
0.0
0.0

Hi, there! I recently worked on a voice cloning project for a podcast and would love to assist you with your PT-PT podcast voice cloning and videocast production. In that project, I focused on creating high-quality voice clones that accurately reflected the nuances of European Portuguese. I utilized advanced tools like ElevenLabs and RVC to ensure that the cloned voices sounded natural and broadcast-ready. One challenge was maintaining the distinct phonetics and intonation of PT-PT while ensuring the output was seamless; I addressed this by carefully selecting voice samples and conducting thorough testing. I offer to replace the default NotebookLM voices with ultra-realistic clones for your podcast episodes. My unique approach includes providing a streamlined pipeline that minimizes manual steps per episode, ensuring efficiency and consistency. For the extended deliverable, I can create a talking avatar of the male host, syncing the video to the cloned audio for a professional videocast output. With my background in audio production and AI voice technologies, I can ensure that your project meets the highest quality standards. If I use my previous experience, your project will likely be completed successfully. Hope to discuss this in detail. Through detailed discussion, I think I can find the better solution to finish your project successfully. Thank you!
$140 USD in 7 days
0.0
0.0

Cape Town, South Africa
Payment method verified
Member since Mar 20, 2026
$30-250 USD
$30-250 USD
$250-750 USD
$250-750 USD
$10-30 USD
£250-750 GBP
$30-250 USD
$30-250 USD
$25-50 USD / hour
$30-250 USD
₹600-1500 INR
₹400-750 INR / hour
$15-25 USD / hour
€300 EUR
₹600-1500 INR
₹400-750 INR / hour
$30-250 USD
$30-250 USD
$250-750 USD
₹600-1500 INR
₹600-1500 INR
$10-30 USD
₹600-1500 INR
$10-30 USD
₹1500-12500 INR