AI Text-to-speech Jobs
I run a women’s clothing store in Abu Dhabi (near Sultan Bin Zayed) and I want our WhatsApp line (+2TQT 50M5) to feel as if a knowledgeable, always-on salesperson is answering. The goal is to create an AI-powered employee that can instantly interpret incoming messages and respond in clear, conversational Arabic and English. The assistant must: • Guide shoppers through order placement—sending a direct purchase link when appropriate, walking them through an automated catalog browsing flow, or seamlessly handing the chat to a human for final confirmation. • Handle product inquiries of every kind: confirm real-time availability, share size/fabric details and specs, and suggest coordinated looks or alternative items when something is out of stock. • Provide pre...
Seeking Full-Stack AI Developer/Agency for "Human-in-the-Loop" EdTech Content Generation Platform Project Overview We are looking for an experienced Full-Stack AI Developer or a specialized AI development agency to build a robust, AI-powered content generation "wrapper" for our EdTech company. The goal of this platform is to empower our in-house content creators. Instead of the AI acting entirely autonomously, a human content creator will upload foundational materials (PDFs, course overviews) into the system. The platform will then use various AI models and media APIs to generate drafts of course materials (video lectures, podcast-style audio, editable presentations, and mock tests). The human creator will then review, edit, and finalize these assets within the platfo...
I run – a platform that gives small businesses their own 24/7 AI phone receptionist named “Sofie”.I need a world-class, production-grade system prompt for Retell AI that powers is not a simple chatbot or voice command handler. This is a full professional AI receptionist that must handle real customer phone calls for many different types of businesses (plumbers, HVAC technicians, dentists, salons, lawyers, chiropractors, auto repair shops, etc.) using one single global prompt must be extremely intelligent, natural, and adaptive so it works perfectly across all business types without needing separate prompts for each Requirements:Must work with Retell AI variables such as {{business_name}}, {{business_type}}, {{current_date}}, {{business_timezone}}, etc. Must automatica...
I'm seeking an Azure AI Engineer with expertise in Foundry. Key Requirements: - Experience with Azure Cognitive Services, Azure Machine Learning, and Azure Bot Services - Ability to assist in the conceptualization stage of AI projects and creation of a high level design documentation Ideal Skills and Experience: - Proficiency in Azure AI services - Strong background in AI project development - Excellent conceptualization and planning skills Looking forward to your proposals!
I run a YouTube channel packed with Malayalam-language content—well over a hundred episodes, most of them longer than twenty minutes each. I now want the entire library released in Hindi, but I do not have the time or budget for studio-based voice work. Instead, I’m looking for a partner who can take advantage of the latest AI-driven voice-cloning or speech-to-speech engines to produce a clean, perfectly synced Hindi track that preserves the original speaker’s tone and pacing. Here is what I need from you: • Each Hindi track must match the timing of the Malayalam original so I can simply drop it beneath the footage without re-editing the video. • Voice colour and energy should stay as close as possible to the source; I want viewers to feel they’re he...
TRS Multilingual is looking for contributors with fluent spoken proficiency in Malaysian English (Malay-accented English) to join our AI voice data collection project. The process is simple: record natural speech on topics of your choice and upload the recordings. ABOUT THE PROJECT This project supports the development of AI speech and transcription models. You will record natural spoken content in English with a natural Malaysian accent — including everyday stories, emotional expressions, character role-play, or casual conversations — and review auto-generated transcriptions for accuracy. RESPONSIBILITIES • Record natural voice content in English with a Malaysian accent (10–30 minutes per recording) • Review and verify auto-generated transcriptions • ...
Requirements for the AI Evaluation & Voice Testing Platform, Phase 1 — Voice Load Testing & Core Evaluation Platform This phase will include: • Test Suite Dashboard (create/manage evaluation suites) • SIP / API / Webhook connection modes • Voice load testing framework (SIPp based) • Concurrent call simulation (demo scale locally, scalable to 3000 ports on server) • Deterministic flows (scripted IVR tests) • Agentic flows using LLM for dynamic conversations • Retry logic for failed calls • Technical metrics collection (latency, success rate, call failures) • Basic reporting dashboard Phase 2 — AI Evaluation Engine & Red Teaming Includes: • AI evaluation scoring (intent accuracy, entity extraction) • Hallucin...
I am building a call-based assistant that proactively phones users on iOS and Android, brick phone holds check-ins, and then remembers what was shared so the next conversation feels truly personal. Here’s the flow I have in mind: the agent dials the user from a mobile app, greets them naturally, explores how they are feeling, and, when appropriate, offers coping strategies or simply listens with empathy. After each call it should securely log the dialogue, extract key wellness markers, and store both the raw transcript and structured data so those insights can be surfaced in later sessions. When it starts the next call, it must be able to reference previous discussions (“Last week you mentioned difficulty sleeping—how was your rest since then?”). I’m com...
I’m ready to turn WhatsApp into a fully fledged workplace by installing an AI employee that never sleeps. The project has three clear goals: • Customer support – The bot must instantly answer common FAQs and, when needed, handle complaints with empathy, escalating only the tricky cases to a human. • Data-driven insights – Every conversation should feed a light analytics layer that uncovers customer-behaviour patterns and drops the highlights into a simple dashboard or scheduled report. • Lead generation – While chatting, the AI should qualify prospects, score them, and pass the hottest leads straight to our CRM for follow-up. I’ll provide access to the official WhatsApp Business API (no grey work-arounds). You’re free to stack it wit...
I am looking for an experienced professional to develop a highly realistic digital avatar (digital clone / virtual human) based on my likeness. The avatar will be created from professional photographs taken in my Chroma Key studio, with full control over angles, lighting, and facial expressions. Key Requirements: Highly realistic appearance with convincing resemblance to me from multiple angles. 4K video quality. Excellent lip synchronization and natural, fluid facial expressions. Realistic voice (cloned or high-quality synthetic) with a friendly and approachable tone. Ability to speak naturally from scripts or bullet points. Desired Features: ============== - Easy to use: I want to input a script or bullet points and receive polished, ready-to-upload outputs (video, audio, or text). - W...
I need an AI agent focused on trading execution. This agent will operate on the Nse employing a day trading strategy. Key requirements: - Execute trades based on AI-generated signals - Analyze market trends in real-time - Adapt to fast-moving market conditions Ideal skills and experience: - Expertise in AI and machine learning - Strong understanding of the NSE and day trading dynamics - Proven track record in developing trading algorithms I'm looking for a developer who can create a reliable and efficient AI trading agent.
I’m a practicing physician who prefers to stay out of the spotlight, yet I still want to share trustworthy health knowledge online. I need a complete AI avatar of myself—face, voice, and personality—so I can generate original content in every format I might tackle as a solo creator: short educational videos, in-depth medical-advice articles, quick social-media posts, and anything else the platforms of tomorrow demand. The avatar must: • Look and sound convincingly like me from multiple angles and in 4K video. • Speak in a friendly, approachable tone while preserving medical accuracy. • Be easy for me to drive: I want to feed it scripts or bullet points and receive polished outputs (video, audio, or text) ready for upload. • Run on a workflow I...
I am putting together a low-cost, 10.5 GHz phased-array radar that must spot and track any flying object out to roughly 25–30 km. The core architecture is Pulse-LFM and the whole unit has to stay hand-held and field-serviceable, so every gram and watt matters. I am looking for someone whose background is firmly rooted in radar system development; experience with phased-array hardware, RF front-ends, and real-time signal processing is essential. Off-the-shelf evaluation boards or turnkey kits will not meet our needs—we have to build the hardware from the ground up. Re-using or adapting open-source material on the software side is welcome, and I am open to AI-assisted techniques for algorithm design or UI generation, as long as the end product remains fully traceable and maintai...
Smart Narrator AI - Context-Aware Text-to-Speech Transform boring text into emotionally intelligent, expressive speech Project Overview Smart Narrator AI is an advanced text-to-speech system that understands emotional context and adapts voice characteristics accordingly. Instead of robotic, flat narration, this system analyzes text intent and speaks it with appropriate tone, pace, and emotion. The Problem with Regular TTS Standard TTS Output: "WARNING! System failure!" (monotone, same as everything else) Smart Narrator AI Output: "WARNING! System failure!" (fast, urgent, high pitch - sounds like actual emergency) The Solution: Adaptive Prosody Generation This project implements context-aware prosody generation - the AI decides HOW to speak based on WHAT the text mean...
POSITION BRIEF: Discover Henderson – AI Platform & Systems Manager Discover Henderson is building a next generation tourism platform powered by AI. We are seeking a technical operator who can oversee our entire digital ecosystem — including our website, automations, data systems, and our AI concierge, Ava. This role ensures the platform runs smoothly, evolves continuously, and delivers an exceptional experience for visitors and local partners. ⭐ Role Title AI Platform Manager / No Code Systems Integrator (Wix + Voiceflow + + Airtable) ⭐ Role Summary You will manage and optimize the full Discover Henderson platform, including the website, partner onboarding systems, automations, data flows, and Ava — our AI concierge. Your job is to maintain stability, improve funct...
Goal: Create a FULLY AUTOMATED process that takes a male audio file and converts it into a female voice. What you must do: 1) Take the male audio I provide 2) Convert it into a female voice 3) Upload the final audio into a Google Drive folder 4) Add the Google Drive link in your competition entry 5) Explain clearly what software/tools you will use 6) Explain clearly how you will automate the FULL process from start to finish Important: - The automation must run locally - The final voice must sound perfectly natural and human - The female voice must correctly reproduce the multiple emotions, tone and intonations from the original audio - The result must NOT sound robotic or AI-generated - The automation must be able to process multiple audio files - Do NOT clean the original audio more...
I need a ten-minute YouTube video built entirely with AI-driven 3D animated characters. The piece must carry a professional, serious tone—think corporate explainer rather than cartoon—while still feeling visually engaging. Precise, frame-accurate lip sync is critical. Whether you connect a pre-recorded voice-over I supply or generate a natural-sounding AI voice yourself, the mouth movements have to match flawlessly throughout the full ten minutes. Please use whichever tools you trust—Unreal Engine’s MetaHuman Animator, Blender with FaceWare, or other reliable AI lip-sync solutions—as long as the final result looks polished and on beat. I will provide the script, branding assets, and any reference footage once we begin. Your deliverables are: • A 1920&t...
We are looking for a HIGH-LEVEL AI conversation engineer to help polish and optimize a live AI phone ordering system already running on Twilio + n8n + ElevenLabs + OpenAI. IMPORTANT: The backend infrastructure and payment loop are already mostly working. We are NOT looking for someone to rebuild the platform. We specifically need someone strong in: - AI prompting - conversational flow optimization - reducing hesitation/repetition - human-like ordering behavior - interruption handling - “pay now” behavior - upsell timing logic - manager escalation behavior - fallback/recovery logic - bilingual conversation flow (English/Spanish later) - voice AI optimization in ElevenLabs Current flow: Call → AI order → Stripe payment link → payment confirmation → receip...
I’m launching a Shopify-based T-shirt line and need the entire store built around an AI-driven buying experience. Shoppers must be able to: • Pick their nationality so the on-screen model automatically adjusts skin tone, facial features, and accent. • Choose size, color, fabric, and—because we specialize in Tees—our core Casual style (I may add athletic or formal later). The same AI engine will also generate short spoken videos (15–60 sec) for TikTok, Reels and YouTube Shorts. Each clip should rotate through three themes—product descriptions, promotional hooks and authentic-sounding customer reviews—ready for me to post straight from Shopify’s dashboard. Scope of work 1. Configure and brand a new Shopify store, including payment, shipping...
I’m building a real-time speech-to-text application for Tamil and need a full mobile solution that runs smoothly on both Android and iOS. The core requirement is low-latency live transcription that recognises the major dialects of Tamil—Madurai, Kongu, Nellai, Chennai and Sri Lankan variations—so users hear their words appear on-screen almost instantly, regardless of accent. My priority is accuracy and speed, followed by an interface that keeps the mic open, shows streaming text, and lets users copy, save or share the transcript once they stop speaking. If you can add useful extras such as offline mode, punctuation handling, or a light / dark theme switcher, feel free to mention them. When you respond, focus on your relevant experience: the speech-to-text engines you&rs...
Siamo uno studio commercialistico alla ricerca di un consulente freelance specializzato in intelligenza artificiale, con esperienza nell’analisi dei processi aziendali e nella progettazione di soluzioni personalizzate. L’obiettivo è individuare come integrare l’AI nella nostra organizzazione per migliorare efficienza, automazione e qualità del lavoro, nel rispetto di riservatezza, sicurezza dei dati e normative applicabili. Attività richieste: Analisi del contesto organizzativo e dei processi interni dello studio. Individuazione delle aree in cui l’AI può apportare valore concreto. Proposta di soluzioni personalizzate e realistiche per le nostre esigenze. Eventuale automazione di attività ripetitive o documentali. Definizion...
Recommended Articles Just for You
How user testing can make your product great
Get your product into the hands of test users and you'll walk away with valuable insights that could make the difference between success and failure.