Skip to main content
Midsummerr
ListenFeaturesServicesPricingAboutBlog
Sign InGet Started
  1. Blog
  2. /
  3. Updates

You Can Now Bring Your Own Voice to Audiobook Characters

Hybrid-mode projects in Midsummerr can now clone a custom character voice from a recorded or uploaded sample, with consent confirmation and instant sample playback.

Midsummerr|June 15, 2026|4 min read
Watercolor voice waveform beside a studio microphone

TL;DR

Midsummerr's hybrid voice workflow now lets you clone a character voice from a recorded or uploaded sample inside the character modal. The shipped flow is character-level, includes sample-quality guidance plus a rights-and-consent confirmation, and plays the cloned sample back as soon as it is ready.

Ready to price your audiobook? Compare Self-Serve, Director-Led, and Voice Conversion →

In this article

  1. 01What shipped
  2. 02Why this is more useful than another text-only voice prompt
  3. 03The scope is intentionally narrow
  4. 04The rights gate matters as much as the clone itself
  5. 05What this changes for real productions
  6. 06FAQ

If you already know what a character should sound like, text prompting is only half the job. The real production need is simpler: take a voice sample you trust and turn it into that character's working voice inside the project. Midsummerr now supports that on hybrid-mode projects with a new Clone a Voice flow inside the character modal.

That is the actual update. This is not a vague "voice cloning is coming" announcement. It shipped as a character-level workflow on the voices page: open a character, switch to the new tab, record or upload a sample, confirm you have the rights to use it, and clone that voice directly into the project.

What shipped

The new flow lives inside the same modal where you already edit and regenerate character voices. Alongside Edit & Regenerate and Previous Voices, hybrid projects now show a third tab: Clone a Voice.

From there, the shipped workflow does four concrete things:

  1. It lets you record or upload a voice sample for that character.
  2. It gives clear sample-quality guidance before you submit anything: one speaker, quiet room, no music, and roughly 30 to 60 seconds of clean speech.
  3. It requires an explicit rights-and-consent confirmation before the clone runs.
  4. It plays the cloned sample back in the modal once the new voice is ready.

That matters because it turns custom voice work into part of the same production surface instead of an off-platform workaround. You do not need a separate handoff just to test whether a voice reference actually belongs on the page.

Ready to try it yourself?

Create your first audiobook free →

Why this is more useful than another text-only voice prompt

Text prompts are good at direction. They are less good at preserving a voice identity you already have in mind.

Sometimes the problem is not "make this character younger" or "make the line warmer." The problem is that you already have a specific performance reference, actor sample, or narrator texture you want the character to inherit. Before this update, that kind of adjustment meant approximating the result indirectly. Now the workflow is more direct: bring in the sample, run the clone, and judge the result with your ears.

For teams producing dialogue-heavy scenes, that changes the speed of iteration. You can move from abstract voice notes to a concrete per-character sample without leaving the project workflow.

The scope is intentionally narrow

This launch is useful because it is specific, not because it tries to do everything at once.

The shipped feature is:

  • Per character, not a project-wide voice replacement
  • Inside the existing voices modal, not a separate dashboard flow
  • Available on hybrid-mode projects, because that is the voice path currently wired for cloned voices

That narrow scope is the right one for production work. Character voice decisions usually happen one role at a time. You do not need a global "clone everything" button when the creative job is deciding whether this voice belongs on this character.

The rights gate matters as much as the clone itself

Voice cloning features get messy when the product pretends consent is an afterthought. This flow does the opposite. Before the user can submit the sample, the modal requires an explicit confirmation that they have the rights to use the voice and any consent required for AI-generated audiobook output.

That is the right production framing. A custom voice is not just a sound-design decision. It is also a rights decision. If a team is going to use cloned voices in a commercial audiobook workflow, that responsibility needs to be stated at the point of use, not buried elsewhere.

What this changes for real productions

The simplest use case is also the most common one: you have a role that should sound like a real person you already have access to, and you want to test that voice inside the actual cast instead of describing it from scratch.

For authors and producers, that makes a few workflows easier:

  • keeping a known voice reference attached to one important role
  • trying a more specific lead-character voice without rebuilding the rest of the cast
  • comparing a cloned custom option against regenerated synthetic options in the same modal history

It also fits the broader Midsummerr production model. We already let teams cast voices, refine pronunciations, shape dialogue, and iterate chapter by chapter. This update makes the "what should this character sound like?" step more direct when the answer already exists in audio form.

If you want the broader production paths around that workflow, the pricing page shows the current tiers and the features page shows the rest of the production surface. If you want to judge the finished output first, start with Jane Eyre, Frankenstein, or Alice in Wonderland.

FAQ

Where does Clone a Voice live in Midsummerr?

It lives inside the character voice modal on the voices page. On hybrid-mode projects, the modal now includes a Clone a Voice tab next to the existing edit/regenerate and history views.

Can I record a sample, or do I need to upload a file?

You can do either. The shipped flow supports recording a sample directly in the UI or uploading an audio file, then using that sample as the source for the character voice.

Is this a project-wide voice import?

No. The shipped feature is character-level. You clone a voice for one character at a time inside that character's modal.

Does the flow include a consent check?

Yes. Before the clone can run, the user must explicitly confirm that they have the rights to use the voice and any required consent for AI-generated audiobook output.

Key takeaways

  • Hybrid-mode projects now have a character-level Clone a Voice flow inside the voices modal.
  • You can record or upload a sample, confirm rights and consent, and hear the cloned result in the same flow.
  • The feature is scoped narrowly on purpose: it is per character and currently only appears on hybrid voice projects.

Ready to turn your book into a cinematic audiobook?

Full-cast AI voices, original music, and sound effects — production-ready in hours, not months.

Get Started FreeListen to Examples

Keep reading

Watercolor studio microphone for audiobook production services
Product Updates

Midsummerr Expands Into Production Services for Studios

Midsummerr now serves audiobook studios and producers with modular production services: script mapping, cueing, audio elements, and dialogue assembly.

June 10, 2026·5 min read
Watercolor open gate beside studio headphones
GuidesUpdated

Does Audible Accept AI-Narrated Audiobooks in 2026?

Short answer: not through standard ACX submission. Here are the current AI-audiobook rules for Audible, Spotify, Google Play, Apple Books, INaudio, and PublishDrive.

June 14, 2026·9 min read
Watercolor brain formed from flowing audio waves
Guides

The Science of Listening: Why Dramatized Audio Lowers Cognitive Load and Sticks

What the research actually says about audiobook comprehension, cognitive load, and memory — and why expressive, multi-voice, sound-designed narration tends to retain listeners better. Careful framing, honest sourcing.

June 5, 2026·11 min read
Watercolor theater stage with headphones and rising chart bars
Guides

Why Dramatized Audiobooks Are Topping the Charts

Dramatized, full-cast audiobooks are dominating the bestseller charts in 2026. Here's the market data behind the surge — chart dominance, publisher investment, and which genres are driving it.

June 5, 2026·9 min read

Midsummerr

Create premium audiobooks with cinematic quality in one click

[email protected]

Quick Links

HomeFeaturesServicesPricingAbout Us

Resources

BlogSupportRequest Demo

Legal

Terms of ServicePrivacy PolicyRefund Policy

© 2026 Midsummerr. All rights reserved.