Optimize transcription

Sally allows you to optimize and regenerate an existing transcription - for example by choosing a different AI model, adjusting the number of speakers, or selecting the correct meeting environment.

This helps you significantly improve your transcription quality without having to re-record the meeting.

Quick navigation:

Optimize & regenerate transcription
Recommendations for best transcription results

1. Optimize & regenerate transcription

Open the meeting you want to optimize.

info

If you want to optimize the transcription of an imported audio file, open it via the "Recordings" tab.

Open the transcription on the right-hand side.

Click “Optimize transcription”. This opens the settings menu.

Figure 3: 'Optimize transcription' button

In the settings window, you can configure how the new transcription should be generated (see the table below for detailed explanations of each setting):

Setting	Explanation	Options
AI model	The AI model determines how precise and robust Sally's transcription will be. Depending on the language, environment, and audio quality, different models can produce significantly better results. Choose the model that matches your situation best.	- Automatic selection: Sally chooses the best model based on audio quality and spoken language. - Sally v1 Common: Versatile multi-language model with low latency — ideal for general meetings. - Sally v1 Common Office Set-Up: Optimized for office environments with enhanced noise and echo suppression. - Sally v1 German: High-precision transcription model specifically for German language meetings. - Sally v1 German Office Set-Up: German-optimized model with additional tuning for typical office background noise.
Spoken language	Select the language used in the meeting. Choosing the correct language greatly improves accuracy.	Any of our 35 supported languages
Number of speakers	Determines how well Sally can separate and attribute different voices. Providing the correct number improves speaker diarization noticeably.	- Detect automatically - Specify exact number - 1–5 participants - 6–10 participants - 11–15 participants - 16–20 participants - 21–25 participants - More than 25 participants
Meeting environment	Helps Sally choose the correct acoustic profile. A precise environment selection improves transcription quality — especially in noisy or echo-heavy surroundings.	- Detect automatically - Field recording / on-site - Office - Dictation - Interview (face-to-face) - Conference / panel discussion - Media production / podcast / studio - Meeting room / boardroom - Online meeting - Phone call / call center / hotline - Unknown environment - Presentation / lecture / seminar / classroom
Regenerate summary	Recreates the meeting summary based on the new transcription — useful if the original transcript contained inaccuracies or missing context.	Checkbox (on/off)

Click "Regenerate".

Sally will now optimize the transcription based on your selected settings.
This process may take a few moments. You can return at any time to run the optimization again.

2. Recommendations for best transcription results

Sally creates transcripts based on the recorded audio.

The more structured your meeting is and the better the audio quality, the more accurate the following will be:

Transcript
Speaker identification
Summary
To-dos and decisions

The following recommendations help you achieve optimal results from the start and avoid common issues.

2.1. Introduce all participants by name at the beginning

A short round of introductions at the beginning significantly improves speaker identification - especially in longer meetings or meetings with many participants.

tip

Ask all participants to briefly introduce themselves at the beginning.

Example:
“Hi, I’m Anna Müller from Marketing.”
“My name is Tobias Schneider, Sales.”

The clearer the initial voice association, the more precise the speaker separation will be later in the transcript.

2.2. Clearly announce the agenda and topic changes

Sally processes meetings most reliably when they follow a clear structure and topics are distinctly separated. Ideally, one person should clearly present the agenda at the beginning, and topic changes should be explicitly announced.

tip

Examples:
“Today we’ll cover three points: budget, timeline, and responsibilities for the upcoming CRM project.”
“Now let’s move on to the budget.”
“Let’s continue with the next topic: implementation.”

Clear verbal structure improves transcript segmentation and leads to more precise summaries.

2.3. Avoid speaking at the same time

When multiple people speak simultaneously, content cannot be clearly assigned.

tip

Only one person should speak at a time.
If interruptions happen, pause briefly and continue one after another.
Decisions should be clearly summarized by one person at the end.

Example:
“To summarize: The deadline is March 15, and Max is responsible.”

Clearly formulated closing statements significantly improve the quality of the summary.

2.4. Speak clearly and at a consistent pace

Very fast, unclear, or mumbled speech increases the likelihood of transcription errors.

tip

Articulate clearly.
Speak at a natural pace.
Insert short pauses between sentences.
Pronounce technical terms clearly and completely.

For important decisions or action items, it helps to state them consciously and clearly.

2.5. Use a high-quality microphone

Audio quality is one of the most important factors for accurate transcripts.

tip

Use a headset or external microphone instead of your laptop’s built-in mic.
Keep the microphone approximately 15–25 cm (6–10 inches) from your mouth.
Ensure a stable connection for Bluetooth devices.
Avoid changing your distance from the microphone while speaking.

The clearer and closer the audio signal, the more accurate the speech recognition.

2.6. Reduce background noise

Background noise can significantly reduce recognition accuracy.

tip

Choose a quiet room.
Close windows and doors.
Mute yourself when not speaking.
Avoid typing or paper noise while others are speaking.
Use noise-cancelling headphones if necessary.

Continuous noises such as fans, construction sounds, or parallel conversations in the background can negatively affect transcription quality.

2.7. Avoid echo and reverberation

Strong room echo makes speech separation more difficult.

tip

Use rooms with carpets, curtains, or soft furniture.
Avoid large, empty conference rooms.
Make sure multiple devices in the same room are not unmuted at the same time (feedback loops).

Echo can cause statements to appear duplicated or distorted in the transcript.

2.8. Do not switch languages during the meeting

Sally processes each transcript most reliably when one primary language is used.

tip

Stick to one language throughout the meeting.
If multiple languages are required, create separate meetings or recordings.

Frequent language switching may reduce word recognition accuracy.

2.9. Test your setup before important meetings

Many audio issues can be prevented with a short test beforehand.

tip

Start a brief test call.
Check microphone quality and volume levels.
Confirm that the correct microphone is selected.
Make sure Sally is properly added to the meeting.

A short test can save significant follow-up work and ensures clean results.

1. Optimize & regenerate transcription​

2. Recommendations for best transcription results​

2.1. Introduce all participants by name at the beginning​

2.2. Clearly announce the agenda and topic changes​

2.3. Avoid speaking at the same time​

2.4. Speak clearly and at a consistent pace​

2.5. Use a high-quality microphone​

2.6. Reduce background noise​

2.7. Avoid echo and reverberation​

2.8. Do not switch languages during the meeting​

2.9. Test your setup before important meetings​