Audio Processing
SAMMY Three includes advanced audio processing capabilities to ensure high-quality voice interactions in any environment.Overview
The audio processing system provides multiple layers of enhancement to deliver crystal-clear voice communication.Noise Suppression
Remove background noise with browser-native or AI-powered suppression
Noise Gate
Software-based filtering to eliminate ambient sounds
Environment Presets
Pre-configured settings optimized for different environments
Configuration
Enable audio processing in your SAMMY Three configuration.Noise Suppression
Two powerful noise suppression modes are available to match your needs.- Browser Native
- Koala AI
Default Browser Suppression
Uses the browser’s built-in echo cancellation and noise suppression capabilities.- No additional configuration required
- Works on all modern browsers
- Zero latency
- Free to use
- Limited effectiveness in very noisy environments
- Quality varies by browser and device
Enhancement Levels
Choose the right suppression level for your environment:Light
Best for: Quiet offices, home officesMinimal processing that preserves natural voice quality while removing light background noise.
Medium
Best for: Open offices, cafesBalanced approach that removes moderate noise while maintaining voice clarity.
Aggressive
Best for: Noisy environments, public spacesMaximum noise removal that prioritizes voice isolation over natural sound.
Noise Gate
Software-based noise gate filters out background noise between speech segments.How It Works
The noise gate acts like an automatic mute button, opening only when you speak and closing during silence to eliminate ambient noise.Configuration Parameters
Volume level (0-1) required to open the gate. Lower values are more sensitive.
- 0.02-0.03: Very sensitive, good for quiet environments
- 0.04-0.06: Standard setting for most environments
- 0.07-0.10: Less sensitive, for noisy environments
Time in milliseconds for the gate to fully open when speech is detected.
- 10-20ms: Very fast, may clip beginning of words
- 30-50ms: Standard, natural sounding
- 60-100ms: Slower, smoother transitions
Time in milliseconds to keep gate open after speech stops, preventing choppy audio.
- 200-300ms: Quick release, good for fast conversations
- 400-500ms: Standard, handles normal pauses
- 600-800ms: Longer hold, better for thoughtful speech
Time in milliseconds for the gate to fully close after hold time expires.
- 50-100ms: Fast close, may sound abrupt
- 150-200ms: Standard, natural fade
- 250-400ms: Slow close, very smooth
Environment Presets
Pre-configured audio settings optimized for common environments.Audio Debugging
Built-in tools to diagnose and fix audio issues.Stutter Analyzer
Debug audio stuttering and performance issues:Common Issues and Solutions
- Echo Problems
- Choppy Audio
- Background Noise
- Muffled Voice
Issue: Hearing echo or feedbackSolutions:Additional Tips:
- Use headphones when possible
- Reduce speaker volume
- Increase distance between mic and speakers
Best Practices
Test in Target Environment: Always test audio settings in the actual environment where the agent will be used
Start with Presets: Use environment presets as a starting point, then fine-tune if needed
Monitor Performance: Enable debug mode during development to catch audio issues early
User Feedback: Provide visual feedback for volume levels and mute status
Fallback Options: Have a text input fallback for environments where audio isn’t suitable
Advanced Configuration
Combining Multiple Techniques
Layer different audio processing techniques for optimal results:Dynamic Adjustment
Adjust audio settings based on user feedback or environment detection:Platform Considerations
Audio processing behavior may vary across different platforms and browsers. Always test on your target platforms.
Browser Support
Feature | Chrome | Firefox | Safari | Edge |
---|---|---|---|---|
Native Suppression | ✅ | ✅ | ✅ | ✅ |
Echo Cancellation | ✅ | ✅ | ✅ | ✅ |
Noise Gate | ✅ | ✅ | ✅ | ✅ |
Koala AI | ✅ | ✅ | ⚠️ | ✅ |
Safari may require additional permissions for advanced audio processing features.