feat: enhance audio device management in Conversational AI demo #179

niraltmark · 2025-05-31T11:14:32Z

This branch will not work without upgrated ElevenLabs JavaScript Client Library

Added functionality to select and manage audio input (microphone) and output (speaker) devices.
Implemented device population and selection UI in the HTML.
Updated JavaScript to handle device selection, status updates, and permissions for microphone access.
Improved user experience by disabling device selectors during active conversations and displaying active device statuses.

- Added functionality to select and manage audio input (microphone) and output (speaker) devices. - Implemented device population and selection UI in the HTML. - Updated JavaScript to handle device selection, status updates, and permissions for microphone access. - Improved user experience by disabling device selectors during active conversations and displaying active device statuses.

vercel · 2025-05-31T11:14:39Z

Someone is attempting to deploy a commit to the ElevenLabs Team on Vercel.

A member of the Team first needs to authorize it.

…ersation-ai-js

vercel · 2025-08-31T18:10:20Z

examples/conversational-ai/javascript/src/app.js

+async function populateDeviceSelectors() {
+  try {
+    // We need to request permission first to get the device labels
+    await navigator.mediaDevices.getUserMedia({ audio: true });


The populateDeviceSelectors() function requests microphone access but never closes the resulting media stream, causing a memory leak and keeping the microphone "active" indefinitely.

View Details

📝 Patch Details

diff --git a/examples/conversational-ai/javascript/src/app.js b/examples/conversational-ai/javascript/src/app.js index bdef975..007bc84 100644 --- a/examples/conversational-ai/javascript/src/app.js +++ b/examples/conversational-ai/javascript/src/app.js @@ -16,9 +16,13 @@ async function getAvailableAudioDevices() { async function populateDeviceSelectors() { try { // We need to request permission first to get the device labels - await navigator.mediaDevices.getUserMedia({ audio: true }); + const stream = await navigator.mediaDevices.getUserMedia({ audio: true }); const devices = await getAvailableAudioDevices(); + + // Clean up the stream immediately after getting device information + stream.getTracks().forEach(track => track.stop()); + const micSelector = document.getElementById("audioDeviceSelector"); const speakerSelector = document.getElementById("speakerDeviceSelector");

Analysis

In the populateDeviceSelectors() function, line 19 calls await navigator.mediaDevices.getUserMedia({ audio: true }) to request microphone permissions so that device labels become available. However, the function never stores a reference to the returned MediaStream or calls .getTracks().forEach(track => track.stop()) to release the microphone resource.

This creates two problems:

Memory leak: The MediaStream object and associated resources are never properly cleaned up

User experience issue: The browser will show the microphone as "in use" (red recording indicator) even when no conversation is active, which can confuse users

The fix is to capture the stream reference and immediately stop all tracks after getting the device information:

const stream = await navigator.mediaDevices.getUserMedia({ audio: true }); // ... get devices and populate selectors ... // Clean up the stream stream.getTracks().forEach(track => track.stop());

This ensures the microphone resource is properly released while still allowing the device enumeration to work correctly.

…n conversation start - Removed the `setSpeakerDevice` function as it was not utilized. - Added logging for the selected speaker device when starting a conversation. - Updated the parameter name for output device ID in the conversation session to enhance clarity.

- Eliminated console logs for microphone and speaker device IDs during conversation initiation to streamline the code and reduce unnecessary output.

Merge branch 'main' into feature/178-provide-input-output-device-conv…

d05ddfc

…ersation-ai-js

vercel bot reviewed Aug 31, 2025

View reviewed changes

nir-fintastic added 2 commits August 31, 2025 23:10

refactor: remove redundant logging in conversation start function

cfce63b

- Eliminated console logs for microphone and speaker device IDs during conversation initiation to streamline the code and reduce unnecessary output.

niraltmark mentioned this pull request Aug 31, 2025

Add audio device controls elevenlabs/packages#185

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: enhance audio device management in Conversational AI demo #179

feat: enhance audio device management in Conversational AI demo #179

Uh oh!

niraltmark commented May 31, 2025 •

edited

Loading

Uh oh!

vercel bot commented May 31, 2025

Uh oh!

vercel bot Aug 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat: enhance audio device management in Conversational AI demo #179

Are you sure you want to change the base?

feat: enhance audio device management in Conversational AI demo #179

Uh oh!

Conversation

niraltmark commented May 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vercel bot commented May 31, 2025

Uh oh!

vercel bot Aug 31, 2025

Choose a reason for hiding this comment

Analysis

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

niraltmark commented May 31, 2025 •

edited

Loading