You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
|[Sentiment and Emotion Analysis](/chapters/audio-intelligence/pages/sentiment-analysis)| Extract sentiments and emotions from the audio, like confusion or interest. |
17
18
|[Name Entity Recognition](/chapters/audio-intelligence/pages/named-entity-recognition)| Automatically identifies and categorizes key information in the audio, like phone number or email address. |
18
-
|[Summarization](/chapters/audio-intelligence/pages/summarization)| Get important information from your conversation. This analysis is performed after the real-time transcription is stopped. |
19
-
20
-
21
-
22
-
## IN PROGRESS bellow this part
23
-
24
-
25
-
<Note>All the configuration properties described below are defined in the [POST /v2/live endpoint](/api-reference/v2/live/init).</Note>
26
-
27
-
28
-
## Multiple channels
29
-
30
-
If you have multiple channels in your audio stream, specify the count in the configuration:
31
-
32
-
```json
33
-
{
34
-
"channels": 2
35
-
}
36
-
```
37
-
38
-
Gladia's real-time API will automatically split the channels and transcribe them separately.
39
-
For each utterance, you'll get a `channel` key corresponding to the channel the utterance came from.
40
-
41
-
<Warning>
42
-
Transcribing an audio stream with multiple channels will be billed exponentially. For example, an audio stream with 2 channels will be billed as double the audio duration, even if the channels are identical.
43
-
</Warning>
19
+
|[Summarization](/chapters/audio-intelligence/pages/summarization)| Get important information from your conversation. This analysis is performed after the real-time transcription is stopped. |
You have nothing to configure. If you have multiple channels in your audio file with different content, the API will automatically transcribe each of them.
19
+
20
+
### Results
21
+
22
+
Each utterance will include a `channel` key, indicating the channel from which the utterance came from.
23
+
24
+
```json
25
+
{
26
+
"utterances": [
27
+
{
28
+
"text": "Vi tester en ganske kort melding.",
29
+
"language": "no",
30
+
"start": 1.02077,
31
+
"end": 3.9749399999999997,
32
+
"confidence": 0.86,
33
+
"channel": 0,
34
+
"speaker": 0,
35
+
"words": [...]
36
+
},
37
+
{...}
38
+
],
39
+
}
40
+
```
41
+
42
+
<Note>
43
+
The cost of transcribing an audio with multiple channels increases proportionally to the number of channels. For instance, a 2-channel audio will be billed double the audio duration, unless both channels are identical.
44
+
</Note>
45
+
46
+
</Tab>
47
+
48
+
<Tabtitle='Real-Time STT'>
49
+
50
+
### Configuration
51
+
52
+
If you have multiple channels in your audio stream, specify it in the `channels` parameter.
53
+
54
+
```json
55
+
{
56
+
"channels": 2
57
+
}
58
+
```
59
+
60
+
### Results
61
+
62
+
Each utterance will include a `channel` key, indicating the channel from which the utterance came from.
63
+
64
+
```json Real-Time
65
+
{
66
+
{
67
+
"type": "transcript",
68
+
"session_id": "...",
69
+
"created_at": "2025-01-17T09:01:30.197Z",
70
+
"data": {
71
+
"id": "00_00000006",
72
+
"is_final": true,
73
+
"utterance": {
74
+
"text": "Bye.",
75
+
"start": 22.539534999999994,
76
+
"end": 22.729984999999996,
77
+
"language": "en",
78
+
"confidence": 1,
79
+
"channel": 0,
80
+
"words": [...]
81
+
}
82
+
}
83
+
},
84
+
}
85
+
```
86
+
87
+
<Note>
88
+
The cost of transcribing an audio stream with multiple channels increases proportionally to the number of channels. For instance, a 2-channel audio stream will be billed as double the audio duration, even if both channels contain identical audio.
0 commit comments