Skip to content

Conversation

@dbrkn
Copy link
Contributor

@dbrkn dbrkn commented Nov 14, 2025

This PR adds the streaming diarization pipeline used for benchmarking. It currently supports the Deepgram Streaming API, with additional pipelines to be integrated in the future.

@dbrkn dbrkn requested a review from EduardoPach November 14, 2025 22:51
@dbrkn dbrkn changed the title Add Streaming Diarization Pipeline Add Deepgram Streaming Diarization Nov 19, 2025
@EduardoPach EduardoPach changed the title Add Deepgram Streaming Diarization Add Deepgram Streaming Diarization and Speechmatics Nov 21, 2025
@dbrkn dbrkn force-pushed the berkin/streaming-diarization branch from e4b7d44 to 62870c1 Compare December 17, 2025 17:14
@dbrkn dbrkn changed the title Add Deepgram Streaming Diarization and Speechmatics Add Deepgram Streaming Diarization Dec 17, 2025
Copy link
Collaborator

@EduardoPach EduardoPach left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, just a few nits and good to merge

Comment on lines +71 to +83
elif "model_timestamps_confirmed" in output and output["model_timestamps_confirmed"]:
# Fallback to regular transcription without speaker
for timestamp_group in output["model_timestamps_confirmed"]:
for word_info in timestamp_group:
if "word" in word_info:
words.append(
Word(
word=word_info.get("word", ""),
start=word_info.get("start"),
end=word_info.get("end"),
speaker=None,
)
)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Setting the speaker to None will likely cause an error downstream. I'd suggest raising an error in this case since the main reason speaker labels is the core info for Orchestration

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants