
Overview
The transcript dataset provides access to high-accuracy transcripts from regular earnings calls and unique events. Each transcript is associated with an event and updated in the dataset when available. If we have audio in English for an event, we will provide a transcript.
How it works
Transcript data is provided in JSON format and is derived from our live transcripts. When a live event finishes, the transcript is processed and augmented with confidence scores, paragraph breaks, and more to enhance readability and usability. The transcript is then made available for download and analysis through the API shortly after the event concludes.
Chapters
Use the Chapters endpoint to retrieve structured segments, or chapters, associated with a transcript. Querying with the appropriate identifier returns a paginated list of chapter objects within the data array. Each chapter object includes a title, a start timestamp, and an end timestamp, which together define distinct sections within the content. See the API reference for more details.
How to access the data
The API allows you to query the transcripts dataset using a variety of parameters, such as ticker, ISIN, date, and more. You can also use the API to retrieve a list of all transcripts available in the dataset. For more information on how to use the API, see the API reference below.
If you have a subscription to all documents we offer, you can retrieve all documents in a single request using the Document endpoints.
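As a rough illustration of querying by parameters, the sketch below assembles a request URL from optional filters. The base URL and the parameter names ticker, isin, and date are placeholders chosen for this example, not confirmed names; only typeIds appears elsewhere on this page. Check the API reference for the actual endpoint and parameters.

```python
from urllib.parse import urlencode

# Placeholder base URL (assumption); replace with the endpoint from the API reference.
BASE_URL = "https://api.example.com/v3/transcripts"


def build_transcript_query(ticker=None, isin=None, date=None, type_ids=None):
    """Return a query URL that contains only the filters that were supplied.

    The parameter names "ticker", "isin", and "date" are assumptions.
    """
    params = {}
    if ticker:
        params["ticker"] = ticker
    if isin:
        params["isin"] = isin
    if date:
        params["date"] = date
    if type_ids:
        # Comma-separated list, e.g. typeIds=15,22
        params["typeIds"] = ",".join(str(t) for t in type_ids)
    if not params:
        return BASE_URL
    # safe="," keeps the comma in typeIds readable instead of encoding it as %2C.
    return f"{BASE_URL}?{urlencode(params, safe=',')}"


url = build_transcript_query(ticker="AAPL", date="2025-05-01")
```

Building the URL separately from sending the request keeps the filter logic easy to test without network access.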
API Reference
Explore the Transcripts API endpoints.
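As one illustration of working with these endpoints, the chapter objects described under Chapters could be processed along the following lines. This is a minimal sketch: the key names title, start, and end, and the use of seconds for timestamps, are assumptions made for the example. Only the data array, and the fact that each chapter carries a title and start and end timestamps, come from the description above.

```python
# Hypothetical single page of a Chapters response (field names are assumptions).
page = {
    "data": [
        {"title": "Prepared remarks", "start": 0.0, "end": 1250.5},
        {"title": "Q&A", "start": 1250.5, "end": 3400.0},
    ]
}


def chapter_durations(response_page):
    """Return (title, duration) pairs for every chapter on one response page."""
    return [
        (chapter["title"], chapter["end"] - chapter["start"])
        for chapter in response_page["data"]
    ]


durations = chapter_durations(page)
```

Because the list is paginated, a real client would repeat this for each page until no further pages are returned.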
Speaker identification
Effective mid-April 2025, all new transcripts for regular earnings events include speaker identification. This feature lets you determine who is speaking at any given time during an event. Older transcripts will not be updated retroactively. There are two types of transcripts available in the API:
- Standard transcripts (typeId = 15): These do not include speaker identification.
- In-house transcripts (typeId = 22): These include speaker identification.
Are the fields nullable?
Yes. It’s not always possible to identify who is speaking. In such cases, the speaker fields may be null in the JSON file. This can happen for various reasons, such as:
- The speaker is not clearly identified in the audio.
- The speaker’s voice is not recognized.
- The speaker is not part of the event’s official roster.
- We are unable to verify the speaker’s identity or role.
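Client code should therefore tolerate missing speaker data. The sketch below shows one defensive way to do that. The payload shape and the key names paragraphs, speaker, name, role, and text are invented for illustration and may not match the real transcript JSON.

```python
import json

# Hypothetical payload; all key names here are assumptions.
raw = """
{"paragraphs": [
  {"speaker": {"name": "Jane Doe", "role": "CEO"}, "text": "Welcome, everyone."},
  {"speaker": null, "text": "Thank you. Next question, please."},
  {"speaker": {"name": null, "role": "Analyst"}, "text": "Can you comment on margins?"}
]}
"""


def speaker_label(paragraph):
    """Build a display label that degrades gracefully when fields are null."""
    speaker = paragraph.get("speaker") or {}  # covers a missing key and an explicit null
    name = speaker.get("name")
    role = speaker.get("role")
    if name and role:
        return f"{name} ({role})"
    return name or role or "Unknown speaker"


transcript = json.loads(raw)
labels = [speaker_label(p) for p in transcript["paragraphs"]]
```

Treating a null speaker object and a null individual field the same way avoids separate error paths for each case.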
What are the SLAs for speaker identification in transcripts?
We strive to provide speaker identification across all transcripts as part of
our ongoing commitment to quality and accuracy. During periods of high
activity—such as earnings season—some events may be prioritized for speaker
attribution based on factors like client interest and market relevance. As a
result, speaker data may appear on certain transcripts sooner than others.
We’re continually enhancing our processes to expand speaker coverage and
ensure timely delivery across all events.
How do I best handle the different types of transcripts when fetching data?
To ensure you always have the most up-to-date and accurate data, we recommend querying the API for both transcript types. This allows you to check whether speaker identification is present in the in-house transcript.
If it is, use that version; if not, fall back to the standard transcript.
You can retrieve both types by specifying the query parameter typeIds=15,22 in your API request. This will return both transcript types for the specified event.
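The fallback described in this answer could be sketched as follows. The typeId values 15 and 22 come from this page. The speakers key used to detect speaker identification is a hypothetical field name, as is the id key in the usage example, so adapt the check to the actual transcript structure.

```python
STANDARD_TYPE_ID = 15  # standard transcript, no speaker identification
IN_HOUSE_TYPE_ID = 22  # in-house transcript, includes speaker identification


def has_speaker_data(transcript):
    """Hypothetical check: assumes speaker data lives under a "speakers" key."""
    return bool(transcript.get("speakers"))


def pick_transcript(transcripts):
    """Prefer the in-house transcript when it carries speaker data.

    Otherwise fall back to the standard transcript, or to whatever is available.
    Returns None when the list is empty.
    """
    by_type = {t["typeId"]: t for t in transcripts}
    in_house = by_type.get(IN_HOUSE_TYPE_ID)
    if in_house is not None and has_speaker_data(in_house):
        return in_house
    return by_type.get(STANDARD_TYPE_ID) or in_house


# Usage with two transcripts returned for the same event.
results = [
    {"typeId": 15, "id": "a"},
    {"typeId": 22, "id": "b", "speakers": [{"name": "Jane Doe"}]},
]
chosen = pick_transcript(results)
```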
There is no speaker identification for a specific event. Why?
If speaker identification, or the transcript itself, is missing for a specific event, check the following:
- Make sure the event is a regular earnings event (Earnings Call) in the API.
- Make sure you are on the v3 API.
- Make sure you have access to the transcript dataset.
- Did the event recently take place? If so, the transcript may not be available yet.