Canary Speech

Responses and Scores

Every response you submit and every score you fetch has an associated code, an associated type, and attached data. Each score or response must be a JSON object whose fields include code, type, and data.

The code (of type string) serves as an identifier for scores and responses and should be used to index a response or score. Each type (of type string) determines what significance the response or score may have and, in some cases, also determines the structure of the data (of type object) field. Details on each type of response or score, what it means, and how its corresponding data is structured are included below.
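As a rough illustration of the structure described above, the following Python sketch checks that an object carries the three fields with the stated types and indexes a list of objects by code. The function names and the sample code value "q1_recording" are hypothetical and are not part of the Canary Speech API.

```python
from typing import Any


def validate_envelope(item: Any) -> list[str]:
    """Return a list of problems found in a response or score object."""
    if not isinstance(item, dict):
        return ["item must be a JSON object"]
    problems = []
    # code and type are strings; data is an object (a dict once parsed).
    expected = {"code": str, "type": str, "data": dict}
    for field, kind in expected.items():
        if field not in item:
            problems.append(f"missing field: {field}")
        elif not isinstance(item[field], kind):
            problems.append(f"field {field} must be of type {kind.__name__}")
    return problems


def index_by_code(items: list[dict]) -> dict[str, dict]:
    """Index responses or scores by their code field."""
    return {item["code"]: item for item in items}


# Hypothetical example object; the code value is invented for illustration.
example = {
    "code": "q1_recording",
    "type": "recordedResponse",
    "data": {"duration": 4.2},
}
print(validate_envelope(example))
```

An empty list means the object has the expected shape; it says nothing about whether the code or type is one your project actually recognizes.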

For specific details on what scores and responses are part of a project and survey, please consult your personalized information packet.

Response Type

The following is a list of recognized response types. The enum value of the response type must be included in every uploaded response.

Type Enum Value Data Structure Description
Recorded Response recordedResponse
{
  "duration": 4.2
}
duration: The length of the audio file in seconds.
The response to any question where the subject answers by recording their voice. Each recorded question must have a corresponding uploaded recording.
Text textField
{
  "answer": "example text"
}
answer: Any text
The response to any question involving a single line or multiline text field.
Number numberField
{
  "answer": 123.4
}
answer: Any number
The response to any question involving a number text box or a number picker.
Date dateField
{
  "answer": "2022-01-15"
}
answer: An ISO-8601 compliant date string
The response to any question asking for a specific date.
Time timeField
{
  "answer": "13:30:00"
}
answer: An ISO-8601 compliant time string in 24-hour format
The response to any question asking for a time of day.
Date-Time dateTimeField
{
  "answer": "2022-01-15T13:30:00Z"
}
answer: An ISO-8601 compliant datetime string with time zone marker
The response to any question asking for a specific date and time. The time zone marker must be present, either as an hour offset value or as "Z" denoting UTC time.
Multiple Choice multipleChoice
{
  "answer": 3
}
answer: The index (or indices) of the chosen option
The response to any question that gives the subject a number of preset options to choose from. The answer can be either a single index or, if the corresponding question allows selecting multiple options, an array of indices.
Range Choice rangeChoice
{
  "answer": 0.5
}
answer: The chosen value within a given range
The response to any question where the answer must fall within a specified range (e.g., a slider).
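The data structures listed above could be checked on the client before upload with something like the sketch below. It is only an approximation under stated assumptions: the helper names are hypothetical, Python's fromisoformat accepts a subset of ISO-8601 (and the exact subset varies by Python version), indices are assumed to be integers, an empty multiple-choice array is accepted here although the source does not say whether it is allowed, and the bounds of a range choice are not known, so only the numeric type is checked.

```python
from datetime import date, datetime, time
from numbers import Real


def _is_number(value) -> bool:
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, Real) and not isinstance(value, bool)


def _is_index(value) -> bool:
    return isinstance(value, int) and not isinstance(value, bool)


def _parses(parser, value) -> bool:
    """True if value is a string that the given ISO parser accepts."""
    if not isinstance(value, str):
        return False
    try:
        parser(value)
        return True
    except ValueError:
        return False


def _is_zoned_datetime(value) -> bool:
    """True if value parses as a datetime that carries a time zone."""
    if not isinstance(value, str):
        return False
    # Older Python versions do not accept a trailing "Z" in fromisoformat.
    text = value[:-1] + "+00:00" if value.endswith("Z") else value
    try:
        return datetime.fromisoformat(text).tzinfo is not None
    except ValueError:
        return False


def _is_choice(value) -> bool:
    # A single index, or a list of indices for multi-select questions.
    if isinstance(value, list):
        return all(_is_index(item) for item in value)
    return _is_index(value)


# Maps each response type enum value to (data key, check function).
CHECKS = {
    "recordedResponse": ("duration", _is_number),
    "textField": ("answer", lambda v: isinstance(v, str)),
    "numberField": ("answer", _is_number),
    "dateField": ("answer", lambda v: _parses(date.fromisoformat, v)),
    "timeField": ("answer", lambda v: _parses(time.fromisoformat, v)),
    "dateTimeField": ("answer", _is_zoned_datetime),
    "multipleChoice": ("answer", _is_choice),
    "rangeChoice": ("answer", _is_number),
}


def data_is_valid(response_type: str, data: dict) -> bool:
    """Check a data object against the structure listed for its type."""
    if response_type not in CHECKS:
        return False
    key, check = CHECKS[response_type]
    return key in data and check(data[key])
```

For example, data_is_valid("dateTimeField", {"answer": "2022-01-15T13:30:00"}) is rejected because the time zone marker is missing.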

Score Type

The following is a list of all recognized non-verbal score types. These types will be present when fetching scores via the Canary Speech API.

Type Enum Value Data Structure Description
None none
{
  "result": 50
}
result: The static score value
No calculation was involved with this score. It is either a static value or a value taken directly from a response.
Sum sum
{
  "result": 89
}
result: The calculated score value
This score is a summed value from one or more responses and/or one or more other scores. The value will always be numeric.
Count count
{
  "result": 4
}
result: The calculated score value
This score references one or more responses and/or one or more other scores and counts how many of them give a non-empty, non-falsy value (that is, any value other than null, false, 0, an empty string, or an empty list). The value will always be numeric.
Concatenation concat
{
  "result": [1, 2, 3]
}
result: The calculated score value
This score references one or more responses and/or one or more other scores and concatenates their values together. If all inputs are lists, the output will be a single list; if all inputs are maps, the output will be a single map; otherwise, the output will be all inputs converted to a string (if necessary) and concatenated to a single string.
Average average
{
  "result": 11.5
}
result: The calculated score value
This score sums the values of one or more responses and/or one or more other scores, then calculates the numerical average from that sum. The value will always be numeric.
Mean, Median, and Mode mmm
{
  "result": {
    "mean": 21.2,
    "median": 24,
    "mode": 17
  }
}
result: A map containing the calculated mean, median, and mode values
This score derives statistical mean, median, and mode values from a collection of one or more responses and/or one or more other scores.
Standard Deviation stddev
{
  "result": {
    "average": 30,
    "stddev": 1.2
  }
}
result: A map containing the calculated average and standard deviation values
This score derives the statistical average and standard deviation from a collection of one or more responses and/or one or more other scores.
Statistics stddev
{
  "result": {
    "min": 0,
    "max": 100,
    "count": 23,
    "sum": 426,
    "mean": 18.5,
    "median": 23,
    "mode": 17,
    "stddev": 6.5
  }
}
result: The calculated statistical values (See description)
This score takes a collection of one or more responses and/or one or more other scores and calculates the minimum/maximum values, length (count) and sum of the collection as well as deriving the statistical mean, median, mode, and standard deviation.
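To make the descriptions of the count, concatenation, and statistics scores more concrete, here is a minimal Python sketch of how such values could be derived from a list of inputs. It is not Canary Speech's implementation. The function names are hypothetical, and several details are assumptions the source does not settle: how duplicate map keys are merged, whether the standard deviation is a population or sample value (population is used here), and which value is reported when there is more than one mode.

```python
import statistics
from typing import Any


def count_score(values: list[Any]) -> int:
    """Count inputs that are non-empty and not falsy."""
    # Python truthiness treats None, False, 0, "", and [] as falsy,
    # matching the values listed in the text. It also treats an empty
    # dict as falsy, which the text does not mention.
    return sum(1 for v in values if v)


def concat_score(values: list[Any]) -> Any:
    """Concatenate inputs following the list / map / string rule."""
    if all(isinstance(v, list) for v in values):
        return [item for v in values for item in v]
    if all(isinstance(v, dict) for v in values):
        merged: dict = {}
        for v in values:
            merged.update(v)  # assumption: later keys overwrite earlier ones
        return merged
    # Mixed inputs: convert each to a string and join them.
    return "".join(str(v) for v in values)


def statistics_score(values: list[float]) -> dict[str, float]:
    """Compute the fields shown in the Statistics data structure."""
    return {
        "min": min(values),
        "max": max(values),
        "count": len(values),
        "sum": sum(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        # With several modes, Python 3.8+ returns the first one encountered.
        "mode": statistics.mode(values),
        # Assumption: population standard deviation.
        "stddev": statistics.pstdev(values),
    }
```

With these definitions, concat_score([[1], [2, 3]]) yields a single list, while a mix of types such as [1, "a"] falls through to string concatenation.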

Vocal Scores

In addition to calculated non-verbal scores, there are also scores based on vocal processing. The value and structure of these scores vary depending on the score code and on the target model used to process the audio files. Your personalized information packet will contain detailed information. The most common scores are:

Behavioral Scoring Models

Behavioral Screening

Type Score Code Data Structure Description
Screening Result BehavioralScreening_Overall
{
  "result": "Additional Screening Not Recommended"
}
result: Either "Additional Screening Not Recommended" or "Additional Screening Recommended"
A general test for whether the speaker's vocal features are indicative of mental health issues.

Anxiety

Type Score Code Data Structure Description
Overall Anxiety Anxiety_Overall
{
  "result": "low"
}
result: Either "low", "medium", or "high"
A categorization of generalized anxiety disorder (GAD) as indicated by the speaker's vocal biomarkers.

Depression

Type Score Code Data Structure Description
Overall Depression Depression_Overall
{
  "result": "low"
}
result: Either "low", "medium", or "high"
Whether the speaker's vocal features are indicative of depression.

Stress

Type Score Code Data Structure Description
Overall Stress Stress_Overall
{
  "result": "low"
}
result: Either "low", "medium", or "high"
A general indication of the speaker's stress.

Mood

Type Score Code Data Structure Description
Overall Mood Mood_Overall
{
  "result": "excellent"
}
result: Either "low", "good", or "excellent"
A general indication of the speaker's mood or state of mind.

Cognitive Scoring Models

Cognitive Screening

Type Score Code Data Structure Description
Screening Result CognitiveScreening_Overall
{
  "result": "Additional Screening Not Recommended"
}
result: Either "Additional Screening Not Recommended" or "Additional Screening Recommended"
A general test for whether the speaker's vocal features are indicative of cognitive health issues.

Alzheimer's

Type Score Code Data Structure Description
Alzheimer Alzheimer_Overall
{
  "result": "negative"
}
result: Either "negative" or "positive"
Whether the speaker's vocal features are indicative of Alzheimer's.

Mild Cognitive Impairment (MCI)

Type Score Code Data Structure Description
MCI MCI_Overall
{
  "result": "negative"
}
result: Either "negative" or "positive"
The screening result of whether the speaker's voice is indicative of MCI or not.

Parkinson's

Type Score Code Data Structure Description
Parkinson parkinson_score
{
  "result": "healthy"
}
result: Either "healthy" or "parkinson"
The screening result of whether the speaker's voice is indicative of Parkinson's disease or not.

Misc. Scoring Models

Energy

Type Score Code Data Structure Description
Power Energy_Power
{
  "result": 50
}
result: A value ranging from 0-100
A measurement of how loud the speaker's voice is.
Speed Energy_Speed
{
  "result": 50
}
result: A value ranging from 0-100
A measurement of how quickly the speaker speaks.
Dynamics Energy_Dynamics
{
  "result": 50
}
result: A value ranging from 0-100
A measurement of how much the speaker's voice varies in tone and speed.
Overall Energy_Overall
{
  "result": 50
}
result: A value ranging from 0-100
An overall measurement of the energy of a speaker's voice.

Wellness

Type Score Code Data Structure Description
Wellness Wellness_Overall
{
  "result": "Low"
}
result: Either "Low", "Med", or "High"
A general measurement of a speaker's wellness, which can be used to assess the speaker's need for additional support.

If your vocal score code isn't in this list, you may need to consult your personalized information packet for the desired labels and values.
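One way a client could handle fetched vocal scores is to look up the score code in a table of the labels listed above and flag anything unrecognized for review against the information packet. The sketch below assumes each fetched score is a parsed object with code and data fields as described earlier; the function name and return strings are hypothetical. Labels are compared case-sensitively, since the listed values differ in capitalization (for example "low" for Anxiety_Overall but "Low" for Wellness_Overall).

```python
SCREENING = {
    "Additional Screening Not Recommended",
    "Additional Screening Recommended",
}
LOW_MEDIUM_HIGH = {"low", "medium", "high"}
NEGATIVE_POSITIVE = {"negative", "positive"}

# Labels copied from the tables in this section, keyed by score code.
KNOWN_LABELS = {
    "BehavioralScreening_Overall": SCREENING,
    "CognitiveScreening_Overall": SCREENING,
    "Anxiety_Overall": LOW_MEDIUM_HIGH,
    "Depression_Overall": LOW_MEDIUM_HIGH,
    "Stress_Overall": LOW_MEDIUM_HIGH,
    "Mood_Overall": {"low", "good", "excellent"},
    "Alzheimer_Overall": NEGATIVE_POSITIVE,
    "MCI_Overall": NEGATIVE_POSITIVE,
    "parkinson_score": {"healthy", "parkinson"},
    "Wellness_Overall": {"Low", "Med", "High"},
}

# Energy scores carry a number from 0 to 100 instead of a label.
ENERGY_CODES = {
    "Energy_Power",
    "Energy_Speed",
    "Energy_Dynamics",
    "Energy_Overall",
}


def check_vocal_score(score: dict) -> str:
    """Return "ok", "unexpected value", or "unknown code" for a score."""
    code = score["code"]
    result = score["data"]["result"]
    if code in ENERGY_CODES:
        is_number = isinstance(result, (int, float)) and not isinstance(result, bool)
        return "ok" if is_number and 0 <= result <= 100 else "unexpected value"
    if code in KNOWN_LABELS:
        is_known = isinstance(result, str) and result in KNOWN_LABELS[code]
        return "ok" if is_known else "unexpected value"
    # Not in this list: consult the personalized information packet.
    return "unknown code"
```

A result of "unknown code" is not an error in itself; it only means the code is not among the common scores documented here.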

Further Reading