say text : the text to speak, which can include intonation characters
[displaying text] : the text to display in the feedback window (if different). Ignored unless Speech Recognition is on.
[using text] : the voice to speak with. (Default is the system voice.)
[speaking rate number] : the rate of speech in words per minute. Average human speech occurs at a rate of 180 to 220 words per minute. (Default depends on the voice used. If “using” is not given, the system speaking rate is the default.)
[pitch number] : the base pitch frequency, a real number from 0 to 127. Values correspond to MIDI note values, where 60 is equal to middle C. Typical pitches range from around 30 to 40 for a low-pitched male voice to perhaps 55 to 65 for a high-pitched child’s voice.
[modulation number] : the pitch modulation, a real number from 0 to 127. A value of 0 corresponds to a monotone in which all speech is at the base speech pitch. Given a pitch value of 46, a modulation of 2 means the widest range of pitches would be 44 to 48.
[volume number] : the volume, a real number from 0 to 1 (default is the system volume).
[stopping current speech boolean] : stop any current speech before starting (default is false). When false, “say” waits for previous speech commands to complete before beginning to speak.
[waiting until completion boolean] : wait for speech to complete before returning (default is true).
[saving to any] : the alias, file reference or path string of an AIFF file (existing or not) to contain the sound output.
When “saving to” or voice-modifying parameters other than “using” are given, text is not displayed in the Speech Recognition feedback window, as it is not used to produce the speech in that case.