The UK Home Automation Archive

Re: Text to Speech with wav File Introduction


  • Subject: Re: Text to Speech with wav File Introduction
  • From: "mstovenour" <Mstovenour@xxxxxxxxx>
  • Date: Fri, 01 May 2009 13:59:30 -0000

Excellent.  I thought that would be the best way to accomplish this.  Do
you think it is reasonable to allow URLs on the Sound= line?  I'm
thinking that the speech server should be able to retrieve the sound
file so that I'm not forced to manually copy the sounds to each speech
server.  Also the text could be optional if all I wanted was a chime.  I
know this might be more functionality than most speech servers will
implement.  My client side will be user configurable so the user could
configure a simple sound name if the user wanted to interoperate with
speech servers that do not support URLs.
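
A minimal sketch of how a speech server might tell a plain sound name from a URL on the Sound= line. The function name `classify_sound` and the list of accepted schemes are hypothetical; they are not part of xAPSpeech or the tts.speak schema.

```python
from urllib.parse import urlsplit

# Hypothetical: schemes this speech server is willing to fetch sounds from.
FETCHABLE_SCHEMES = {"http", "https", "ftp"}


def classify_sound(value):
    """Classify a Sound= value as ("none", ""), ("url", v) or ("name", v)."""
    value = value.strip()
    if not value:
        # No sound requested: speak the text only.
        return ("none", "")
    parts = urlsplit(value)
    if parts.scheme.lower() in FETCHABLE_SCHEMES and parts.netloc:
        # Looks like a retrievable URL with a host part.
        return ("url", value)
    # Anything else is treated as a simple sound name.
    return ("name", value)
```

A server that does not support URLs could simply ignore the "url" case and play its default sound, which keeps it interoperable with clients configured either way.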

I guess an alternative is to only use a sound name in the xAP message
and then implement a mapping function in the speech server.  Then a
simple sound name like 'chime' might be mapped in the server to
'ftp://ha.example.org/sounds/chime.wav'.  Hmm... the more I think about
this the more I like your implementation where the messaging is generic
and the server decides how to locate the actual sound media.  Nothing
there prevents me from making a smarter speech server that can download
its own sound files.  It makes configuration more complex but keeps the
smarts in the speech server.  I'm a big fan of making each controller as
smart as is practical.  OK, you talked me into it ;) .  I'll
implement simple messaging and a smart speech server.
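
A rough sketch of the mapping idea, assuming the server keeps a table from generic sound names to locations and caches downloads locally. `SOUND_MAP`, `resolve_sound`, `cache_path` and `ensure_local` are hypothetical names used for illustration only; nothing here is taken from xAPSpeech.

```python
import os
import urllib.request
from urllib.parse import urlsplit

# Hypothetical server-side table: sound name (no file extension) -> location.
SOUND_MAP = {
    "chime": "ftp://ha.example.org/sounds/chime.wav",
}


def resolve_sound(name, sound_map=SOUND_MAP):
    """Map a generic sound name to its configured location, or None."""
    return sound_map.get(name.strip().lower())


def cache_path(location, cache_dir):
    """Local file path where a downloaded copy of the sound would be kept."""
    filename = os.path.basename(urlsplit(location).path)
    return os.path.join(cache_dir, filename)


def ensure_local(name, cache_dir, sound_map=SOUND_MAP):
    """Return a local path for the named sound, downloading it on first use."""
    location = resolve_sound(name, sound_map)
    if location is None:
        return None  # Unknown name: caller can fall back to a default sound.
    target = cache_path(location, cache_dir)
    if not os.path.exists(target):
        os.makedirs(cache_dir, exist_ok=True)
        urllib.request.urlretrieve(location, target)
    return target
```

With this split the xAP message only ever carries 'chime', and each speech server is free to map that name to a local file, a URL, or nothing at all.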


--- In xAP_developer@xxxxxxx, James <james.traynor@...> wrote:
>
> Hi,
>
> Actually this functionality is included in the latest xAPSpeech.  It
> can play a wav file before speech which can be selected from the xAP
> message as well as a default for all messages.  Looks like I
> completely missed updating the schema doc, sorry!  I have added the
> extra item to the doc on mi4 and xAP wiki, which is just a Sound=
> item in the tts.speak message.  Only thing to note is the sound name
> should refer to the sound without the file extension.
> As you say this kind of thing is best handled in a single message and
> not two.   An xAP message can contain multiple blocks/sections but
> will only ever be about 1 class.
>
> hth
>
> James
>
> On 30 Apr 2009, at 13:57, mstovenour wrote:
>
> >
> >
> > I have an alerting application where I need a TTS application to
> > play a wav file prior to speaking the text. There are multiple wav
> > files, from a simple chime to a siren, that accompany the text; this
> > rules out hard coding the wav file into the TTS application. Clearly
> > I'll need to build both a new client and TTS application. My
> > question is regarding the schema to use.
> >
> > Is there a single schema that can communicate both speech and wav
> > files? If not, do you think I should extend the tts.speak to include
> > attributes for intro audio files or should I create a totally new
> > schema for the combination (e.g. tts.alert)? I think it is a bad
> > idea to try sending separate messages because UDP does not guarantee
> > sequenced delivery and it would require a complicated client that
> > knows to wait for multiple messages from different schemas. As an
> > alternative, is it possible to encapsulate two different schemas in
> > the same message (e.g. tts.speak and Playlist.Track)?
> >
> >
> >
>



