Зарегистрироваться
Восстановить пароль
FAQ по входу

Обработка речи

Требуется помощь в преобразовании раздела Информатика и вычислительная техника

Если Вы компетентны в тематике этого раздела, то Вас, возможно, заинтересует обсуждение планируемых преобразований.

Справочные материалы

Учебно-методические материалы

Студенческие работы

Программное обеспечение

Доверенные пользователи и модераторы раздела

Springer, 2012. — 83 p. The fast pace of the advancement in information and communications technology is reshaping our society and vastly increasing our capabilities for faster learning, higher achievements, and better and wider communication, in addition to more effective and productive collaboration among speech scientists and engineers. One of the important frontiers of...
  • №1
  • 758,60 КБ
  • добавлен
  • изменен
Kluwer, 1993. — 197 p. The need for automatic speech recognition systems to be robust with respect to changes in their acoustical environment has become more widely appreciated in recent years, as more systems are finding their way into practical applications. Although the issue of environmental robustness has received only a small fraction of the attention devoted to speaker...
  • №2
  • 2,84 МБ
  • добавлен
  • изменен
Springer, 2011. — 163 p. Many of the things we think about, actions we take, the way we react to stimuli, generate a feeling or subjective experience, for example, an emotion, or a mood. The generic term used in the twentieth century psychology and philosophy literature to denote such an emotion or mood is an old, Middle English (fourteenth century) word affect. The outward...
  • №3
  • 1,52 МБ
  • добавлен
  • изменен
Pergamon Press, 1976. — 149 p. The study of speech is a multidisciplinary subject, and the topic of this book is no exception. The production of speech is properly the province of the anatomist and the physiologist, but in practice it has been studied mainly by the phonetician with help from the physicist. The sounds of speech have been classified by the phonetician, and...
  • №4
  • 2,09 МБ
  • добавлен
  • изменен
Cambridge University Press, 2004. — 226 p. Although widely employed in image processing, the use of fractal techniques and the fractal dimension for speech characterization and recognition is a relatively new concept, which is now receiving serious attention. This book represents the fruits of research carried out to develop novel fractal-based techniques for speech and audio...
  • №5
  • 5,77 МБ
  • добавлен
  • изменен
München: Lincom Europa, 2005. – 143 p. This monograph describes an experiment in Forensic Speaker Identification, showing how speeches samples from the same speaker can be discriminated from speech from different speakers with acoustic features commonly used in forensic. It also explains what is now considered the legally and logically correct approach to Forensic Speaker...
  • №6
  • 40,27 МБ
  • добавлен
  • изменен
Morgan & Claypool, 2005. — 136 p. Immediately following the Second World War, between 1947 and 1955, several classic papers quantified the fundamentals of human speech information processing and recognition. In 1947 French and Steinberg published their classic study on the articulation index. In 1948 Claude Shannon published his famous work on the theory of information. In 1950...
  • №7
  • 1,43 МБ
  • добавлен
  • изменен
Bradford Вook, 1995. — 549 p. The chapters in this book represent the outcome of a research workshop held at the Park Hotel Fiorelle, Sperlonga, 16- 20 May 1988. Twenty-five participants gathered in this small coastal village in Italy , where the Emperor Tiberius kept a Summer house, to discuss psycholinguistic and computational issues in speech and natural-language processing....
  • №8
  • 9,82 МБ
  • добавлен
  • изменен
Springer, 1999. — 212 p. Automatic speech recognition and processing has received a lot of attention during the last decade. Prototypes for speech-to-speech translation are currently being developed that show first impressive results for this highly complex endeavor. They demonstrate that machines can actually be helpful in communicating information between persons speaking...
  • №9
  • 1,65 МБ
  • добавлен
  • изменен
Springer, 1999. — 212 p. Automatic speech recognition and processing has received a lot of attention during the last decade. Prototypes for speech-to-speech translation are currently being developed that show first impressive results for this highly complex endeavor. They demonstrate that machines can actually be helpful in communicating information between persons speaking...
  • №10
  • 960,81 КБ
  • добавлен
  • изменен
Springer, 2015. — 72 p. This book presents state of art research in speech emotion recognition. Readers are first presented with basic research and applications – gradually more advance information is provided, giving readers comprehensive guidance for classify emotions through speech. Simulated databases are used and results extensively compared, with the features and the...
  • №11
  • 716,82 КБ
  • добавлен
  • изменен
Springer, 1999. — 315 p. This book is intended for researchers who want to keep abreast of current developments in corpus-based natural language processing. It is not meant as an introduction to this field; for readers who need one, several entry-level texts are available, including those of (Church and Mercer, 1993; Charniak, 1993; Jelinek, 1997). This book captures the...
  • №12
  • 5,26 МБ
  • добавлен
  • изменен
Springer, 1999. — 315 p. This book is intended for researchers who want to keep abreast of current developments in corpus-based natural language processing. It is not meant as an introduction to this field; for readers who need one, several entry-level texts are available, including those of (Church and Mercer, 1993; Charniak, 1993; Jelinek, 1997). This book captures the...
  • №13
  • 3,61 МБ
  • добавлен
  • изменен
Springer, 1991. — 376 p. Speech coding has been an ongoing area of research for several decades, yet the level of activity and interest in this area has expanded dramatically in the last several years. Important advances in algorithmic techniques for speech coding have recently emerged and excellent progress has been achieved in producing high quality speech at bit rates as low...
  • №14
  • 10,52 МБ
  • добавлен
  • изменен
Kluwer, 1993. — 267 p. This volume contains 34 chapters, loosely grouped into six topical areas. The chapters in this volume reflect the progress and present the state of the art in low bit rate speech coding primarily at bit rates from 2.4 kbit/s to 16 kbit/s. Together they represent important contributions from leading researchers in the speech coding community. The book...
  • №15
  • 7,52 МБ
  • добавлен
  • изменен
Taylor&Francis, 1993. — 225 p. This text deals with two important technologies in human-computer interaction: computer generation of synthetic speech and computer recognition of human speech. These technologies are quite different and the ergonomics problems in implementation are also different. Nonetheless, synthetic speech and speech recognition are usually dealt with in the...
  • №16
  • 825,89 КБ
  • добавлен
  • изменен
Springer, 2017. — 251 p. This book provides scientific understanding of the most central techniques used in speech coding both for advanced students as well as professionals with a background in speech audio and or digital signal processing. It provides a clear connection between the Why’s?, How’s?, and What’s, such that the necessity, purpose and solutions provided by tools...
  • №17
  • 8,49 МБ
  • добавлен
  • изменен
Springer, 2013. — 74 p. The diagnosis and monitoring of many common neurological conditions routinely involve acoustic analysis of the subject’s speech by an expert clinician. There are two significant problems with this: one is that the analysis is time-consuming, hence expensive, and therefore often performed too infrequently, and the other is that the results of the analysis...
  • №18
  • 761,03 КБ
  • добавлен
  • изменен
Springer, 2004. — 237 p. Spoken dialog systems allow people to get information, conduct business, and be entertained, simply by speaking to a computer. There are hundreds of these systems currently in use, handling millions of interactions every day. How do they work? What problems do they solve? The goal of this book is to answer these questions and others like them, including:...
  • №19
  • 4,00 МБ
  • добавлен
  • изменен
EURASIP Journal on Audio, Speech, and Music Processing, 2009. — 66 p. The aim of this special issue is to provide a detailed description of state-of-the-art systems for animating faces during speech, and identify new techniques that have recently emerged from both the audiovisual speech and computer graphics research communities. This special issue is a followup to the first LIPS...
  • №20
  • 10,61 МБ
  • добавлен
  • изменен
Cambridge: Cambridge University Press, 2012. — 508 p. When we speak, we configure the vocal tract which shapes the visible motions of the face and the patterning of the audible speech acoustics. Similarly, we use these visible and audible behaviors to perceive speech. This book showcases a broad range of research investigating how these two types of signals are used in spoken...
  • №21
  • 9,45 МБ
  • добавлен
  • изменен
Springer, 2005. — 203 p. The goal of this book is to present a discussion of the ideas arising from the European Special Event (ESE) on the Integration of Phonetic Knowledge in Speech Technology at Eurospeech 2001 in Aalborg. Where there is discussion, there must be unresolved questions, doubts must exist, integration is not a fait accompli. The different questions asked, methods...
  • №22
  • 5,68 МБ
  • добавлен
  • изменен
Springer, 2011. — 1029 p. — ISBN 10 0387775919, ISBN 13 978-0387775913 When I was being interviewed at the handwriting recognition group of IBM T.J. Watson Research Center in December of 1990, one of the interviewers asked me why, being a mechanical engineer, I was applying for a position in that group. Well, he was an electrical engineer and somehow was under the impression that...
  • №23
  • 13,74 МБ
  • добавлен
  • изменен
New York: Springer, 2018. — 112 p. This book presents and develops several important concepts of speech enhancement in a simple but rigorous way. Many of the ideas are new; not only do they shed light on this old problem but they also offer valuable tips on how to improve on some well-known conventional approaches. The book unifies all aspects of speech enhancement, from single...
  • №24
  • 988,32 КБ
  • добавлен
  • изменен
Springer, 2011. — 88 p. Signal enhancement is a fundamental topic of signal processing in general and of speech processing in particular [1]. In audio and speech applications such as cell phones, teleconferencing systems, hearing aids, human–machine interfaces, and many others, the microphones installed in these systems always pick up some interferences that contaminate the...
  • №25
  • 478,94 КБ
  • добавлен
  • изменен
Springer, 2012. — 112 p. This work addresses this problem in the short-time Fourier transform (STFT) domain. We divide the general problem into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. The first category deals with the single-channel problem where STFT coefficients at different frames...
  • №26
  • 1,45 МБ
  • добавлен
  • изменен
Springer, 2015. — 113 p. This book is devoted to the study of the problem of speech enhancement whose objective is the recovery of a signal of interest (i.e., speech) from noisy observations.Typically, the recovery process is accomplished by passing the noisy observations through a linear filter (or a linear transformation). Since both the desired speech and undesired noise...
  • №27
  • 1,43 МБ
  • добавлен
  • изменен
Morgan & Claypool, 2011. — 112 p. This book is devoted to the study of the problem of speech enhancement whose objective is the recovery of a signal of interest (i.e., speech) from noisy observations. Typically, the recovery process is accomplished by passing the noisy observations through a linear filter (or a linear transformation). Since both the desired speech and undesired...
  • №28
  • 1,39 МБ
  • добавлен
  • изменен
Springer, 2009. — 235 p. Noise is everywhere and in most applications that are related to audio and speech, such as human-machine interfaces, hands-free communications, voice over IP (VoIP), hearing aids, teleconferencing/telepresence/telecollaboration systems, and so many others, the signal of interest (usually speech) that is picked up by a microphone is generally contaminated...
  • №29
  • 4,26 МБ
  • добавлен
  • изменен
Academic Press, 2014. — 138 p. Speech enhancement is a classical problem in signal processing, yet still largely unsolved. Two of the conventional approaches for solving this problem are linear filtering, like the classical Wiener filter, and subspace methods. These approaches have traditionally been treated as different classes of methods and have been introduced in somewhat...
  • №30
  • 1,95 МБ
  • добавлен
  • изменен
Springer, 2005. — 415 p. We live in a noisy world! In all applications (telecommunications, hands-free communications, recording, human-machine interfaces, etc) that require at least one microphone, the signal of interest is usually contaminated by background noise and reverberation. As a result, the microphone signal has to be "cleaned" with digital signal processing tools before...
  • №31
  • 5,41 МБ
  • добавлен
  • изменен
Springer, 2008. — 1159 p. The achievement of this Springer Handbook is the result of a wonderful journey that started in March 2005 at the 30th International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Two of the editors-in-chief (Benesty and Huang) met in one of the long corridors of the Pennsylvania Convention Center in Philadelphia with Dr Dieter Merkle...
  • №32
  • 18,16 МБ
  • добавлен
  • изменен
Springer, 2014. — 287 p. Second International Conference, SLSP 2014,Grenoble, France, October 14–16, 2014 Proceedings. This volume contains the papers presented at the Second International Conference on Statistical Language and Speech Processing (SLSP 2014), held in Grenoble, France during October 14–16, 2014. SLSP 2014 is the second event in a series to host and promote research...
  • №33
  • 6,44 МБ
  • добавлен
  • изменен
Kluwer, 2000. — 397 p. As the title indicates, "Intonation: Analysis, Modelling and Technology" is a contribution to the study of prosody, with major emphasis on intonation. Intonation and tonal themes are thus the central object of the volume, although temporal and dynamic aspects are also taken into consideration by a good number of papers. Although tonal and prosodic...
  • №34
  • 6,21 МБ
  • добавлен
  • изменен
Springer, 2009. — 228 p. The development of computer and telecommunication technologies led to a revolution in the way that people work and communicate with each other. One of the results is that large amount of information will increasingly be held in a form that is natural for users, as speech in natural language. In the presented work, we investigate the speech signal capture...
  • №35
  • 4,34 МБ
  • добавлен
  • изменен
Kluwer, 1994. — 329 p. This book describes how large multi-layer perceptron networks containing more than 150,000 weights were trained and integrated into a state-of-the-art Hidden Markov Model (HMM) recognizer to provide improved acoustic-phonetic modeling and improved recognition accuracy. The lessons learned along the way form a case study which demonstrates how hybrid systems...
  • №36
  • 4,51 МБ
  • добавлен
  • изменен
Wiesbaden: Springer, 2016. — 148 p. Almut Braun carried out forensic phonetic speaker identification experiments (voice lineups) with 306lay listeners. Blind listeners significantly outperformed sighted listeners when the speech recordings were presented in studio quality. For recordings in mobile phone quality or of whispering voices, blind and sighted listeners achieved similar...
  • №37
  • 7,43 МБ
  • добавлен
  • изменен
MIT Press, 1990. — 854 p. Auditory Scene Analysis addresses the problem of hearing complex auditory environments, using a series of creative analogies to describe the process required of the human auditory system as it analyzes mixtures of sounds to recover descriptions of individual sounds. In a unified and comprehensive way, Bregman establishes a theoretical framework that...
  • №38
  • 4,85 МБ
  • добавлен
  • изменен
Ellis Horwood Limited, 1987. — 282 p. An increased understanding of human speech comprehension is a major goal for research groups working in a number of closely related disciplines. We take the position that genuine advances in our understanding of speech comprehension will be based on explicit computational models of aspects of this process which yield predictions testable using...
  • №39
  • 3,50 МБ
  • добавлен
  • изменен
Springer, 2011. — 200 p. Many existing natural language and spoken language dialogue systems are either very limited in the scope of domain functionality or require a rather cumbersome interaction. With an increasing number of application domains, ranging from unified messaging to trip planning and appointment scheduling, it seems to be obvious that the current interfaces need to...
  • №40
  • 2,41 МБ
  • добавлен
  • изменен
John Wiley, 2007. — 373 p. The Media Resource Control Protocol (MRCP) is a key enabling technology delivering standardised access to advanced media processing resources including speech recognisers and speech synthesisers over IP networks. MRCP leverages Internet and Web technologies such as SIP, HTTP, and XML to deliver an open standard, vendor-independent, and versatile...
  • №41
  • 1,97 МБ
  • добавлен
  • изменен
Kluwer, 1998. — 249 p. This book is a revised version of my doctoral thesis which was submitted in April 1993. The main extension is a chapter on evaluation of the system described in Chapter 8 as this is clearly an issue which was not treated in the original version. This required the collection of data, the development of a concept for diagnostic evaluation of linguistic word...
  • №42
  • 3,59 МБ
  • добавлен
  • изменен
Springer, 2006. — 398 p. There is no question of the value of applying automatic speech recognition technology as one of the interaction tools between humans and different computational systems. There are many books on design standards and guidelines for different practical issues, such as Gibbon's book Handbook of Standards and Resources for Spoken Language System (1997) and...
  • №43
  • 19,07 МБ
  • добавлен
  • изменен
Springer, 2010. — 351 p. In recent years spoken language research has been successful in establishing technology which can be used in various applications, and which has also brought forward novel research topics that advance our understanding of the human speech and communication processes in general. This book got started in order to collect these different trends together, and...
  • №44
  • 3,59 МБ
  • добавлен
  • изменен
Springer, 2007. — 292 p. International Conference on Nonlinear Speech Processing, NOLISP 2007, Paris, France, May 22-25, 2007. Revised Selected Papers. We present in this volume a collection of revised selected papers from the ISCA Tutorial and Research Workshop on Nonlinear Speech Processing (NOLISP 2007) held in Paris, France, 22–25 May, 2007. NOLISP 2007 was organized by the...
  • №45
  • 4,70 МБ
  • добавлен
  • изменен
CRC Press, 2003. — 385 p. Approaches to the problems of designing speech and language processing algorithms for human machine communication used to be taken from the perspectives of linguistics and speech science, until the late 1970s. Due to the advances in computing and statistical modeling, data driven pattern recognition methods have become a fast moving research area during...
  • №46
  • 3,66 МБ
  • добавлен
  • изменен
Kluwer, 1987. — 278 p. It is well-known that phonemes have different acoustic realizations depending on the context. Thus, for example, the phoneme /t/ is typically realized with a heavily aspirated strong burst at beginning of a syllable as in the word Tom, but without a burst at the end of a syllable in a like cat. Variation such as this is often considered to be problematic for...
  • №47
  • 10,05 МБ
  • добавлен
  • изменен
Springer, 1975. — 358 p. Proceedings of the Symposium on Dynamic Aspects of Speech Perception held at 1 P.O., Eindhoven, Netherlands, August 4-6, 1975. The purpose of the Symposium was to provide a meeting place for those working in the field of speech perception, whose main interest is in the study of the perceptual processes in the decoding of connected speech, hence the...
  • №48
  • 7,35 МБ
  • добавлен
  • изменен
Springer, 2010. — 352 p. More and more devices for human-to-human and human-to-machine communications, where sound pickup and rendering is necessary, require some sophisticated algorithms. This is due to the fact that the acoustic environment in which we live in and communicate is extremely challenging. The difficult problems encountered in this environment are very well known and...
  • №49
  • 9,16 МБ
  • добавлен
  • изменен
Springer, 2011. — 267 p. The telephony network broadly changed during the last decades with the intensive introduction of Voice over Internet Protocol (VoIP) technology and third generation mobile networks. These networks enable new transmission paradigms that affect the perceived quality of speech signals. The perceived characteristics of a speech signal transmitted by a VoIP...
  • №50
  • 1,65 МБ
  • добавлен
  • изменен
Kluwer, 2001. — 328 p. Modern speech synthesis began in the 1950s with the development of electronic formant synthesisers, such as PAT (Parametric Artificial Talker) designed by Walter Lawrence in the UK and OVE designed by Gunnar Fant in Sweden. Many others followed and, with the widespread introduction of fast digital computers, became implemented as computer programs. The best...
  • №51
  • 4,65 МБ
  • добавлен
  • изменен
Springer, 2018. — 144 p. This book presents the consolidated acoustic data for all phones in Standard Colloquial Bengali (SCB), commonly known as Bangla, a Bengali language used by 350 million people in India, Bangladesh, and the Bengali diaspora. The book analyzes the real speech of selected native speakers of the Bangla dialect to ensure that a proper acoustical database is...
  • №52
  • 3,08 МБ
  • добавлен
  • изменен
CRC Press, 2002. — 400 p. A wide range of potential sources of noise and distortion can degrade the quality of the speech signal in a communication system. Noise Reduction in Speech Applications explores the effects of these interfering sounds on speech applications and introduces a range of techniques for reducing their influence and enhancing the acceptability, intelligibility,...
  • №53
  • 10,08 МБ
  • дата добавления неизвестна
  • изменен
Plenum Press, 1983. — 505 p. The work reported in this book results from years of research oriented toward the goal of making an experimental model capable of understanding spoken sentences of a natural language. This is, of course, a modest attempt compared to the complexity of the functions performed by the human brain. A method is introduced for conceiving modules performing...
  • №54
  • 7,76 МБ
  • добавлен
  • изменен
Springer, 2013. — 309 pages. First International Conference, SLSP 2013, Tarragona, Spain, July 29-31, 2013 Proceedings. This volume contains the papers presented at the First International Conference on Statistical Language and Speech Processing (SLSP 2013), held in Tarragona, Spain, during July 29–31, 2013. SLSP 2013 was the first event in a series to host and promote research on...
  • №55
  • 3,31 МБ
  • добавлен
  • изменен
Springer, 2013. — 209 p. First International Conference, SLSP 2013, Tarragona, Spain, July 29-31, 2013 Proceedings. This volume contains the papers presented at the First International Conference on Statistical Language and Speech Processing (SLSP 2013), held in Tarragona, Spain, during July 29–31, 2013. SLSP 2013 was the first event in a series to host and promote research on the...
  • №56
  • 2,79 МБ
  • добавлен
  • изменен
Springer, 2016. — 321 p. This volume contains the papers presented at the Third International Conference on Statistical Language and Speech Processing (SLSP 2015), held in Budapest, Hungary, during November 24-26, 2015. SLSP 2015 was the third event in a series to host and promote research on the wide spectrum of statistical methods that are currently in use in computational...
  • №57
  • 5,99 МБ
  • добавлен
  • изменен
John Wiley, 2005. — 273 p. In many situations, the dialogue between two human beings seems to be performed almost effortlessly. However, building a computer program that can converse in such a natural way with a person, on any task and under any environmental conditions, is still a challenge. One reason why is that a large amount of different types of knowledge is involved in...
  • №58
  • 2,71 МБ
  • добавлен
  • изменен
Springer, 2011. — 419 pp. The 3rd International Workshop on Spoken Dialogue Systems (IWSDS2011) was held at Granada, Spain, 1-3 September 2011, as a satellite event of Interspeech 2011. This annual workshop brings together researchers from all over the world working in the field of spoken dialogue systems. It provides an international forum for the presentation of research and...
  • №59
  • 4,85 МБ
  • добавлен
  • изменен
IEEE/Wiley-Interscience, 2000. — 1041 p. Purposes and Scope. The purposes of this book are severalfold. Principally, of course, it is intended to provide the reader with solid fundamental tools and sufficient exposure to the applied technologies to support advanced research and development in the array of speech processing endeavors. As an academic instrument, however, it may also...
  • №60
  • 14,47 МБ
  • добавлен
  • изменен
Morgan & Claypool, 2006. — 118 p. Speech dynamics refer to the temporal characteristics in all stages of the human speech communication process. This speech chain starts with the formation of a linguistic message in a speaker’s brain and ends with the arrival of the message in a listener’s brain. Given the intricacy of the dynamic speech process and its fundamental importance in...
  • №61
  • 1,68 МБ
  • добавлен
  • изменен
Academic Press, 2019. — 199 p. — ISBN 978-0-12-818130-0. This book investigates the utilization of speech analytics across several systems and real-world activities, including sharing data analytics, creating collaboration networks between several participants, and implementing video-conferencing in different application areas. Chapters focus on the latest applications of speech...
  • №62
  • 11,06 МБ
  • добавлен
  • изменен
John Wiley & Sons, Inc., 2013. — 384 p. — 3rd Edition. На англ. языке. Fully updated for the latest speech recognition tools and features, this bestselling guide helps you conquer Dragon NaturallySpeaking and gets you started creating documents, sending e-mail, searching the web, and more using only your voice. You?ll learn Dragon basics like dictation, formatting, and...
  • №63
  • 19,08 МБ
  • добавлен
  • изменен
John Wiley & Sons, Inc., 2013. — 384 p. — 3rd Edition. На англ. языке. Fully updated for the latest speech recognition tools and features, this bestselling guide helps you conquer Dragon NaturallySpeaking and gets you started creating documents, sending e-mail, searching the web, and more using only your voice. You?ll learn Dragon basics like dictation, formatting, and...
  • №64
  • 9,51 МБ
  • добавлен
  • изменен
Kluwer, 2005. — 327 p. There is a serious problem in the recognition of sounds. It derives from the fact that they do not usually occur in isolation but in an environment in which a number of sound sources (voices, traffic, footsteps, music on the radio, and so on) are active at the same time. When these sounds arrive at the ear of the listener, the complex pressure waves coming...
  • №65
  • 14,23 МБ
  • добавлен
  • изменен
IOS Press, 2006. — 389 p. That speech is a dynamic process strikes as a tautology: whether from the standpoint of the talker, the listener, or the engineer, speech is an action, a sound, or a signal continuously changing in time. Yet, because phonetics and speech science are offspring of classical phonology, speech has been viewed as a sequence of discrete events-positions of the...
  • №66
  • 4,46 МБ
  • добавлен
  • изменен
Springer, 2017. — 77. In the few last years, we saw the rise of practical speech recognition applications, which work well in English and a few other languages. There is no doubt that this trend will continue and a more natural interaction between humans and technology will become part of our lives. Language is one of the most important components of one’s culture and...
  • №67
  • 1,15 МБ
  • добавлен
  • изменен
Springer, 2013. — 225 p. 6th International Conference, NOLISP 2013, Mons, Belgium, June 19-21, 2013 Proceedings. NOLISP, an ISCA tutorial and workshop on non-linear speech processing, is a biannual event whose aim is to present and discuss new ideas, techniques, and results related to alternative approaches in speech processing that may depart from the mainstream. In order to work...
  • №68
  • 2,95 МБ
  • добавлен
  • изменен
Springer, 1997. — 306 p. The field of speech synthesis has secn a large increase in commercial applications in the last ten years. As recently as 1986, there were only a few companies in the synthesis market, all exploiting one of two basic technologies-either formant-based phonemic synthesis or LPC-based diphone synthesis. While these approaches still form the basis of most...
  • №69
  • 6,75 МБ
  • добавлен
  • изменен
Издательство Springer, 2008, -305 pp. This book has its point of departure in courses held at the Tenth European Language and Speech Network (ELSNET) Summer School on Language and Speech Communication which took place at NISLab in Odense, Denmark, in July 2002. The topic of the summer school was Evaluation and Assessment of Text and Speech Systems. Nine (groups of) lecturers...
  • №70
  • 3,22 МБ
  • добавлен
  • изменен
Springer, 2008. — 338 p. — (Text, Speech and Language Technology Series 39). This book edition highlights recent trends and important issues that still remain only partially solved or even unsolved within the broad field of discourse and dialogue. The field is discussed and illustrated both from an overall spoken (multimodal) dialogue system perspective as well as from a more...
  • №71
  • 2,08 МБ
  • добавлен
  • изменен
CMP Books, 2001. — 338 p. In the summer of 2000, I came across the VoiceXML 1.0 standard published by the VoiceXML Forum. I downloaded the specification and began to read it. I had been working on software development in computer telephony for more than 10 years, but I was completely baffled; I couldn't understand most of the specification. I had no idea what the motivation or...
  • №72
  • 2,85 МБ
  • добавлен
  • изменен
Springer, 2012. — 120 p. This book describes novel approaches to improve automatic speech recognition for dialectal Arabic. Since the existing dialectal Arabic speech resources, that are available for the task of training speech recognition systems, are very sparse and are lacking quality, we describe how existing Modern Standard Arabic (MSA) speech resources can be applied to...
  • №73
  • 1,22 МБ
  • добавлен
  • изменен
Springer, 2011. — 137 p. I know what you are asking yourself – there are a lot of books available in speech processing, what is novel in this book? Well, I can summarize the answer for this question in the following points: You always see different algorithms for speech enhancement, deconvolution, signal separation, watermarking, and encryption, separately, without specific...
  • №74
  • 3,74 МБ
  • добавлен
  • изменен
John Wiley, 2013. — 355 p. This book came about as a result of the standing-room-only special session on crowdsourcing for speech processing at Interspeech 2011. There has been a great amount of interest in this new technique as a means to solve some persistent issues. Some researchers dived in head first and have been using crowdsourcing for a few years by now. Others waited to...
  • №75
  • 3,03 МБ
  • добавлен
  • изменен
Springer, 2016. — 288 p. This volume brings together through a peer-revision process advanced research results obtained on nonlinear speech processing, following the tradition initiated by the European COST Action 277: “Nonlinear Speech Processing” (http://www.cost. eu/COST_Actions/ict/277). The research published in this book was discussed for the first time at the 7th edition of...
  • №76
  • 6,42 МБ
  • добавлен
  • изменен
Springer, 2014. — 53 p. As the wavelets gain wide applications in different fields, especially within the signal processing realm, this chapter will provide a survey on widespread employing of wavelets analysis in different applications of speech processing. Many speech processing algorithms and techniques still lack some sort of robustness which can be improved through the use of...
  • №77
  • 946,21 КБ
  • добавлен
  • изменен
NY: Springer International Publishing, 2014. — 53 p. This book provides a survey on wide-spread of employing wavelets analysis in different applications of speech processing. The author examines development and research in different applications of speech processing. The book also summarizes the state of the art research on wavelet in speech processing.
  • №78
  • 1,19 МБ
  • добавлен
  • изменен
2nd Ed. — Springer, 2017. — 96 p. — (SpringerBriefs in Electrical and Computer Engineering). — ISBN 10 3319690019, 13 978-3319690018. This new edition provides an updated and enhanced survey on employing wavelets analysis in an array of applications of speech processing. The author presents updated developments in topics such as; speech enhancement, noise suppression, spectral...
  • №79
  • 2,71 МБ
  • добавлен
  • изменен
2nd Ed. — Springer, 2017. — 115 p. — (SpringerBriefs in Electrical and Computer Engineering). — ISBN 10 3319690019, 13 978-3319690018. This new edition provides an updated and enhanced survey on employing wavelets analysis in an array of applications of speech processing. The author presents updated developments in topics such as; speech enhancement, noise suppression, spectral...
  • №80
  • 1,11 МБ
  • добавлен
  • изменен
Springer, 2005. — 292 p. International Conference on Non-Linear Speech Processing, NOLISP 2005, Barcelona, Spain, April 19-22, 2005. Revised Selected Papers. We present in this volume the collection of finally accepted papers of NOLISP 2005 conference. It has been the third event in a series of events related to Nonlinear speech processing, in the framework of the European COST...
  • №81
  • 4,15 МБ
  • добавлен
  • изменен
2. Auflage. — Springer Vieweg, 2013. — xv, 398 S. — ISBN 978-3-642-31502-2, ISBN 978-3-642-31503-9. Klassiker der Sprachverarbeitung auf dem neuesten Stand der Technik, der neben theoretischen Grundlagen stets auch den Anwendungsbezug herstellt Mit neuen Kapiteln zu den Grundzügen der Signalanalyse sowie Sprachdialogsystemen Elektronisches Zusatzmaterial steht auf...
  • №82
  • 13,98 МБ
  • добавлен
  • изменен
Springer, 1972. — 446 p. Второе, дополненное издание монографии Джеймса Флэнагана "Анализ, синтез и восприятие речи" (первое издание, 1965 года, было переведено на русский в 1968 году издательством "Связь") Для изучающих обработку речевых сигналов.
  • №83
  • 13,68 МБ
  • добавлен
  • изменен
Springer, 2017. — 109 p . Speech communication assumes a dominant role in how we communicate, and it is nowadays available to support interaction with machines in a wide range of scenarios, ranging from personal assistants for smartphones to home entertainment. While in many circumstances audible speech may suffice, there are a multitude of scenarios for which it is inadequate due...
  • №84
  • 2,98 МБ
  • добавлен
  • изменен
Cambridge: Cambridge University Press, 2012. - 155 p. The mechanism of speech is a very complex one and in order to undertake any analysis of language it is important to understand the processes that go to make up the message that a speaker transmits and a listener receives. Professor Fry therefore first takes the reader through the various stages of the speech chain: from...
  • №85
  • 6,30 МБ
  • добавлен
  • изменен
Springer, 2011. — 221 p. The analysis and measurement of the spectrum of a speech signal is one of the most important areas of sound signal processing for a number of fields, yet it is not an area to which a book has been specifically devoted. The accurate determination of the speech spectrum is commonly pursued in diverse areas including speech processing, recognition, and...
  • №86
  • 5,16 МБ
  • добавлен
  • изменен
A study of digital speech processing, synthesis and recognition. This edition contains sections on the international standardization of robust and flexible speech coding techniques, waveform unit concatenation-based speech synthesis, large vocabulary continuous-speech recognition based on statistical pattern recognition, and more.
  • №87
  • 2,40 МБ
  • дата добавления неизвестна
  • изменен
Second Edition, Revised and Expanded. — Marcel Dekker, 2001. — 477 p. More than a decade has passed since the first edition of Digital Speech Processing, Synthesis, and Recognition was published. The book has been widely used throughout the world as both a textbook and a reference work. The clear need for such a book stems from the fact that speech is the most natural form of...
  • №88
  • 4,87 МБ
  • добавлен
  • изменен
Презентация доклада. 43 стр. Содержание/Outline Fundamentals of automatic speech recognition Acoustic modeling Language modeling Database (corpus) and task evaluation Transcription and dialogue systems Spontaneous speech recognition Speech understanding Speech summarization Summary (Annotation) Speech recognition technology has made significant progress...
  • №89
  • 1,19 МБ
  • добавлен
  • изменен
Marcel Dekker, 1992. — 871 p. This book originated in an invitation from Marcel Dekker, Inc., to put together a book of original articles on various aspects of speech signal processing. After discussing the possible scope of such a book with several of our colleagues, we decided that the chapters should stress the advances during the past five to ten years. The past decade has...
  • №90
  • 7,13 МБ
  • добавлен
  • изменен
NOWPress, 2007. — 24 p. — (Foundations and Trends in Signal Processing). Hidden Markov Models (HMMs) provide a simple and effective framework for modelling time-varying spectral vector sequences. As a consequence, almost all present day large vocabulary continuous speech recognition (LVCSR) systems are based on HMMs. Whereas the basic principles underlying HMM-based LVCSR are...
  • №91
  • 707,27 КБ
  • добавлен
  • изменен
Springer, 2011. — 125 p. The preparation of the present brief book was motivated by the significant and long-standing interest of the speech processing community to short-time cepstrum-based parameterization of speech. In approximately 100 pages, this volume brings together relevant information about 11 speech parameterization techniques and some of their variants that emerged...
  • №92
  • 2,24 МБ
  • добавлен
  • изменен
Springer, 2008. — 483 p. Years ago when speech technology was younger, the designers of telephony-based speech recognition applications discovered something interesting. If human factors design, now often called user interface design, is applied to the prompts and flow of these applications, the result is improved system performance. Previously, nearly the only path of performance...
  • №93
  • 3,20 МБ
  • добавлен
  • изменен
CRC Press, 2000. — 247 p. Всеобъемлющее описание алгоритмов и методов кодирования речи. Детали реализации этих алгоритмов в распространенных речевых кодеках. Introduction Speech Production The Speech Chain Articulation Excitation Vocal Tract Phonemes Source-Filter Model Speech Analysis Techniques Sampling the Speech Waveform Systems and Filtering Z-Transform...
  • №94
  • 4,23 МБ
  • дата добавления неизвестна
  • изменен
Springer, 2014. — 188 p. The most of the applications of digital speech processing deal with speech or speaker pattern recognition. To understand the practical implementation of the speech or speaker recognition techniques, there is the need to understand the concepts of digital speech processing and the pattern recognition. This book aims in giving the balanced treatment of both...
  • №95
  • 9,37 МБ
  • добавлен
  • изменен
Springer, 2002. — 134 p. Speech recognition technology is being increasingly employed in humanmachine interfaces. Two of the key problems affecting such technology, however, are its robustness across different speakers and robustness to non-native accents, both of which still create considerable difficulties for current systems. In this book methods to overcome these problems are...
  • №96
  • 1,10 МБ
  • добавлен
  • изменен
Now Publishers, 2010. — 152 p. — (Foundations and Trends in Signal Processing). In December 1974 the first real-time conversation on the ARPAnet took place between Culler-Harrison Incorporated in Goleta, California, and MIT Lincoln Laboratory in Lexington, Massachusetts. This was the first successful application of real-time digital speech communication over a packet network and...
  • №97
  • 8,74 МБ
  • дата добавления неизвестна
  • изменен
Springer, 2004. — 487 p. Springer Handbook of Auditory Research. Volume 18 Although our sense of hearing is exploited for many ends, its communicative function stands paramount in our daily lives. Humans are, by nature, a vocal species and it is perhaps not too much of an exaggeration to state that what makes us unique in the animal kingdom is our ability to communicate via the...
  • №98
  • 2,77 МБ
  • добавлен
  • изменен
InTech, 2007. — 470 p. Digital speech processing is a major field in current research all over the world. In particular for automatic speech recognition (ASR). Very significant achievements have been made since the first attempts of digit recognizers in the 1950’s and 1960’s when spectral resonances were determined by analogue filters and logical circuits. As prof. Furui...
  • №99
  • 9,21 МБ
  • добавлен
  • изменен
Springer, 2011. — 125 p. Automatic speech recognition systems are increasingly applied for modern communication. One example are call centers, where speech recognition based systems provide information or help sorting customer queries in order to forward them to the according experts. The big advantage of those systems is that the computers can be online 24 h a day to process...
  • №100
  • 1,18 МБ
  • добавлен
  • изменен
Kluwer, 1990. — 454 p. Speech sound production is one of the most complex human activities: it is also one of the least well understood. This is perhaps not altogether surprising as many of the complex neurological and physiological processes involved in the generation and execution of a speech utterance remain relatively inaccessible to direct investigation, and must be inferred...
  • №101
  • 7,88 МБ
  • добавлен
  • изменен
McGraw-Hill, 2003. — 338 p. The focus of this book is the narrow question of how to assess quality of packet-switched voice services in general and VoIP services in particular. The approach taken in answering this vexing question is one that I have exploited to very good effect in more than 35 years’ working in the general area of test and evaluation of telecommunications systems....
  • №102
  • 1,84 МБ
  • добавлен
  • изменен
Blackwell, 2010. — 279 p. In undergraduate courses that include phonetics, students typically acquire skills both in ear-training and an understanding of the acoustic, physiological, and perceptual characteristics of speech sounds. But there is usually less opportunity to test this knowledge on sizeable quantities of speech data partly because putting together any database that is...
  • №103
  • 22,44 МБ
  • добавлен
  • изменен
Kluwer, 1999. — 328 p. This book is the development of a series of lectures to undergraduate and postgraduate students at Macquarie University on basic principles in acoustic phonetics and speech signal processing. The first part of the book (Chapters 1 to 4) is intended to provide students with the ability to interpret acoustic records of speech signals in their various forms....
  • №104
  • 5,96 МБ
  • добавлен
  • изменен
O'Reilly Media, Inc., 2013. — 242 p. Go under the hood of an operating Voice over IP network, and build your knowledge of the protocols and architectures used by this Internet telephony technology. With this concise guide, you’ll learn about services involved in VoIP and get a first-hand view of network data packets from the time the phones boot through calls and subsequent...
  • №105
  • 13,87 МБ
  • добавлен
  • изменен
O'Reilly Media, 2013. — 242 p. Go under the hood of an operating Voice over IP network, and build your knowledge of the protocols and architectures used by this Internet telephony technology. With this concise guide, you’ll learn about services involved in VoIP and get a first-hand view of network data packets from the time the phones boot through calls and subsequent connection...
  • №106
  • 24,71 МБ
  • добавлен
  • изменен
Morgan & Claypool, 2008. — 121 p. In this book, we introduce the background and mainstream methods of probabilistic modeling and discriminative parameter optimization for speech recognition. The specific models treated in depth include the widely used exponential-family distributions and the hidden Markov model. A detailed study is presented on unifying the common objective...
  • №107
  • 2,87 МБ
  • добавлен
  • изменен
Speech Repairs, Intonational Boundaries and Discourse Markers: Modeling Speakers’ Utterances in Spoken Dialog by Peter Anthony Heeman University of Rochester, Rochester, New York. 1997 Abstract Interactive spoken dialog provides many new challenges for natural language understanding systems. One of the most critical challenges is simply determining the speaker’s...
  • №108
  • 849,23 КБ
  • добавлен
  • изменен
Springer, 2013. — 227 p. One of the main reasons for the complexity of spoken dialogue systems (SDSs) development constitutes the multi-domain and thus the multi-topic nature of reallife processes. If the application domain is not clearly defined collecting a corpus or establishing valid rules to control the dialogue flow of the SDS becomes a complex task. Within the framework of...
  • №109
  • 2,35 МБ
  • добавлен
  • изменен
Springer, 2013. — 301 p. The book covers a wide range of disciplines related to speech and language and vocal communication in animals. In Part I, the first chapter deals with the current state of understanding of the neurology of speech and language in terms of brain substrates, representation, and theoretical models. The second chapter is a review of what is known about the...
  • №110
  • 4,05 МБ
  • добавлен
  • изменен
Springer, 2008. — 445 p. Cost reduction is of increasing importance for medium and large enterprises. Seen in this context, Interactive Voice Response (IVR) systems are becoming more and more significant. IVR systems can help to automate business processes as for example in call centers, which are now a growing market for IVR systems. Automatic speech recognition (ASR) is the key...
  • №111
  • 1,92 МБ
  • добавлен
  • изменен
Springer, 2011. — 185 p. A self-learning speech controlled system has been developed for unsupervised speaker identification and speech recognition. The benefits of a speech controlled device which identifies its main users by their voice characteristics are obvious: The human-computer interface may be personalized. New ways for interacting with a speech controlled system may be...
  • №112
  • 1,61 МБ
  • добавлен
  • изменен
Springer, 1983. — 713 p. Pitch (i.e., fundamental frequency F 0 and fundamental period T 0 ) occupies a key position in the acoustic speech signal. The prosodic information of an utterance is predominantly determined by this parameter. The ear is more sensitive to changes of fundamental frequency than to changes of other speech signal parameters by an order of magnitude. The...
  • №113
  • 12,93 МБ
  • добавлен
  • изменен
Springer, 2017. — 170 p. Text-to-Speech (TTS) synthesis, i.e., artificially produced speech, has finally attained a quality level that makes it possible to include it into ordinary services that are used by common people. With the increasing processing power of smartphones and the development of intelligent personal assistants like Siri, Cortana, and Google Now, synthetic speech...
  • №114
  • 3,05 МБ
  • добавлен
  • изменен
Springer, 2015. — 212 p. The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions). A high level of quality has already been achieved in speech synthesis by using selection-based methods...
  • №115
  • 4,84 МБ
  • добавлен
  • изменен
Springer, 2012. — 109 p. Speech production and perception, man’s most widely used means of communication, has been the subject of research and intense study for more than 10 decades. Conventional theories of speech production are based on linearization of pressure and volume velocity relations and the speech production system is modeled as a linear source-filter model. This...
  • №116
  • 1,32 МБ
  • добавлен
  • изменен
2nd edition. — Taylor & Francis, 2001. — 317 p. As information technology continues to make more impact on many aspects of our daily lives, the problems of communication between human beings and informationprocessing machines become increasingly important. Up to now such communication has been almost entirely by means of keyboards and screens, but there are substantial...
  • №117
  • 2,42 МБ
  • добавлен
  • изменен
Kluwer, 2000. — 359 p. The study of prosody is perhaps the area of speech research which has undergone the most noticeable development during the past ten to fifteen years. As an indication of this, one can note, for example, that at the latest International Conference on Spoken Language Processing in Philadelphia (October 1996), there were more sessions devoted to prosody than to...
  • №118
  • 5,60 МБ
  • добавлен
  • изменен
Prentice Hall, 2001. — 965 p. Recognition and understanding of spontaneous unrehearsed speech remains an elusive goal. To understand speech, a human considers not only the specific information conveyed to the ear, but also the context in which the information is being discussed. For this reason, people can understand spoken language even when the speech signal is corrupted by...
  • №119
  • 9,62 МБ
  • добавлен
  • изменен
Kluwer, 1992. — 254 p. After almost three scores of years of basic and applied research, the field of speech processing is, at present, undergoing a rapid growth in terms of both performance and applications and this is fuelled by the advances being made in the areas of microelectronics, computation and algorithm design. Speech processing relates to three aspects of voice...
  • №120
  • 3,78 МБ
  • добавлен
  • изменен
InTech, 2011. — 442 p. The book Speech Technologies addresses different aspects of the research field and a wide range of topics in speech signal processing, speech recognition and language processing. The chapters are divided in three different sections: Speech Signal Modeling, Speech Recognition and Applications. The chapters in the first section cover some essential topics...
  • №121
  • 25,54 МБ
  • добавлен
  • изменен
Springer, 2010. — 187 p. The idea for this book was formed during the doctorate of Bernd Iser. Bernd Iser was working on efficient and robust bandwidth extension algorithms in hands-free systems for Harman/Becker Automotive Systems. It turned out that bandwidth extension of speech signals was a topic of appreciable interest, where lots of scientific publications discussing...
  • №122
  • 7,11 МБ
  • добавлен
  • изменен
The Distinctive Features and their Correlates The M-l-T Press, 1952. - 74 p. This report proposes some questions to be discussed by specialists working on various aspects of speech communication. These questions concern the ultimate discrete components of language, their specific structure, their inventory in the languages of the world, their identification on the acoustical...
  • №123
  • 1,32 МБ
  • добавлен
  • изменен
Диссертация, Cambridge University, 1995. — 157 p. The research presented in this thesis addresses the topic of ad hoc retrieval of information from collections of spoken items such as radio news bulletins. Modern digital computers are becoming increasingly adept at processing nontextual data, such as speech. Consequently, new methods are required to allow users to pin-point...
  • №124
  • 1,04 МБ
  • добавлен
  • изменен
Springer, 2005. — 207 p. As part of the steady progress being made in the field of information and telecommunication techniques, voice and speech quality assessment of systems has gained in importance over the last years. An engineering approach to voice and speech quality of systems includes the consideration of how a system is perceived by its users, and how the needs and...
  • №125
  • 1,17 МБ
  • добавлен
  • изменен
L.: A Bradford Book, 1998. - 305p. This book reflects decades of important research on the mathematical foundations of speech recognition. It focuses on underlying statistical techniques such as hidden Markov models, decision trees, the expectation-maximization algorithm, information theoretic goodness criteria, maximum entropy probability estimation, parameter and data...
  • №126
  • 2,06 МБ
  • добавлен
  • изменен
Springer, 2004. — 292 p. The importance of speech and language technologies continues to grow as information, and information needs, pervade every aspect of our lives and every corner of the globe. Speech and language technologies are used to automatically transcribe, analyze, route and extract information from highvolume streams of spoken and written information. Equally...
  • №127
  • 5,85 МБ
  • добавлен
  • изменен
John Wiley, 2009. — 181 p. State-of-the-art speech and language technology has reached a level that allows us to build interactive applications which the users can have short conversations with in order to search for information. We are already dealing with electronic banking facilities, information providing systems, restaurant guides, timetable services, assisting translation...
  • №128
  • 1,09 МБ
  • добавлен
  • изменен
Morgan & Claypool, 2010. — 167 p. Considerable progress has been made in recent years in the development of dialogue systems that support robust and efficient human–machine interaction using spoken language. Spoken dialogue technology allows various interactive applications to be built and used for practical purposes, and research focuses on issues that aim to increase the...
  • №129
  • 1,81 МБ
  • добавлен
  • изменен
Kluwer, 2002. — 193 p. As the performance of speaker-independent continuous speech recognition has improved over the last decade, increasing attention has been given to the poor recognition performance obtained for some speakers, noisy conditions and environments where the quality and the type of the communication channel is unknown. At the same time an increasing number of...
  • №130
  • 11,31 МБ
  • добавлен
  • изменен
Kluwer, 2001. — 277 p. Consider a computer system that you can talk to using ordinary speech (either directly or perhaps using your telephone), and that you can ask questions concerning such things as timetables for public transportation. For example, you might ask the system the departure time of a train from Brussels to Amsterdam, specifying that you wish to arrive in Amsterdam...
  • №131
  • 4,09 МБ
  • добавлен
  • изменен
Draft, 2nd edition: Prentice Hall, 2008 — 1024 p. An explosion of Web-based language techniques, merging of distinct fields, availability of phone-based dialogue systems, and much more make this an exciting time in speech and language processing. The first of its kind to thoroughly cover language technology – at all levels and with all modern technologies – this book takes an...
  • №132
  • 18,89 МБ
  • добавлен
  • изменен
Arunachal Pradesh: Technical and Scientific Publisher, 2017. — 11 p. Speech editing is nothing more than moving about some arrays of numbers. Enhancement filters can be used to remove both natural and intentional noise, to a reasonable extent. And pitch and formant analysis can be used to give a general idea of whether two speakers are the same person or not. There are also other...
  • №133
  • 473,64 КБ
  • добавлен
  • изменен
Springer, 2017. — 845 p. — (Lecture Notes in Computer Science). — ISBN 10 331966428X, 13 978-3319664286. This book constitutes the proceedings of the 19th International Conference on Speech and Computer, SPECOM 2017, held in Hatfield, UK, in September 2017. The 80 papers presented in this volume were carefully reviewed and selected from 150 submissions. The papers present current...
  • №134
  • 66,37 МБ
  • добавлен
  • изменен
John Wiley, 2002. — 407 p. Making machines speak like humans is a dream that is slowly coming to fruition. When the first automatic computer voices emerged from their laboratories twenty years ago, their robotic sound quality severely curtailed their general use. But now after a long period of maturation, synthetic speech is beginning to reach an initial level of acceptability....
  • №135
  • 2,67 МБ
  • добавлен
  • изменен
John Wiley, 2003. — 222 p. In general, voice transmission over the Internet protocol (IP), or VoIP, means transmission of real-time voice signals and associated call control information over an IP-based (public or private) network. The term IP telephony is commonly used to specify delivery of a superset of the advanced public switched telephone network (PSTN) services using IP...
  • №136
  • 5,71 МБ
  • добавлен
  • изменен
Springer, 2011. — 387 p. Automatic speech recognition suffers from a lack of robustness with respect to noise, reverberation and interfering speech. The growing field of speech recognition in the presence of missing or uncertain input data seeks to ameliorate those problems by using not only a preprocessed speech signal but also an estimate of its reliability, to selectively focus...
  • №137
  • 4,52 МБ
  • добавлен
  • изменен
Springer, 1997. — 367 p. Speech technology, the automatic processing of (spontaneously) spoken words and utterances, now is known to be technically feasible and will become the major tool for handling the confusion of languages. The economic implications of this tool are obvious, in particular in the multilingual European Union. Potential and current applications are dictation...
  • №138
  • 7,09 МБ
  • добавлен
  • изменен
John Wiley, 2015. — 583 p. Emotion represents a psychological state of the human mind. Researchers from different domains have diverse opinions about the developmental process of emotion. Philosophers believe that emotion originates as a result of substantial (positive or negative) changes in our personal situations or environment. Biologists, however, consider our nervous and...
  • №139
  • 4,55 МБ
  • добавлен
  • изменен
Springer, 2012. — 161 p. This book came out of approximately ten years of continuing research at Yamagata University. With the emergence of numerous algorithms for a variety of speech processing applications, such as coding, enhancement, and synthesis, a variety of distortion can now be observed. These disturbances degrade the speech quality in an unexpected manner. For example,...
  • №140
  • 11,21 МБ
  • добавлен
  • изменен
Second Edition — John Wiley &Sons Ltd, 2004. — 459 p. This Second Edition continues to provide the fundamental technical background required for low bit rate speech coding and the hottest developments in digital speech coding techniques that are applicable to evolving communication systems. Features new chapters on Pitch Estimation and Voice-Unvoiced Classification of Speech,...
  • №141
  • 9,44 МБ
  • дата добавления неизвестна
  • изменен
Springer, 2015. — 87 p. Voice-based call centers or business process outsourcing units generate huge amounts of speech data everyday during their day-to-day operations. Large and diverse types of information are hidden in these natural language conversations, which is begging to be exploited. The whole area of voice analytics deals with the aspect of deriving usable information...
  • №142
  • 2,29 МБ
  • добавлен
  • изменен
Addison Wesley, 2003. — 155 p. Most people have experienced an automated speech-recognition system when calling a company. Instead of prompting callers to choose an option by entering numbers, the system asks questions and understands spoken responses. With a more advanced application, callers may feel as if they're having a conversation with another person. Not only will the...
  • №143
  • 888,10 КБ
  • добавлен
  • изменен
Springer, 2017. — 233 p. — ISBN 3319536117. This book focuses on speech signal phenomena, presenting a robustification of the usual speech generation models with regard to the presumed types of excitation signals, which is equivalent to the introduction of a class of nonlinear models and the corresponding criterion functions for parameter estimation. Compared to the general class...
  • №144
  • 4,97 МБ
  • добавлен
  • изменен
Springer, 2019. — 282 p. — ISBN 978-3-030-15852-1. This book explores the processes of spoken language production and perception from a neurobiological perspective. After presenting the basics of speech processing and speech acquisition, a neurobiologically-inspired and computer-implemented neural model is described, which simulates the neural processes of speech processing and...
  • №145
  • 13,58 МБ
  • добавлен
  • изменен
Springer, 2013. — 134 p. During production of speech human beings impose emotional cues on the sequence of sound units to convey the intended message. Speech without emotional information is unnatural and monotonous. Most of the existing speech systems are able to process studio recorded neutral speech. However, in the present real world communication scenario, speech systems...
  • №146
  • 1,46 МБ
  • добавлен
  • изменен
Springer, 2016. — 126 p. Speech enhancement is incorporated as an essential component in all voice communication devices to improve their performance in noisy environments. Speech enhancement is an important issue for mobile phones, hands-free telephones and also for hearing aids. It has been a challenging problem for researchers to develop new enhancement algorithms that...
  • №147
  • 3,00 МБ
  • добавлен
  • изменен
ISTE/John Wiley, 2013. — 221 p. The preparation of this book was carried out while preparing an accreditation to supervise research. This is a synthesis covering the past 10 years of research, since my doctorate [LAN 04], in the field of man–machine dialogue. The goal here is to outline the theories, methods, techniques and challenges involved in the design of computer programs...
  • №148
  • 911,29 КБ
  • добавлен
  • изменен
Kluwer, 1996. — 524 p. The term speech and speaker recognition often refers to the science and technology of developing algorithms and implementing them on machines to recognize the linguistic content in a spoken utterance and to identify the talker who speaks the utterance. Since speech is the most natural means of communication among human beings, it also plays a key role in the...
  • №149
  • 7,84 МБ
  • добавлен
  • изменен
World Scientific, 2007. — 563 p. It is generally agreed that speech will play a major role in defining next-generation human-machine interfaces because it is the most natural means of communication among humans. To push forward this vision, speech research has enjoyed a long and glorious history spanning the entire twentieth century. As a result in the last three decades we have...
  • №150
  • 15,70 МБ
  • добавлен
  • изменен
Springer, 1989. — 216 p. Speech Recognition has a long history of being one of the difficult problems in Artificial Intelligence and Computer Science. As one goes from problem solving tasks such as puzzles and chess to perceptual tasks such as speech and vision, the problem characteristics change dramatically: knowledge poor to knowledge rich; low data rates to high data rates;...
  • №151
  • 3,17 МБ
  • добавлен
  • изменен
Springer, 2012. — 184 p. Data driven methods have long been used in Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) synthesis and have more recently been introduced for dialogue management, spoken language understanding, and Natural Language Generation. Machine learning is now present end-to-end in Spoken Dialogue Systems (SDS). However, these techniques require data...
  • №152
  • 1,55 МБ
  • добавлен
  • изменен
Springer, 2015. — 250 p. This book addresses the subject of emotional speech, especially its encoding and decoding process during interactive communication, based on an improved version of Brunswik’s Lens Model. The process is shown to be influenced by the speaker’s and the listener’s linguistic and cultural backgrounds, as well as by the transmission channels used. Through...
  • №153
  • 5,57 МБ
  • добавлен
  • изменен
Academic Press, 2016. — 303 p. Robust Automatic Speech Recognition: A Bridge to Practical Applications establishes a solid foundation for automatic speech recognition that is robust against acoustic environmental distortion. It provides a thorough overview of classical and modern noise-and reverberation robust techniques that have been developed over the past thirty years, with an...
  • №154
  • 4,08 МБ
  • добавлен
  • изменен
Springer, 2012. — 264 p. This book is organized by research topic. Each chapter focuses on a major topic and can be read independently. Each chapter contains advanced algorithms along with real speech examples and evaluation results to validate the usefulness of the selected topics. Special attention has been given to the topics related to improving overall system robustness and...
  • №155
  • 4,53 МБ
  • добавлен
  • изменен
IGI Global, 2009. — 573 p. It has been widely accepted that speech perception is a multimodal process and involves information from more than one sensory modality. The famous McGurk effect [McGurk and MacDonald, Nature 264(5588): 746–748, 1976] shows that visual articulatory information is integrated into our perception of speech automatically and unconsciously. For example, a...
  • №156
  • 117,59 МБ
  • добавлен
  • изменен
N.-Y.: CRC Press, 2013. — 705 p. This text is, in part, an outgrowth of graduate course on speech signal processing at the University of Texas at Dallas since the fall of 1999. The fact that no textbook existed at the time on speech enhancement, other than a few edited books suitable for the experts, made it difficult to teach the fundamental principles of speech enhancement in...
  • №157
  • 17,51 МБ
  • добавлен
  • изменен
Springer, 2007. — 438 p. We are surrounded by sounds. Such a noisy environment makes it difficult to obtain desired speech and it is difficult to converse comfortably there. This makes it important to be able to separate and extract a target speech signal from noisy observations for both man–machine and human–human communication. Blind source separation (BSS) is an approach for...
  • №158
  • 13,07 МБ
  • добавлен
  • изменен
ISTE/John Wiley, 2009. — 505 p. This book, entitled Spoken Language Processing, addresses all the aspects covering the automatic processing of spoken language: how to automate its production and perception, how to synthesize and understand it. It calls for existing know-how in the field of signal processing, pattern recognition, stochastic modeling, computational linguistics,...
  • №159
  • 4,23 МБ
  • добавлен
  • изменен
Springer, 1976. — 300 p. During the past ten years a new area in speech processing, generally referred to as linear prediction, has evolved. As with all scientific research, results did not always get published in a logical order and terminology was not always consistent. In mid-1974, we decided to begin an extra hours and weekends project of organizing the literature in linear...
  • №160
  • 4,80 МБ
  • добавлен
  • изменен
John Wiley, 2008. — 555 p. When the book Digital Speech Transmission – Enhancement, Coding and Error Concealment by Peter Vary and Rainer Martin appeared in 2006, it was clear that a subject of this importance and this range could not be treated in all its details on 600-some pages. Important aspects had to be left out and had to be postponed to a succeeding volume. The...
  • №161
  • 7,97 МБ
  • добавлен
  • изменен
Springer, 2012. — 70 p. Human beings recognize speaker, language and speech using multiple cues present in speech signal and evidences are combined to arrive at a decision. Humans use several prosodic cues for these recognition tasks. But conventional automatic speaker, language and speech recognition systems mostly rely on spectral/cepstral features which are affected by channel...
  • №162
  • 1,47 МБ
  • добавлен
  • изменен
Springer, 2019. — 70 p. Human beings recognize speaker, language, emotion, and speech using multiple cues present in speech signal and evidences are combined to arrive at a decision. Humans use several prosodic cues for these recognition tasks. But conventional automatic speaker, language, emotion, and speech recognition systems mostly rely on spectral/cepstral features which are...
  • №163
  • 1,93 МБ
  • добавлен
  • изменен
Springer, 2019. — 70 p. — ISBN 978-1-4614-1158-1. Human beings recognize speaker, language, emotion, and speech using multiple cues present in speech signal and evidences are combined to arrive at a decision. Humans use several prosodic cues for these recognition tasks. But conventional automatic speaker, language, emotion, and speech recognition systems mostly rely on...
  • №164
  • 548,68 КБ
  • добавлен
  • изменен
Springer, 2004. — 431 p. The present coming of age of speech technologies coincides with the advent of mobile computing and the accompanying need for ubiquitous information access. This has generated enormous commercial interest around deploying speech interaction to IT-based services. In his book, Michael gives an in-depth review of the nuts and bolts of constructing speech...
  • №165
  • 2,75 МБ
  • добавлен
  • изменен
Springer, 1997. — 229 p. EC AY 96 Workshop, Budapest, Hungary, August 13, 1996, Revised Papers This volume contains a selection of extended and revised versions of papers presented to the European Conference on Artificial Intelligence ECAI-96 Workshop on Dialogue Processing in Spoken Language Systems. The workshop took place on August 13, 1996 in Budapest, Hungary. This workshop...
  • №166
  • 2,61 МБ
  • добавлен
  • изменен
University of Ljubljana, 2012. - 116 p. The two main objectives of this project are to analyse the efficiency of several techniques widely used among the field of emotion recognition through spoken audio signals, and, secondly, obtain empirical data that proves that it is actually plausible to do so with a more than acceptable performance rate. For that purpose, our research will...
  • №167
  • 2,58 МБ
  • добавлен
  • изменен
InTech, 2008. — 576 p. After decades of research activity, speech recognition technologies have advanced in both the theoretical and practical domains. The technology of speech recognition has evolved from the first attempts at speech analysis with digital computers by James Flanagan’s group at Bell Laboratories in the early 1960s, through to the introduction of dynamic...
  • №168
  • 41,85 МБ
  • добавлен
  • изменен
John Wiley, 2002. — 403 p. Playing with a new technology is fun. I have been a teacher in one form or another for over 20 years, but it still gets me excited when I see something that seems so obvious and so simple that it is shocking it hasn’t been done before. That’s the way I feel about VoiceXML. VoiceXML makes it possible for anyone who can build a basic Web page to create a...
  • №169
  • 2,06 МБ
  • добавлен
  • изменен
Kluwer, 2004. — 104 p. The conjunction of several factors having occurred throughout the past few years will make humans significantly change their behavior vis-а-vis machines. In particular the use of speech technologies will become normal in the professional domain, but also in everyday life. The performance of speech recognition components has significantly improved: only...
  • №170
  • 2,26 МБ
  • добавлен
  • изменен
Newnes, 2011. — 381 p. Voice over IP (VoIP) in particular and Voice over Packet (VoP) in general have been advocated and studied since the mid 1970s. It was the advent of DSP technology for voice compression in the late 1980s and early 1990s that gave these services the impetus they needed to enter the mainstream. Commercial-grade technologies and services started to appear in the...
  • №171
  • 9,20 МБ
  • добавлен
  • изменен
Springer, 2018. — 120 р. This book shows ways of augmenting the capabilities of Natural Language Processing (NLP) systems by means of cognitive-mode language processing. The authors employ eye-tracking technology to record and analyze shallow cognitive information in the form of gaze patterns of readers/annotators who perform language processing tasks. The insights gained from...
  • №172
  • 4,72 МБ
  • добавлен
  • изменен
Springer, 2018. — 120 р. This book shows ways of augmenting the capabilities of Natural Language Processing (NLP) systems by means of cognitive-mode language processing. The authors employ eye-tracking technology to record and analyze shallow cognitive information in the form of gaze patterns of readers/annotators who perform language processing tasks. The insights gained from...
  • №173
  • 2,57 МБ
  • добавлен
  • изменен
Springer, 2005. — 490 p. An increasing number of telephone services are offered in a fully automatic way with the help of speech technology. The underlying systems, called spoken dialogue systems (SDSs), possess speech recognition, speech understanding, dialogue management, and speech generation capabilities, and enable a more-or-less natural spoken interaction with the human...
  • №174
  • 19,25 МБ
  • добавлен
  • изменен
Springer, 2013. — 59 p. A leading use of speech recognition technology is the conversion of large speech databases into text for indexing and retrieval purposes. Using a large vocabulary continuous speech recognition (LVCSR) engine seems to provide a natural solution, as speech can be fully converted into text and then indexed and searched. One method used for searching speech...
  • №175
  • 592,68 КБ
  • добавлен
  • изменен
IGI Global, 2010. — 342 p. As social scientists often define it, technology refers to devices and processes that extend our natural capabilities. Microscopes make it possible to see smaller things and telescopes enable us to see things that are further away. Cars extend the amount of space that we are able to travel far beyond where our feet can take us during a given period of...
  • №176
  • 2,44 МБ
  • добавлен
  • изменен
Springer, 2007. — 362 p. The best way to introduce this textbook is by using the words Volker Dellwo and his colleagues had chosen to begin their chapter How Is Individuality Expressed in Voice? While they use this statement to motivate the introductory chapter on speech production and the phonetic description of speech, it constitutes a framework of the entire book as well:What...
  • №177
  • 4,17 МБ
  • добавлен
  • изменен
Springer, 2007. — 316 p. The best way to introduce this textbook is by using the words Volker Dellwo and his colleagues had chosen to begin their chapter How Is Individuality Expressed in Voice? While they use this statement to motivate the introductory chapter on speech production and the phonetic description of speech, it constitutes a framework of the entire book as well:What...
  • №178
  • 5,04 МБ
  • добавлен
  • изменен
John Wiley, 2008. — 592 p. Voice over IP (VoIP) gained popularity through actual deployments and by making use of VoIP - based telephone and fax calls with global roaming and connectivity via the Internet. Several decades of effort have gone into VoIP, and these efforts are benefitting real applications. Several valuable books have been published by experts in the field. While I...
  • №179
  • 4,98 МБ
  • добавлен
  • изменен
Springer, 2014. — 304 p. Second International Conference, IberSPEECH 2014, Las Palmas de Gran Canaria, Spain, November 19-21, 2014 Proceedings. The Spanish Thematic Network on Speech Technology (RTTH) and the ISCASpecial Interest Group on Iberian Languages (SIG-IL) are pleased to present the selected papers of IberSpeech 2014, Joint VIII Jornadas en Tecnologías del Habla and IV...
  • №180
  • 4,42 МБ
  • добавлен
  • изменен
Springer, 2010. — 490 p. Speech dereverberation has been on the agenda of the signal processing community for several years. It is only in the last decade, however, that the topic has really taken off, as seen from the growing number of publications appearing in the journals and at conferences. One of the reasons that the topic has become more popular is the rapidly growing...
  • №181
  • 9,19 МБ
  • добавлен
  • изменен
Springer, 2010. — 382 p. Advances in Speech Recognition: Mobile Environments, Call Centers and Clinics provides a forum for today’s speech technology industry leaders – drawn from private enterprises and academic institutions all over the world – to discuss the challenges, advances, and aspirations of voice technology. The collection of essays contained in this volume represents...
  • №182
  • 7,81 МБ
  • добавлен
  • изменен
Springer, 2013. — 72 p. AT&T, Yahoo! Research, and other companies, along with academicians, technology developers, and market analysts. They analyze the growing markets for mobile speech, new methodological approaches to the study of natural language, empirical research findings on natural language and mobility, and future trends in mobile speech. This book is divided into four...
  • №183
  • 4,71 МБ
  • добавлен
  • изменен
Springer, 2012. — 546 p. — ISBN 978-1-4614-0263-3. Forensic Speaker Recognition: Law Enforcement and Counter-Terrorism is an anthology of the research findings of thirty-five speaker recognition experts from around the world. The book provides a multidimensional look at the complex science involved in determining whether a suspect’s voice truly matches forensic speech samples,...
  • №184
  • 7,29 МБ
  • добавлен
  • изменен
Springer, 1983. — 503 p. This volume contains invited and contributed papers presented at the, NATO Advanced study Institute on "Recent Advances in Speech, Understanding and Dialog systems" held in Bad Windsheim, Federal, Republic of Germany, July 5 to July 18, 1987. It is divided into the, three parts Speech coding and Segmentation, Word Recognition, and, Linguistic Processing....
  • №185
  • 16,50 МБ
  • добавлен
  • изменен
EURASIP Journal on Advances in Signal Processing, 2010. — 94 p. Significant knowledge about microphone arrays has been gained from years of intense research and product development. There have been numerous applications suggested, for example, from large arrays (in the order of 100 elements) for use in auditoriums to small arrays with only 2 or 3 elements for hearing aids and...
  • №186
  • 7,14 МБ
  • добавлен
  • изменен
IEEE Press, 2000. — 560 p. Speech commW1ication is an interdisciplinary subject. Although much of the research material for the book comes from engineering literature (e.g., IEEE journals), a wide variety of sources is employed (especially for Chapters 3-5). The book is directed primarily at an engineering audience le.g., to a final-year undergraduate or graduate course in...
  • №187
  • 34,40 МБ
  • добавлен
  • изменен
Entropics Ltd., 1999. — 667 p. The HTK Application Programming Interface (HAPI) is a library of functions providing the programmer with an interface to any speech recognition system supplied by Entropic or developed using the Hidden Markov Model Toolkit (HTK). HTK is a set of UNIX tools which are used to construct all the components of a modern speech recogniser. One of the...
  • №188
  • 1,68 МБ
  • добавлен
  • изменен
Springer, 2015. — 336 p. This book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals, including advanced statistical and machine learning techniques for speech and speaker recognition with an overview of the key innovations in these areas. Key research undertaken in speech coding, speech enhancement, speech...
  • №189
  • 6,02 МБ
  • добавлен
  • изменен
CRC Press, 2010. — 381 p. It is becoming increasingly apparent that all forms of communication—including voice—will be transmitted through packet-switched networks based on the Internet Protocol (IP). Therefore, the design of modern devices that rely on speech interfaces, such as cell phones and PDAs, requires a complete and up-to-date understanding of the basics of speech coding....
  • №190
  • 9,07 МБ
  • добавлен
  • изменен
John Wiley, 2006. — 274 p. The total number of mobile phone subscribers worldwide is expected to exceed two billion in 2006. While ordinary voice calling remains the dominant application, mobile devices are becoming increasingly sophisticated, with features like multimedia messaging, cameras, web browsers, games, video, and music. The data capabilities of mobile networks are also...
  • №191
  • 2,38 МБ
  • добавлен
  • изменен
Emerald Group, 2012. — 459 p. The last 15 years have seen a revolution in auditory physiology, but the new ideas have been slow to gain currency outside specialist circles. Undoubtedly, one of the main reasons for this has been the lack of a general source for non-specialists, and it is hoped that this book will bring current thinking to a much wider audience. While the book...
  • №192
  • 4,52 МБ
  • добавлен
  • изменен
The MIT Press, 2012. — 339 p. — ISBN 978-0-262-01685-8. На англ. языке. In The Voice in the Machine , Roberto Pieraccini examines six decades of work in science and technology to develop computers that can interact with humans using speech and the industry that has arisen around the quest for these technologies. He shows that although the computers today that understand...
  • №193
  • 7,19 МБ
  • добавлен
  • изменен
Springer, 2010. — 279 p. During the past years the mystery of emotions has increasingly attracted interest in research on human–computer interaction. In this work we investigate the problem of how to incorporate the user’s emotional state into a spoken language dialogue system. The book describes the recognition and classification of emotions and proposes models integrating...
  • №194
  • 4,74 МБ
  • добавлен
  • изменен
Springer, 2015. — 187 p. If we want the vocal human–computer interaction to become more intuitive, it is inevitable to make the computer notice, interpret, and react to human ways of expression and patterns in communication beyond the recognition of the mere word strings. This is specifically important when it comes to subtle or hidden characteristics carrying connotations or...
  • №195
  • 2,98 МБ
  • добавлен
  • изменен
Proceedings of 35th International Conference on Electronics and Nanotechnology (ELNANO), Kyiv, 2015. - P.269- 274. Refined recommendations for choosing optimal, in the sense of automatic speech recognition (ASR) accuracy maximum, parameters of the late reverberation suppression technique, have been proposed in this paper.
  • №196
  • 297,06 КБ
  • добавлен
  • изменен
Prentice-Hall, 2002. — 800 p. Speech and hearing, man's most used means of communication, have been the objects of intense study for more than 150 years-from the time of von Kempelen's speaking machine to the present day. With the advent of the telephone and the explosive growth of its dissemination and use, the engineering and design of evermore bandwidth-efficient and...
  • №197
  • 18,80 МБ
  • дата добавления неизвестна
  • изменен
John Wiley, 2006. — 338 p. VoIP means transmitting speech over computer networks. In contrast to classical telephony, where research into the relation between physical transmission parameters, the resulting speech signal and the related speech quality has a longer tradition, speech quality of VoIP has only recently become an issue. The present book tries to merge knowledge of the...
  • №198
  • 2,69 МБ
  • добавлен
  • изменен
Prentice-Hall International, Inc. , Englewood Cliffs, New Jersey, 1993. — 507 p. From preface of the book: ".the fundamental goal of the book would be to provide a theoretically sound, technically acurate, and reasonably complete description of the basic knowledge and ideas that constitute a modern system for speech recognition by machine. "
  • №199
  • 4,16 МБ
  • дата добавления неизвестна
  • изменен
Prentice Hall, 1978. — 512 p. Классическая книга по цифровой обработке речевых сигналов Introduction Fundamentals of Digital Processing Digital Models for Speech Signal Time-Domain Methods for Speech Processing Digital Representations of the Speech Waveform Short-Time Fourier Analysis Homomorphic Speech Processing Linear Predictive Coding of Speech Digital Speech Processing for...
  • №200
  • 35,57 МБ
  • дата добавления неизвестна
  • изменен
NOWPress, 2007. — 194 p. — (Foundations and Trends in Signal Processing). Краткое изложение современных подходов к цифровой обработке речи. Since even before the time of Alexander Graham Bell’s revolutionary invention, engineers and scientists have studied the phenomenon of speech communication with an eye on creating more efficient and effective systems of human-to-human and...
  • №201
  • 3,19 МБ
  • дата добавления неизвестна
  • изменен
Boston: Pearson, 2010. — 1060 p. Speech signal processing has been a dynamic and constantly developing field for more than 70 years. The earliest speech processing systems were analog systems. They included, for example, the Voder (voice demonstration recorder) for synthesizing speech by manual controls, developed by Homer Dudley and colleagues at Bell Labs in the 1930s and...
  • №202
  • 14,33 МБ
  • добавлен
  • изменен
John Wiley, 2012. — 302 p. Advances in computing–in terms of both the creation of novel mathematical techniques and the design of data-driven technologies–have fuelled the ubiquitous development and deployment of speech technologies over the last two decades. Some of the core speech technologies and their applications to coding, recognition, synthesis, enhancement and such have...
  • №203
  • 1,20 МБ
  • добавлен
  • изменен
Kluwer, 1995. — 471 p. The term speech processing refers to the scientific discipline concerned with the analysis and processing of speech signals for getting the best benefit in various practical scenarios. These different practical scenarios correspond to a large variety of applications of speech processing research. Examples of some applications include enhancement, coding,...
  • №204
  • 6,73 МБ
  • добавлен
  • изменен
InTech, 2012. — 326 p. — ISBN 9535108313, ISBN 9789535108313. This book focuses primarily on speech recognition and the related tasks such as speech enhancement and modeling. This book comprises 3 sections and thirteen chapters written by eminent researchers from USA, Brazil, Australia, Saudi Arabia, Japan, Ireland, Taiwan, Mexico, Slovakia and India. Section 1 on speech...
  • №205
  • 12,08 МБ
  • добавлен
  • изменен
InTech, 2012. — 149 p. Speech processing is the process by which speech signals are interpreted, understood, and acted upon. Interpretation and production of coherent speech are both important in the processing of speech. It is done by automated systems such as voice recognition software or voice-to-text programs. Speech processing includes speech recognition, speaker recognition,...
  • №206
  • 5,72 МБ
  • добавлен
  • изменен
Springer, 1998. — 130 p. Once in a while, something nice happens, as if by coincidence, serendipitously. It happened to me when T.V. Raman asked me to supervise his Ph.D. thesis on building a system to speak documents, especially those with technical content or a lot of structure. The project had many interesting points, for example: the need for a programming language for writing...
  • №207
  • 1,92 МБ
  • добавлен
  • изменен
Springer, 2015. — 156 p. "Ultra Low Bit-Rate Speech Coding" focuses on the specialized topic of speech coding at very low bit-rates of 1 Kbits/sec and less, particularly at the lower ends of this range, down to 100 bps. The authors set forth the fundamental results and trends that form the basis for such ultra low bit-rates to be viable and provide a comprehensive overview of...
  • №208
  • 2,28 МБ
  • добавлен
  • изменен
Springer, 2012. — 136 p. During production of speech human beings impose durational constraints and intonation patterns on the sequence of sound units to convey the intended message. This inherent ability of the human beings in using the prosody (duration and intonation) knowledge is naturally acquired, and is difficult to articulate. But for synthesizing speech from a text by a...
  • №209
  • 1,67 МБ
  • добавлен
  • изменен
Springer, 2012. — 136 p. During production of speech human beings impose durational constraints and intonation patterns on the sequence of sound units to convey the intended message. This inherent ability of the human beings in using the prosody (duration and intonation) knowledge is naturally acquired, and is difficult to articulate. But for synthesizing speech from a text by a...
  • №210
  • 1,67 МБ
  • добавлен
  • изменен
Springer, 2013. — 127 p. Human beings use speech as a primary mode of communication for conveying messages. A speech signal carries multiple cues related to intended message, speaker and language identities, behavioural and emotional mood of the speaker and characteristics of background environment. Human beings exploit all these cues for performing various speech tasks. Now a...
  • №211
  • 1,94 МБ
  • добавлен
  • изменен
Springer, 2014. — 129 p. Robust speech systems in mobile environment have gained a special interest in recent years in order to enable access to remote voice-activated services. In this context, three major challenges that need to be considered are: varying background conditions, speech coding, and transmission channel errors. In this book, we focus on improving the recognition...
  • №212
  • 3,91 МБ
  • добавлен
  • изменен
Springer, 2015. — 119 p. This book discusses the contribution of excitation source information in discriminating language. The authors focus on the excitation source component of speech for enhancement of language identification (LID) performance. Language specific features are extracted using two different modes: (i) Implicit processing of linear prediction (LP) residual and (ii)...
  • №213
  • 2,74 МБ
  • добавлен
  • изменен
Springer, 2017. — 100 p. The goal of developing a phone recognition system (PRS) is to derive the sequence of basic sound units from the speech signal. Most of the state-of-the-art PRSs are developed using spectral features such as Mel frequency cepstral coefficients. Spectral features mainly represent the gross shape of the vocal tract, but not the information related to the...
  • №214
  • 2,06 МБ
  • добавлен
  • изменен
Lippincott Williams & Wilkins, 2011. - 416 p. Written in a clear, reader-friendly style, Speech Science Primer serves as an introduction to speech science and covers basic information on acoustics, the acoustic analysis of speech, speech anatomy and physiology, and speech perception. It also includes topics such as research methodology, speech motor control, and history/evolution...
  • №215
  • 8,83 МБ
  • добавлен
  • изменен
8th ELSNET Summer School, Chios Island, Greece, July 15-30 2000, Revised Lectures. — Springer, 2003. — 202 p. This book originated from the 8th ELSNET Summer School on Language and Communication that was held in the summer of 2000 on the island of Chios in ELSNET is the European Network in Human Language Technologies, a network some 140 academic institutions and private companies...
  • №216
  • 1,67 МБ
  • добавлен
  • изменен
Kluwer, 1989. — 169 p. In order to perceive speech and other sounds, the incoming sound wave must be transformed into a variety of representations, each bringing forth different aspects of the signal, its source, and meaning. Understanding how we perceive and how machines can be made to perceive auditory signals means, in part, discovering appropriate representations for the...
  • №217
  • 5,64 МБ
  • добавлен
  • изменен
Springer, 2014. — 497 p. 16th International Conference, SPECOM 2014, Novi Sad, Serbia, October 5–9, 2014 Proceedings. The Speech and Computer International Conference (SPECOM) is a regular event organized since the first SPECOM in 1996 that was held in St. Petersburg, Russian Federation. It is a conference with a long tradition that attracts researchers in the area of computer...
  • №218
  • 6,86 МБ
  • добавлен
  • изменен
Springer, 2015. — 519 p. 17th International Conference, SPECOM 2015, Athens, Greece, September 20–24, 2015 Proceedings. The Speech and Computer International Conference (SPECOM) is a regular event organized since 1996 when the first SPECOM was held in St. Petersburg, Russian Federation. It is a conference with a long tradition that attracts researchers in the area of computer...
  • №219
  • 17,29 МБ
  • добавлен
  • изменен
Taylor & Francis, 2002. — 359 p. This book is about an aspect of applied scholarly endeavour, forensic phonetics, that carries with it very serious social responsibilities. The book makes it clear that forensic speaker identification requires scholarly expertise, and in several disparate areas. Expertise, like forensically useful fundamental frequency, is a long-term thing. It...
  • №220
  • 3,44 МБ
  • добавлен
  • изменен
Springer, 1995. — 517 p. This book collects the contributions to the NATO Advanced Study Institute on New Advances and Trends in Speech Recognition and Coding, held in Bubi6n, Granada (Spain), from June 28th to July 10th 1993. The goal of the ASI was to bring together the most important experts on speech recognition and coding to discuss and disseminate their most recent findings,...
  • №221
  • 11,59 МБ
  • добавлен
  • изменен
Springer, 1997. — 399 p. This book presents a collection of papers from the Spring 1995 Workshop on Computational Approaches to Processing the Prosody of Spontaneous Speech, hosted by the ATR Interpreting Telecommunications Research Laboratories in Kyoto, Japan. The workshop brought together leading researchers in the fields of speech and signal processing, electrical engineering,...
  • №222
  • 5,92 МБ
  • добавлен
  • изменен
Springer, 2009. — 206 p. State-of-the-art automatic speech recognition (ASR) systems use statistical data-driven methods based on hidden Markov models (HMMs). Although such approaches have proved to be efficient choices, ASR systems often perform much worse than human listeners, especially in the presence of unexpected acoustic variability. To improve performance, we usually rely...
  • №223
  • 2,08 МБ
  • добавлен
  • изменен
Springer, 2014. — 199 p. Speech is a naturally occuring nonstationary signal essential not only for personto- person communication but has become an important aspect of Human Computer Interaction (HCI). Some of the issues related to analysis and design of speech-based applications for HCI have received widespread attention. With continuous upgradation of processing techniques,...
  • №224
  • 3,13 МБ
  • добавлен
  • изменен
Springer, 2012. — 251 p. — ISBN-10 1461445922, ISBN-13 9781461445920. In Monitoring Adaptive Spoken Dialog Systems, authors Alexander Schmitt and Wolfgang Minker investigate statistical approaches that allow for recognition of negative dialog patterns in Spoken Dialog Systems (SDS). The presented stochastic methods allow a flexible, portable and accurate use. Beginning with the...
  • №225
  • 4,81 МБ
  • добавлен
  • изменен
Springer, 2004. — 399 p. The first edition having been sold out, gives me a welcome opportunity to augment this volume by some recent applications of speech research. A new chapter, by Holger Quast, treats speech dialogue systems and natural language processing. Dictation programs for word processors, voice dialing for mobile phones, and dialogue systems for air travel...
  • №226
  • 6,98 МБ
  • добавлен
  • изменен
John Wiley, 2014. — 345 p. It might be safe to claim that 20 years ago, neither the term ‘computational paralinguistics’ nor the field it denotes existed. Some 10 years ago, the term did not yet exist either. However, in hindsight, the field had begun to exist if we think of the first steps towards the automatic processing of emotions in speech in the mid-1990s. For example,...
  • №227
  • 4,72 МБ
  • добавлен
  • изменен
Academic Press, 2006 Обработка естественного языка с многоязыковой точки зрения Introduction Language Characteristics Linguistic Data Resources Multilingual Acoustic Modeling Multilingual Dictionaries Multilingual Language Modeling Multilingual Speech Synthesis Automatic Language Identification Other Challenges: Non-native Speech, Dialects, Accents,and Local Interfaces...
  • №228
  • 2,63 МБ
  • дата добавления неизвестна
  • изменен
Springer, 2011. — 113 p. Soft Computing (SC) techniques have been recognized nowadays as attractive solutions for modeling highly nonlinear or partially defined complex systems and processes. These techniques resemble biological processes more closely than conventional (more formal) techniques. However, despite its increasing popularity, soft computing lacks a precise definition...
  • №229
  • 1,74 МБ
  • добавлен
  • изменен
InTech, 2010. — 174 p. Speech processing has come a long way since the year of 1947, when R. K. Potter, G. A. Kopp, and H. Green from Bell Labs introduced the sound spectrograph, the fi rst instrument to produce human voice-prints in the short-time Fourier-transform domain. Ever since, speech recognition has been constantly evolving. From isolated word recognition with small...
  • №230
  • 6,83 МБ
  • добавлен
  • изменен
Singapore: Springer, 2019. — 426 p. This book is about recent research in the area of profiling humans from their voice, which seeks to deduce and describe the speaker's entire persona and their surroundings from voice alone. It covers several key aspects of this technology, describing how the human voice is unique in its ability to both capture and influence the human persona --...
  • №231
  • 14,72 МБ
  • добавлен
  • изменен
Springer, 2010. — 177 p. Speech Processing has rapidly emerged as one of the most widespread and wellunderstood application areas in the broader discipline of Digital Signal Processing. Besides the telecommunications applications that have hitherto been the largest users of speech processing algorithms, several nontraditional embedded processor applications are enhancing their...
  • №232
  • 1,84 МБ
  • добавлен
  • изменен
Oxford University Press, 1994. — 314 p. The most sophisticated and efficient means of communication between humans is spoken natural language (NL). It is a rare circumstance when two people choose to communicate via another means when spoken natural language is possible. Ochsman and Chapanis [OC74] conducted a study involving two person teams solving various problems using...
  • №233
  • 5,79 МБ
  • добавлен
  • изменен
Cambridge. Tecnical Report Number 740, 2009. ISSN 1476-2986 The focus of this research is on analysis of a wide range of emotions and mental states from non-verbal expressions in speech. In particular, on inference of complex mental states, beyond the set of basic emotions, including naturally evoked subtle expressions and mixtures of expressions.
  • №234
  • 2,57 МБ
  • добавлен
  • изменен
Springer, 2010. — 209 p. International Conference on Nonlinear Speech Processing, NOLISP 2009. Vic, Spain, June 25-27, 2009. Revised Selected Papers. This volume contains the proceedings of NOLISP 2009, an ISCA Tutorial and Workshop on Non-Linear Speech Processing held at the University of Vic (Catalonia, Spain) during June 25-27, 2009. NOLISP 2009 was preceded by three editions...
  • №235
  • 3,28 МБ
  • добавлен
  • изменен
Springer, 2013. — 415 p. Summarising a research programme that lasted formore than 6 years is a demanding task due to the wealth of deliverables, publications and final results of each of the projects concerned. In addition to the content-related topics, which interest scientists, research programmes also lead to new insights for policy makers and programme managers. The...
  • №236
  • 3,70 МБ
  • добавлен
  • изменен
EURASIP Journal on Audio, Speech, and Music Processing, 2010. — 90 p. One of the most important aspects of spoken language is its large degree of variability. Variability in speech is caused by many different sources, for instance, changes of the acoustic environment or transmission channel and differences between speakers or various speaking styles. Successful speech processing...
  • №237
  • 5,08 МБ
  • добавлен
  • изменен
Springer, 1996. — 682 p. This book is one outcome of the NATO Advanced Studies Institute (ASI) Workshop, "Speechreading by Man and Machine," held at the Chateau de Bonas, Castera-Verduzan (near Auch, France) from August 28 to September 8, 1995 - the first interdisciplinary meeting devoted the subject of speechreading ("lipreading"). The forty-five attendees from twelve countries...
  • №238
  • 12,55 МБ
  • добавлен
  • изменен
Springer, 2010. — 354 p. This book describes the development and evaluation of a novel type of spoken language dialogue system that proactively interacts in the conversation with two users. Spoken language dialogue systems are increasingly deployed in more and more application domains and environments. As a consequence, the demands posed on the systems are rising rapidly. In...
  • №239
  • 1,23 МБ
  • добавлен
  • изменен
Springer, 2007. — 279 p. The last meeting of the Management Committee of the COST Action 277: Nonlinear Speech Processing was held in Heraklion, Crete, Greece, September 20–23, 2005 during the Workshop on Nonlinear Speech Processing (WNSP). This was the last event of COST Action 277. The Action started in 2001. During the workshop, members of the Management Committee and invited...
  • №240
  • 4,05 МБ
  • добавлен
  • изменен
Springer, 2011. — 82 p. Spoken dialog systems have been the object of intensive research interest over the past two decades, and hundreds of scientif c articles as well as a handful of text books such as [25, 52, 74, 79, 80, 83] have seen the light of day. What most of these publications lack, however, is a link to the real world, i.e., to conditions, issues, and environmental...
  • №241
  • 892,52 КБ
  • добавлен
  • изменен
Springer, 2013. — 278 p. Since the release of the first Internet Phone in 1995, Voice over Internet Protocol (VoIP) has grown exponentially, from a lab-based application to today’s established technology, with global penetration, for real-time communications for business and daily life. Many organisations are moving from the traditional PSTN networks to modern VoIP solutions and...
  • №242
  • 6,83 МБ
  • добавлен
  • изменен
CRC Press, 2000. — 798 p. Speech has evolved over a period of tens of thousand of years as the primary means of communication between human beings. Since the evolution of speech and of homo sapiens have proceeded hand-inhand, it seems reasonable to assume that human speech production mechanisms, and the resulting acoustic signal, are optimally adapted to human speech perception...
  • №243
  • 2,97 МБ
  • добавлен
  • изменен
Springer, 2008. — 403 p. The remarkable advances in computing and networking have sparked an enormous interest in deploying Automatic Speech Recognition on Mobile Devices and Over Communication Networks, and the trend is accelerating. This yields an abundance of practical systems, operational algorithms and scientific publications. There is, however, no integrated book available...
  • №244
  • 2,13 МБ
  • добавлен
  • изменен
Cambridge University Press, 2009. — 642 p. Speech processing technology has been a mainstream area of research for more than 50 years. The ultimate goal of speech research is to build systems that mimic (or potentially surpass) human capabilities in understanding, generating and coding speech for a range of human-to-human and human-to-machine interactions. In the area of speech...
  • №245
  • 4,95 МБ
  • добавлен
  • изменен
Cambridge University Press, 2009. — 642 p. Speech processing technology has been a mainstream area of research for more than 50 years. The ultimate goal of speech research is to build systems that mimic (or potentially surpass) human capabilities in understanding, generating and coding speech for a range of human-to-human and human-to-machine interactions. In the area of speech...
  • №246
  • 3,91 МБ
  • добавлен
  • изменен
Springer, 2018. — 82 p. With the invention of less expensive means of internet access, voice communication via social media is on the rise, which often comprises threats and distortions. Incorrect speaker/speech identification may sometimes lead to ambiguities in speaker identification and misunderstandings. Therefore, proper identification of speech is a must in speech...
  • №247
  • 2,77 МБ
  • добавлен
  • изменен
2014. — 88 p. — ASIN B00NV4DZ86. Learn to love Dragon Naturally Speaking with just 100+ Commands Get off to a flying start, improve your skills, speak with confidence - using this new 60 page, illustrated colour guide. Dragon speech recognition can transform the way people work with their computers - students, doctors, writers, family historians, people with dyslexia or...
  • №248
  • 3,08 МБ
  • добавлен
  • изменен
Springer, 2013. — 142 p. Speech is the most natural mode of communication and yet attempts to build systems which support robust habitable conversations between a human and a machine have so far had only limited success. A key reason is that current systems treat speech input as equivalent to a keyboard or mouse, and behaviour is controlled by pre-defined scripts that try to...
  • №249
  • 1,85 МБ
  • добавлен
  • изменен
Springer, 2012. — 301 p. IberSPEECH 2012 Conference, Madrid, Spain, November 21-23, 2012 Proceedings. It was a pleasure and an honor to organize the IberSPEECH 2012: Joint VII Jornadas en Tecnología del Habla and III Iberian SLTech Workshop, that took place during November 21–23 in Madrid, Spain, hosted by the ATVS Biometric Research Group, Universidad Autónoma de Madrid. This...
  • №250
  • 5,57 МБ
  • добавлен
  • изменен
Springer, 2011. — 292 p. 5th International Conference on Nonlinear Speech Processing, NOLISP 2011, Las Palmas de Gran Canaria, Spain, November 7-9, 2011 Proceedings. This volume contains the proceedings of NOLISP 2011, an ISCA Tutorial and Workshop on Non-Linear Speech Processing held at the University of Las Palmas de Gran Canaria (Canary Islands, Spain) during November 7–9,...
  • №251
  • 3,96 МБ
  • добавлен
  • изменен
Springer, 2008. — 176 p. Applications of Discrete Wavelet Transform and Wavelet Denoising to Speech Classification, Speech Enhancement and Robust Speech Recognition In this work, we study the application of wavelet analysis for robust speech processing. Reliable time-scale features (TS) which characterize the relevant phonetic classes such as voiced (V), unvoiced (UV), silence...
  • №252
  • 9,65 МБ
  • добавлен
  • изменен
John Wiley, 2011. — 471 p. There are a number of books and textbooks on speech processing or natural language processing (even some covering speech and language processing), there are no books focusing on spoken language understanding (SLU) approaches and applications. In that respect, living between two worlds, SLU has not received the attention it deserves in spoken language...
  • №253
  • 3,29 МБ
  • добавлен
  • изменен
Springer, 2000. — 302 p. This book originates from the Fifth European Summer School on Language and Speech Communication that was held in the summer of 1997 in Leuven, Belgium, under the auspices of the European Language and Speech Network (ELSNET). The central topic of the summer school was "Lexicon Development for Language and Speech Processing"; the choice of this theme was...
  • №254
  • 4,72 МБ
  • добавлен
  • изменен
Springer, 2005. — 371 p. The chapters in this book jointly contribute to what we shall call the field of natural and multimodal interactive systems engineering. This is not yet a well-established field of research and commercial development but, rather, an emerging one in all respects. It brings together, in a process that, arguably, was bound to happen, contributors from many...
  • №255
  • 8,02 МБ
  • добавлен
  • изменен
Springer, 1995. — 589 p. Text-to-speech synthesis involves the computation of a speech signal from input text. Accomplishing this requires a system that consists of an astonishing range of components, from abstract linguistic analysis of discourse structure to speech coding. Several implications flow from this fact. First, text-to-speech synthesis is inherently multidisciplinary,...
  • №256
  • 3,46 МБ
  • добавлен
  • изменен
John Wiley, 2006. — 644 p. The digital processing, storage, and transmission of speech signals have gained great practical importance. The main application areas are digital mobile radio, acoustic human–machine communication, and digital hearing aids. In fact, these applications are the driving force behind many scientific and technological developments in this field. A specific...
  • №257
  • 19,33 МБ
  • добавлен
  • изменен
Springer, 2013. — 146 p. In this book, hierarchical structures based on neural networks are investigated for automatic speech recognition. These structures are mainly evaluated in the task of phoneme recognition under the Hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) paradigm. The baseline hierarchical scheme consists of two levels where each level is based on a...
  • №258
  • 1,79 МБ
  • добавлен
  • изменен
John Wiley, 2013. — 501 p. The term computer speech recognition conjures up visions of the science-fiction capabilities of HAL2000 in 2001, A Space Odessey, or Data, the anthropoid robot in Star Trek, who can communicate through speech with as much ease as a human being. However, our real-life encounters with automatic speech recognition are usually rather less impressive,...
  • №259
  • 7,75 МБ
  • добавлен
  • изменен
Springer, 2018. — 417 p. The recent progress on machine learning and signal processing has enabled the development of technologies for automatic analysis of sound scenes and events by computational means. This has attracted several research groups and companies to investigate this new field, which has potential in several applications and also has several research challenges. This...
  • №260
  • 7,21 МБ
  • добавлен
  • изменен
Eamon Dolan/Houghton Mifflin Harcourt, 2019. — 259 p. — ISBN 10 1328799301, 13 978-1328799302. The next great technological disruption is coming The titans of Silicon Valley are racing to build the last, best computer that the world will ever need. They know that whoever successfully creates it will revolutionize our relationship with technology—and make billions of dollars in the...
  • №261
  • 4,22 МБ
  • добавлен
  • изменен
Wai. C. Chu. Speech Coding Algorithms. Foundation and Evolution of Standardized Coders Mobile Media Laboratory. DoCoMo USA Labs. San Jose, California Wiley &Sons publishing. 578 pages. Speech coding is a highly mature branch of signal processing deployed in products such as cellular phones, communication devices, and more recently, voice over internet protocol This...
  • №262
  • 3,48 МБ
  • дата добавления неизвестна
  • изменен
Morgan Kaufmann, 1990. — 630 p. Despite several decades of research activity, speech recognition still retains its appeal as an exciting and growing field of scientific inquiry. Many advances have been made during these past decades; but every new technique and every solved puzzle opens a host of new questions and points us in new directions. Indeed, speech is such an intimate...
  • №263
  • 17,12 МБ
  • добавлен
  • изменен
Springer, 2013. — 207 p. In the present book, speech transmission quality is modeled on the basis of perceptual dimensions that are relevant for today’s public-switched and packet-based telecommunication systems. The complete transmission path from the mouth of the speaker to the ear of the listener is regarded, and both narrowband (300–3400 Hz) as well as wideband (50–7000 Hz)...
  • №264
  • 3,18 МБ
  • добавлен
  • изменен
Kluwer, 2004. — 124 p. Speech is the most natural fonn of communication among humans. As machines become ever more capable and their use more widespread due to advances in computing. the need to allow natural communication between a human and a machine also gains critical significance. In order to realize such a system, it is essential that the speech communication process is well...
  • №265
  • 3,93 МБ
  • добавлен
  • изменен
Cambridge: Cambridge University Press, 2015. — 424 p. With this comprehensive guide you will learn how to apply Bayesian machine learning techniques systematically to solve various problems in speech and language processing. A range of statistical models is detailed, from hidden Markov models to Gaussian mixture models, n-gram models and latent topic models, along with...
  • №266
  • 12,78 МБ
  • добавлен
  • изменен
Springer, 2017. — 436 p. — ISBN 9783319646794. The text provides insights and detailed descriptions of some of the new concepts and key technologies in the field, including novel architectures for speech enhancement, microphone arrays, robust features, acoustic model adaptation, training data augmentation, and training criteria. The contributed chapters also include descriptions...
  • №267
  • 4,22 МБ
  • добавлен
  • изменен
New York: Springer, 2017. — 436 p. The text provides insights and detailed descriptions of some of the new concepts and key technologies in the field, including novel architectures for speech enhancement, microphone arrays, robust features, acoustic model adaptation, training data augmentation, and training criteria. The contributed chapters also include descriptions of real-world...
  • №268
  • 8,83 МБ
  • добавлен
  • изменен
Springer, 1987. — 168 p. This book has its origins in a programme of work conducted at British Telecom Research Laboratories, aimed at developing easily usable, intelligent systems, based on human-computer interaction via spoken and written language, particularly the former. This involved the authors, as members of the Human Factors Division, in conduct-, ing a series of...
  • №269
  • 3,06 МБ
  • добавлен
  • изменен
Delmar, Cengage Learning, 2009. — 396 p. — ISBN 1435427270. Understanding Voice Over IP Technology provides students with the in-depth knowledge of Voice over IP and the TCP/IP protocol that it is based on. Voice over IP technology, or making telephone calls over data networks such as the Internet, has now reached the tipping point, and is expected to eventually become the...
  • №270
  • 12,70 МБ
  • добавлен
  • изменен
John Wiley, 2009. — 584 p. Серьезная книга по современным речевым технологиям As the authors of Distant Speech Recognition note, automatic speech recognition is the key enabling technology that will permit natural interaction between humans and intelligent machines. Core speech recognition technology has developed over the past decade in domains such as office dictation and...
  • №271
  • 19,73 МБ
  • добавлен
  • изменен
Elsevier, 2015. — 194 p. In the information communication field, speech communication via network becomes an important way to transfer information. With the development of information technology, speech communication is widely used for military, diplomatic, and economic purposes as well as in cultural life and scientific research. Therefore, speech secure communication and the...
  • №272
  • 4,47 МБ
  • добавлен
  • изменен
Springer, 2014. — 215 p. Speech and hearing sciences are fundamental to numerous technological advances of the digital world in the past decade, from music compression in MP3 to digital hearing aids, from network based voice enabled services to speech interaction with mobile phones. Mathematics and computation are intimately related to these leaps and bounds. On the other hand,...
  • №273
  • 15,70 МБ
  • добавлен
  • изменен
Kluwer, 1997. — 247 p. This book originates from the 2nd European Summer School on Language and Speech Communication that was held in the summer of 1994 in Utrecht, The Netherlands. During two weeks, 90 participants enjoyed 14 courses that were focussed on the theme "Corpus-Based Methods in Language and Speech Processing". The enthusiasm of the participants for the topic and the...
  • №274
  • 3,58 МБ
  • добавлен
  • изменен
Springer, 2014. — 321 p. — ISBN-10: 1447157788, ISBN-13: 978-1-4471-5778-6. This book provides a comprehensive overview of the recent advancement in the field of automatic speech recognition with a focus on deep learning models including deep neural networks and many of their variants. This is the first automatic speech recognition book dedicated to the deep learning approach. In...
  • №275
  • 7,56 МБ
  • добавлен
  • изменен
Springer, 2013. — 383 p. 15th International Conference, SPECOM 2013, Pilsen, Czech Republic, September 2013, Proceedings. The Speech and Computer International Conference (SPECOM) is a regular event organized annually or bi-annually since the first SPECOM in 1996 that was held in St. Petersburg, Russian Federation. It is a conference with long tradition that attracts researchers...
  • №276
  • 3,68 МБ
  • добавлен
  • изменен
Москва: Радио и связь, 2004. — 164 с. В книге рассматриваются методы обработки цифровой речи, предназначенные для формирования последовательности векторов признаков и два типа задач классификации речевого сигнала: распознавание слитной речи, идентификация диктора по его голосу. В задаче формирования векторов признаков основное внимание уделяется методам обнаружения и фильтрации...
  • №277
  • 2,45 МБ
  • добавлен
  • изменен
Москва: Изд-во "Радио и связь", 2004. 164 с. Аннотация. В книге рассматриваются методы обработки цифровой речи, предназначенные для формирования последовательности векторов признаков и два типа задач классификации речевого сигнала: распознавание слитной речи, идентификация диктора по его голосу. В задаче формирования векторов признаков основное внимание уделяется методам...
  • №278
  • 2,15 МБ
  • дата добавления неизвестна
  • изменен
М.: Воениздат, 1974. — 136 с.: ил. В брошюре излагается одна из наиболее сложных проблем нашего времени — автоматическое распознавание речевых сигналов и машинное (искусственное) воспроизводство связной речи. Брошюра охватывает все основные аспекты этой проблемы, в ней сформулированы предпосылки, обусловившие необходимость создания техники для прямого речевого общения человека...
  • №279
  • 11,37 МБ
  • добавлен
  • изменен
Киев: Наук. думка, 1987. – 264 с. В монографии рассматриваются вопросы автоматического анализа, распознавания, смысловой интерпретации, синтеза и компрессированной передачи речевых сигналов применительно к устному диалогу человека и ЭВМ на формализованных и естественных языках предметных областей для использования в человеко-машинных системах сбора, обработки информации и...
  • №280
  • 3,57 МБ
  • дата добавления неизвестна
  • изменен
Деркач М.Ф., Гумецкий Р.Я., Гура Б.М., Чабан М.Е. Львов: Вища школа, 1983. — 168 с. В монографии рассматриваются динамические спектрограммы звуков, слогов, слов и слитных фраз русской речи. Основное внимание уделено отображению на спектрограммах работы артикуляционных органов в процессе произношения речевых сигналов. Особое значение придается изучению динамики артикуляционного...
  • №281
  • 4,08 МБ
  • добавлен
  • изменен
К.: Полиграф Консалтинг, 2005. — 138 с. В книге представлено спектрально-временное описание речевых сигналов как функций многих переменных. Приведено решение задач нахождения параметров частотной функции речеобразующей системы в одно- и двухмерном случаях по спектральной функции речевого сигнала. Приведены также некоторые алгоритмы и классификация задач базы знаний распознавания...
  • №282
  • 10,57 МБ
  • добавлен
  • изменен
М.: Мир, 1985. — 237 с. — (В мире науки и техники). Книга рассказывает о теоретических исследованиях и практических разработках в технике синтеза речи. Автор приводит также конкретные схемы электронных блоков, используемых в реальных синтезаторах речи. Основы компьютерного синтеза речи. Как мы говорим Немного о лингвистике. Этика поведения компьютера - синтезатора речи. Немного...
  • №283
  • 4,32 МБ
  • дата добавления неизвестна
  • изменен
М.: Мир, 1985. — 237 с. — (В мире науки и техники). Книга рассказывает о теоретических исследованиях и практических разработках в технике синтеза речи. Автор приводит также конкретные схемы электронных блоков, используемых в реальных синтезаторах речи. Книга адресована широкому кругу читателей, интересующихся достижениями современной техники; особенно полезна она будет...
  • №284
  • 30,48 МБ
  • добавлен
  • изменен
Санкт-Петербургский институт информатики и автоматизации Российской Академии Наук, 2013, -316 с. В монографии очерчен круг проблем, связанных с особенностями автоматического анализа разговорной русской речи в интерактивных диалоговых системах. Описаны методы дистанционной записи речи, учета вариативности произношения в разговорной речи, компактного представления словаря, а...
  • №285
  • 6,32 МБ
  • добавлен
  • изменен
СПб.: ГУАП, 2013. — 314 с. В монографии очерчен круг проблем, связанных с особенностями автоматического анализа разговорной русской речи в интерактивных диалоговых системах. Описаны методы дистанционной записи речи, учета вариативности произношения, компактного представления словаря, а также синтаксическо-статистического моделирования языка в системах автоматического распознавания...
  • №286
  • 42,80 МБ
  • добавлен
  • изменен
Руководство к лабораторно-практическим занятиям по дисциплине "Безопасность жизнедеятельности. Часть 2 Информационная безопасность". Изд-во ТТИ ЮФУ. Таганрог, 2011. 48 с. Предназначено для студентов радиотехнических специальностей вуза с целью изучения разновидностей, характеристик, принципов построения и алгоритмических моделей аналоговых временных и частотных скремблеров...
  • №287
  • 2,21 МБ
  • добавлен
  • изменен
У. А. Ли, Э. П. Нейбург, Т. Б. Мартин, Дж. Р. Уэлч, В. У. Зу, Р. М. Шварц, Дж. Е. Шуп, А. Р. Смит, М. Р. Самбур, Ф. Хейс-Роз, Г. Гудмэн, Р. Редди. Методы автоматического распознавания речи: В 2-х книгах. Пер. с англ. /Под ред. У. Ли. – М.: Мир, 1983. – Кн. 1. 328 с., ил. Монография написана ведущими специалистами США, Франции, Италии, Японии и Польской Народной Республики в...
  • №288
  • 8,69 МБ
  • дата добавления неизвестна
  • изменен
Дж. А. Барнет, М. И. Бернстейн и др. Методы автоматического распознавания речи: В 2-х книгах. Пер. с англ. /Под ред. У. Ли. – М.: Мир, 1983. – Кн. 2. 392 с., ил. Монография написана ведущими специалистами США, Франции, Италии, Японии и Польской Народной Республики в области распознавания речи. В русском переводе выпускается в двух книгах. Книга 2 посвящена конкретным системам...
  • №289
  • 9,92 МБ
  • дата добавления неизвестна
  • изменен
Пер. с англ. — Под ред. Ю. Н. Прохорова и В. С. Звездина. — М.: Связь, 1980. – 308 с.: ил. В книге излагается в полном объеме комплекс вопросов, связанных с обработкой речевых сигналов с помощью методов линейного предсказания. Представлены алгоритмы анализа речи и процедуры ее синтеза по множеству информативных параметров, доведенные до программ на языке ФОРТРАН. Рассмотрены...
  • №290
  • 2,74 МБ
  • дата добавления неизвестна
  • изменен
Монографія. — Херсон: вид-во ФОП Вишемирський В.С., 2018. — 168 с. Проаналізовано існуючі на сьогоднішній день методи аналізу голосового сигналу людини. Досліджено сучасні методи аутентифікації особистості, які основані на аналізі голосового сигналу. Розроблено метод локальних максимумів, який дає точніші результати сегментації голосового сигналу у порівнянні з існуючими методами....
  • №291
  • 27,19 МБ
  • добавлен
  • изменен
Под ред. Сапожкова М. А. — М.: Радио и связь, 1987. — 168 с. Во многих научных центрах в СССР и за рубежом ведутся интенсивные исследования в области передачи сигналов речи по узкополосным каналам связи, автоматического распознавания речевых команд в системах обработки и передачи данных, обучению людей с дефектами слуха и речи, иноязычных и др. Данным исследованиям посвящены...
  • №292
  • 5,72 МБ
  • дата добавления неизвестна
  • изменен
Кишинёв: Штиинца, 1987. — 175 с. Рассматриваются общие вопросы построения систем автоматического распознавания и синтеза речи. Содержатся сведения о речеобразовании и речевых сигналах, цифровой обработке речи, даётся краткое описание современных отечественных и зарубежных систем распознавания и синтеза речи. Книга рассчитана на массового читателя, студентов технических вузов,...
  • №293
  • 9,05 МБ
  • добавлен
  • изменен
Учебно-методическое пособие для студентов специальности «Электронные вычислительные средства» дневной формы обучения. — Минск: БГУИР, 2005. — 51 с. Учебно-методическое пособие содержит описание алгоритмов, применяемых для обработки речи: детектора речи, анализа на основе линейного предсказания, векторного квантования. Даны примеры применения векторного квантования для...
  • №294
  • 1,85 МБ
  • добавлен
  • изменен
Перев. Попова Р., Кемерово, 2000. - 79 с. Дата выхода оригинальной работы - 1993 г. В этой работе мы рассмотрим компоненты алгоритмов обработки сигнала. Эти алгоритмы приводятся как часть общего обзора задачи параметризации сигнала, которазя делится на три направления: измерение, преобразование и статистическое моделирование. В соответствии с этой целью в работу включено...
  • №295
  • 824,70 КБ
  • добавлен
  • изменен
М.: Государственное издательство литературы по вопросам связи и радио, 1962. — 391 с. Приводятся экспериментальные способы определения разборчивости методом артикуляционных измерений и числовые методы расчета разборчивости передаваемой речи. Очень актуальная по сей день книга, оказывается.
  • №296
  • 5,31 МБ
  • дата добавления неизвестна
  • изменен
М.: Радио и связь, 1989. — 248 с., ил. Монография посвящена описанию современного состояния развития техники, использующей возможности речевой связи между человеком и машиной (роботом).
  • №297
  • 2,98 МБ
  • дата добавления неизвестна
  • изменен
М.: Радио и связь, 1989. — 248 с. Монография посвящена описанию современного состояния развития техники, Использующей возможности речевой связи между человеком и машиной (роботом). Эта область научных исследований и технических разработок прогрессивно развивается в наиболее развитых в техническом отношении странах, что связано в первую очередь с освоением вычислительной техники и...
  • №298
  • 5,38 МБ
  • добавлен
  • изменен
Изд. 3-е. — М.: КомКнига, 2012. — 328 с. Книга посвящена проблемам управления техническими устройствами с помощью устной речи, что имеет непосредственное отношение к развитию робототехнических систем, управляемых голосом. В работе отражены различные аспекты лингвистического компонента в подобных системах. Подчеркивается особое значение исследований в области фундаментального и...
  • №299
  • 49,11 МБ
  • добавлен
  • изменен
М.: Радио и связь, 1981. — 496 с., ил. Рассматриваются вопросы цифровой обработки речевых сигналов в системах передачи информации и управления ЭВМ голосом. Излагаются проблемы цифрового представления речевых сигналов: временная дискретизация, интерполяция, квантование, проектирование цифровых фильтров. Обсуждаются способы построения цифровых систем передачи, систем...
  • №300
  • 38,54 МБ
  • дата добавления неизвестна
  • изменен
Пер. с англ. Под ред. М. В. Назарова и Ю. Н. Прохорова. — М.: Радио и связь, 1981. — 496 с.: ил. Рассматриваются вопросы цифровой обработки речевых сигналов в системах передачи информации и управления ЭВМ голосом. Излагаются проблемы цифрового представления речевых сигналов: временная дискретизация, интерполяция, квантование, проектирование цифровых фильтров. Обсуждаются...
  • №301
  • 8,13 МБ
  • добавлен
  • изменен
Тбилиси: Мецниереба, 1976. — 183 с. Монография посвящена проблеме автоматической идентификации голосов. В ней затронут круг вопросов, связанных с исследованием индивидуальных особенностей голоса, проявляющейся в процессе реальной речевой активности человека. Подробно обсуждается роль как отдельных фонем и их сочетаний, так и более сложных семантических единиц речи в передаче...
  • №302
  • 5,11 МБ
  • добавлен
  • изменен
М.: Государственное издательство литературы по вопросам связи и радио, 1963. — 452 с. Книга посвящена преобразованиям речи применительно к задачам техники связи и кибернетики. Книга рассчитана на специалистов в области техники связи, автоматики, кибернетики, инженеров, аспирантов и научных сотрудников, изучающих вопросы преобразования речи.
  • №303
  • 5,42 МБ
  • добавлен
  • изменен
М.: Наука, 1992. — 392 с. — ISBN 5-02-014665-Х. Синтез речи с использованием ЭВМ является составной частью современной информационной технологии. Методы синтеза речи находят широкое применение в информационно-справочных системах, в системах обучения с помошыо ЭВМ и т. д. Читатель, обратившись к этой книге, сможет познакомиться с различными методами моделирования процессов...
  • №304
  • 11,85 МБ
  • дата добавления неизвестна
  • изменен
Учебное пособие. — СПб.: Университет ИТМО, 2016. — 138 с. В учебном пособии рассматриваются методы автоматического распознавания речи. Материал пособия разбит на 16 разделов. Первые два раздела посвящены вопросам речеобразования и восприятия слуховой системой. В каждом разделе приведены краткие теоретические и/или практические сведения. Пособие может быть использовано при...
  • №305
  • 3,73 МБ
  • добавлен
  • изменен
Учебное пособие. — Санкт-Петербург: Университет ИТМО, 2017. — 152 с. В учебном пособии рассматриваются методы автоматического распознавания речи. Материал пособия разбит на 16 разделов. Первые два раздела посвящены вопросам речеобразования и восприятия слуховой системой. В каждом разделе приведены краткие теоретические и/или практические сведения. Пособие может быть использовано...
  • №306
  • 3,79 МБ
  • добавлен
  • изменен
М.: Связь, 1968. — 395 с. В монографии Дж. Фланагана, известного американского ученого, подробно рассматриваются широкий круг вопросов, связанных со свойствами речи как переносчика информации, основные ее параметры, проблемы анализа, синтеза и автоматического распознавания. Оцениваются характеристики каналов речевой связи. Большое внимание уделяется рассмотрению проблем...
  • №307
  • 9,21 МБ
  • добавлен
  • изменен
Пер. с англ. А. А. Пирогова. — М.: Связь, 1968. — 397 с. В монографии Дж. Фланагана, известного американского ученого, подробно рассматриваются широкий круг вопросов, связанных со свойствами речи как переносчика информации, основные ее параметры, проблемы анализа, синтеза и автоматического распознавания. Оцениваются характеристики каналов речевой связи. Большое внимание...
  • №308
  • 4,66 МБ
  • дата добавления неизвестна
  • изменен
М.: Радио и связь, 2000. — 456 с. Рассматриваются проблемы цифровой обработки и передачи речи в системах со сжатием, статистическим уплотнением, пакетной коммутацией, IР-телефонии, сетях АТМ и Frame Relay. Анализируются принципы построения, характеристики и особенности функционирования кодеров формы, вокодеров, гибридных кодеров, реализующих алгоритмы CELP, LD-CELP, ACELP, МВЕ,...
  • №309
  • 10,96 МБ
  • добавлен
  • изменен
В этом разделе нет файлов.

Комментарии

В этом разделе нет комментариев.