Clear Audio is an experimental VoIP noise-shaping project that uses machine learning and natural language processing. Even today, with technical resources at a premium and demand on cell towers growing, it is hard to carry high-bit-rate voice conversations without some mechanism for compression. A gigabyte is not always a gigabyte. Take a large file, say a Linux virtual machine saved as a VMDK, which can take about a gigabyte. Compress that VMDK into a tar.gz and it becomes smaller. What happened? The process is called lossless file compression: the compressor finds patterns in the data and eliminates the redundancy, and the original can still be restored exactly. The same idea carries over to sound, where the goal is to find patterns in the audio and eliminate what is redundant. In the domain of voice we also have posterior probability, a conditional probability that tells us how likely something is given what we have already observed.
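To make the compression point concrete, here is a minimal sketch using Python's standard zlib module (the same DEFLATE family that gzip uses). The two test buffers and the helper name compression_ratio are my own assumptions for illustration, not part of Clear Audio. It only shows that redundant data shrinks a lot while patternless data barely shrinks at all.

```python
import os
import zlib


def compression_ratio(data: bytes) -> float:
    """Return compressed size divided by original size (smaller means more savings)."""
    return len(zlib.compress(data, 9)) / len(data)


# Hypothetical stand-in for the empty blocks of a disk image: pure repetition.
redundant = b"\x00" * 100_000

# Random bytes contain no patterns for the compressor to exploit.
random_bytes = os.urandom(100_000)

# Lossless means the round trip gives back exactly what went in.
restored = zlib.decompress(zlib.compress(redundant, 9))
assert restored == redundant

print(f"redundant data ratio: {compression_ratio(redundant):.4f}")
print(f"random data ratio:    {compression_ratio(random_bytes):.4f}")
```

A voice codec would normally go further and use lossy compression, throwing away detail the ear will not miss, but the starting point is the same: find the patterns first.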
The book I Am a Strange Loop talks about how people do the same things over and over, which makes them predictable. The book Uncharted: Big Data as a Lens on Human Culture talks about the big data that comes out of the millions of books Google has digitized. Using the Google Ngram Viewer, people can see which words tend to show up together in a certain pattern. In the same way, we can project from other conversations which sounds are likely to be uttered next, each with a probability attached.
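The prediction idea above can be sketched with a tiny bigram model. This is only a toy under stated assumptions: it works on words rather than sounds, the three-sentence corpus is made up, and the function names train_bigrams and next_word_probabilities are hypothetical. It estimates the conditional probability of the next word given the previous one by simple counting.

```python
from collections import Counter, defaultdict


def train_bigrams(sentences):
    """Count how often each word follows each other word."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts


def next_word_probabilities(counts, prev):
    """Estimate P(next word | previous word) from the raw counts."""
    followers = counts.get(prev.lower())
    if not followers:
        return {}  # word never seen with a follower, so nothing to predict
    total = sum(followers.values())
    return {word: n / total for word, n in followers.items()}


# Made-up phone-call phrases, purely for illustration.
corpus = [
    "can you hear me now",
    "can you hear me",
    "can you call me back",
]

model = train_bigrams(corpus)
probs = next_word_probabilities(model, "you")

# Print the candidates from most to least likely.
for word, p in sorted(probs.items(), key=lambda item: -item[1]):
    print(f"P({word} | you) = {p:.2f}")
```

In this toy corpus "hear" follows "you" twice and "call" once, so the model leans toward "hear". A real sound anticipator would need far more data and would work on phonemes or audio frames instead of whole words.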
What this project entails is a voice-to-text translator and a sound anticipator.
Books I've been reading for this project:
Uncharted: Big Data as a Lens on Human Culture by Aiden and Michel
Surfaces and Essences by Hofstadter and Sander
I Am a Strange Loop by Hofstadter
Multirate Signal Processing for Communication Systems by Harris
Software Engineering for Embedded Systems by Oshana and Kraeling