Question

7

Real time speech recognition using WebRTC, Node.js and speech recognition engine

rated 0 times [ 13] [ 6] / answers: 1 / hits: 17233 / 10 Years ago, sun, june 1, 2014, 12:00:00

A. What I am trying to implement.

A web application allowing real-time speech recognition inside web browser (like this).

B. Technologies I am currently thinking of using to achieve A.

JavaScript

Node.js

WebRTC

Microsoft Speech API or Pocketsphinx.js or something else (cannot use Web Speech API)

C. Very basic workflow

Web browser establishes connection to Node server (server acts as a signaling server and also serves static files)

Web browser acquires audio stream using getUserMedia() and sends user's voice to Node server

Node server passes audio stream being received to speech recognition engine for analysis

Speech recognition engine returns result to Node server

Node server sends text result back to initiating web browser

(Node server performs step 1 to 5 to process requests from other browsers)

D. Questions

Would Node.js be suitable to achieve C?

How could I pass received audio streams from my Node server to a speech recognition engine running separately from the server?

Could my speech recognition engine be running as another Node application (if I use Pocketsphinx)? So my Node server communicates to my Node speech recognition server.

Answers

Only authorized users can answer the question. Please sign in first, or register a free account.

jeniferjaliyahf

Add To Favorites

Follow

Total Points: 650

Total Questions: 104

Total Answers: 86

Location: Grenada

Member since Sun, Dec 20, 2020

3 Years ago

answered 10 Years ago dustin · Accepted Answer

Would Node.js be suitable to achieve C?

Yes, though there are no hard requirements for that. Some people are running servers with gstreamer, for example check

http://kaljurand.github.io/dictate.js/

node should be fine too.

How could I pass received audio streams from my Node server to a speech recognition engine running separately from the server?

There are many ways for node-to-node communication. One of them is http://socket.io. There are also plain sockets. The particular framework depends on your requirements for fault-tolerance and scalability.

Could my speech recognition engine be running as another Node application (if I use Pocketsphinx)? So my Node server communicates to my Node speech recognition server.

Yes, sure. You can create a node module to warp pocketsphinx API.

UPDATE: check this, it should be similar to what you need:

http://github.com/cmusphinx/node-pocketsphinx