Automatic speech recognition

$80-240 HKD

Closed

Posted

9 months ago

Paid on delivery

Implement real-time phone speech recognition in a project that uses Asterisk PBX and Kaldi/Vosk. I use an Asterisk-specific module ([login to view URL]) to carry out ASR operations without compatibility issues. So far, whenever anybody speaks during a call, it gives very clear text output. The problem I'm struggling with is how to enable streaming ASR immediately during the conversation, i.e. from the moment the Dial() application of the Asterisk dialplan is executed. That is the subject of this job: create a script (most likely with some Asterisk REST Interface components) that works as follows:

1) As soon as the Dial() application starts running, the real-time audio stream is processed by the ASR engine, which waits for input inside a Docker container. (I deploy Kaldi as part of the Vosk server, which is compatible with Asterisk; the out-of-the-box implementation is released on GitHub: [login to view URL].)
2) Once the conversation begins and voice is detected, the audio stream is sent to the ASR engine powered by the Vosk server (inside the Docker container).
3) While the conversation is ongoing, the ASR generates transcribed outputs (files) that must be forwarded to an HTTP server, which evaluates their contents. (Don't worry about the evaluation part; it is beyond the scope of this job.) It should also be possible to hang up the call based on the transcript.
4) When the conversation ends, the last phrases are processed by the ASR and the final outputs are passed to the HTTP server mentioned above.
5) Whenever an inbound call occurs, the same steps are carried out: audio capture, speech recognition inside the Docker container, then the text file forwarded to the HTTP server.

All of this must meet real-time requirements, so the data flow needs fast and seamless throughput before and after ASR processing.
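To make steps 3 and 4 concrete, here is a minimal Python sketch of how transcript messages might be handled once they come back from the Vosk server. It assumes the server replies with JSON messages in which interim results carry a `partial` key and finished utterances carry a `text` key. The endpoint URL, the hang-up phrases, and all function names are hypothetical placeholders, not part of the posting. The sketch only builds the HTTP request; actually sending it, and hanging up the channel through ARI, are left out.

```python
import json
import urllib.request

# Hypothetical values for illustration; neither appears in the job description.
TRANSCRIPT_URL = "http://localhost:8080/transcripts"
HANGUP_PHRASES = ("goodbye", "stop calling")


def extract_final_text(message: str):
    """Return the text of a finished utterance, or None for partial/empty results."""
    data = json.loads(message)
    text = data.get("text", "").strip()
    return text or None


def should_hang_up(text: str, phrases=HANGUP_PHRASES) -> bool:
    """Decide whether the transcript contains a phrase that should end the call."""
    lowered = text.lower()
    return any(phrase in lowered for phrase in phrases)


def build_transcript_request(call_id: str, text: str, url: str = TRANSCRIPT_URL):
    """Build (but do not send) the POST request that forwards one transcript."""
    body = json.dumps({"call_id": call_id, "text": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

In a real script, each message received from the ASR would go through `extract_final_text`, every non-empty result would be posted with `urllib.request.urlopen(build_transcript_request(...))`, and a positive `should_hang_up` would trigger the hang-up on the Asterisk side.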
While searching the Internet for helpful content, I came across this Stack Overflow question: [login to view URL]. It describes the same goal, just in different words. However, I require the system to be implemented with Kaldi/Vosk rather than Google Speech. As for the development language, I am leaving some options open: Python, Java, or JS are all acceptable. If you are interested, please get in touch soon. You should be fully capable of doing this work.
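For the Kaldi/Vosk side, a Python client would typically talk to the Vosk server over a WebSocket. The sketch below shows only the order of messages such a client might send. It assumes the sequence used by the sample clients shipped with the Vosk server: a JSON config message, then raw PCM chunks, then an end-of-stream marker. The 8000 Hz sample rate and the 3200-byte chunk size (200 ms of 16-bit mono audio) are assumptions for telephony audio, and `vosk_frames` is a hypothetical helper name. The WebSocket transport itself is omitted.

```python
import json


def vosk_frames(pcm: bytes, sample_rate: int = 8000, chunk_bytes: int = 3200):
    """Yield, in order, the messages a client would send to the Vosk server."""
    # First message: tell the recognizer which sample rate to expect.
    yield json.dumps({"config": {"sample_rate": sample_rate}})
    # Then the audio itself, split into fixed-size binary chunks.
    for start in range(0, len(pcm), chunk_bytes):
        yield pcm[start:start + chunk_bytes]
    # Last message: signal end of stream so the final result is flushed.
    yield json.dumps({"eof": 1})
```

With a live call, the audio would arrive continuously from Asterisk rather than as one `bytes` object, so the same three-phase sequence would be driven by the incoming stream instead of a loop over a buffer.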

Software Architecture

Project ID: 37082560

About the project

4 proposals

Remote project

Active 8 mos ago

4 freelancers are bidding on average $821 HKD for this job

@egoha2023

Hi, Alaa R. I have already checked your requirements for "Automatic speech recognition". As a Senior AI Expert with over 5 years of experience, I have a proven track record of developing and implementing cutting-edge AI solutions for a wide range of industries. My expertise includes machine learning, natural language processing, computer vision, deep learning, and image processing, among other areas. I am highly skilled in programming languages such as Python, R, and Java, and have experience working with popular AI frameworks such as TensorFlow, Keras, and PyTorch. If you are interested, please click the chat button to contact me. I am confident our cooperation will be successful. Thanks. Justin.

$391 HKD in 3 days

0.0

(0 reviews)

0.0

@solomiiahuliak

⭐⭐⭐Hello Alaa R. Good morning!⭐⭐⭐ I am excited to submit my proposal for the "Automatic speech recognition" project. I have developed a strong set of skills that make me confident in my ability to deliver high-quality work on your project. My approach to any project is to first gain a deep understanding of the client's needs and requirements. I will work closely with you to make sure I understand your project goals and objectives and can deliver results that meet or exceed your expectations. In terms of technical skills, I have extensive experience in Software Architecture, Linux, VoIP, Asterisk PBX, and Python. Please send a message to discuss this project further. I would appreciate your prompt response. ❤️Solomiia❤️

$391 HKD in 4 days

0.0

(0 reviews)

0.0

@ProXiMaSky

Hello, greetings. As far as I understand, you want to implement real-time phone audio-to-text in your ASR project. Let's have a chat and discuss the details. I am a developer with year+ experience who can help you with your project, as I have worked with Asterisk APIs many times. Kindly come to chat to work with me. Thanks. Best regards, ProXiMa

$2,000 HKD in 7 days