Find Jobs
Hire Freelancers

Auotmatic speech recogition

$80-240 HKD

Closed
Posted 9 months ago

$80-240 HKD

Paid on delivery
Implement real-time phone speech recognition in project with Asterisk PBX and Kaldi/Vosk. I use Asterisk-specific module ([login to view URL]) to carry out ASR operations without compatibility issues. So far if anybody speaks anything while calling, it gives very clear text output. The problem I'm struggling is how to enable streaming ASR immediately during the conversation, i.e. since Dial() application of Asterisk dialplan gets executed. That's a subject of this job - create script (most likely, with some Asterisk REST Interface components) which works as follows: 1) since Dial() application starts running, real-time audio stream gets processed via ASR engine that is waiting for inputs inside of docker container (because I deploy Kaldi as a software built in Vosk server which is compatible with Asterisk, here is the out-of-box program implementation released on Github: [login to view URL]) 2) once conversation begins and voice streaming is detected, audial data flow heads the ASR powered by Vosk server (within the docker container); 3) while the data flow continues because of the ongoing conversation between people, the ASR generates transcribed outputs (files) that must be forwarded to an HTTP server to evaluate the contents of them (don't worry about this part, it's beyond this specific work , Certainly); And it should also be possible to hang up the call according to the transcript. 4) since conversation gets wrapped up, last phrases get processed via ASR to pass final outputs to the HTTP server mentioned above; 5) whenever inbound call occurs, same steps to be carried out: audial data capture - speech recognition within the docker container - text file through to the HTTP server. That all to be compliant with real time requirements, so data flow needs fast and seamless throughput before and after ASR processing, as a matter of course. While searching for any helpful content on the Internet, I encountered this Stack Overflow question [login to view URL] It makes clear the same purpose, just in other words than in my description. However, I demand implementation of the system design with Kaldi/Vosk rather then Google Speech. As for language to be used for development, I would leave some options. So, Python/Java/JS are acceptable to do that. if you are interested please hurry up to help me in this. you should have full capability to do it.
Project ID: 37082560

About the project

4 proposals
Remote project
Active 8 mos ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
4 freelancers are bidding on average $821 HKD for this job
User Avatar
Hi, Alaa R. I already checked your requirements - Auotmatic speech recogition. As a Senior AI Expert with over 5 years of experience, I have a proven track record of developing and implementing cutting-edge AI solutions for a wide range of industries. My expertise includes machine learning, natural language processing, computer vision, and deep learning, image processing, among other areas. I am highly skilled in programming languages such as Python, R, and Java, and have experience working with popular AI frameworks such as TensorFlow, Keras, and PyTorch. If you are interested in me, please click the chat button for me. Then our cooperation and results seem to have already been successful. Thanks. Justin.
$391 HKD in 3 days
0.0 (0 reviews)
0.0
0.0
User Avatar
⭐⭐⭐Hello Alaa R. Good morning!⭐⭐⭐ I am excited to submit my proposal for the "Auotmatic speech recogition" position. I have developed a strong set of skills that make me confident in my ability to deliver high-quality work to your project. My approach to any project is to first gain a deep understanding of the client's needs and requirements. I will work closely with you to ensure that I understand your project goals and objectives, and that I am able to deliver results that meet or exceed your expectations. In terms of technical skills, I have extensive experience in Software Architecture, Linux, VoIP, Asterisk PBX and Python. Please send a message to discuss more about this project. Appreciate your prompt response. ❤️Solomiia❤️
$391 HKD in 4 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Hello greetings As far as I understand you want to implement real time phone audio to text to your ASR project Let's have a chat and discuss details I am year+ experienced Developer who can help you with your project as i have worked plenty of times with asterisk apis etc. Kindly come to chat to work with me Thanks Best Regards ProXiMa
$2,000 HKD in 7 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of EGYPT
Beni Suef, Egypt
0.0
0
Payment method verified
Member since May 18, 2022

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.