RestartDK/signscribe

SignScribe

SignScribe Logo

1. What is this?

SignScribe is an AI solution that provides real-time sign language translation. It lets people who are deaf or hard of hearing (roughly 5% of the global population) follow live events and online content by generating sign language images as the speaker talks.

3. Gemini models and Pipecat

We used Pipecat to orchestrate the entire pipeline, which is structured as follows:

  1. Pipecat listens to the speaker's audio and transcribes it to text using the Google Cloud Speech-to-Text API.
  2. The transcribed text is passed to Gemini, which rewrites it using ASL grammar.
  3. The ASL-grammar text is passed to the Gemini image generation API to generate the sign language images.
  4. The sign language images are passed back through the Pipecat pipeline and displayed to the user.
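
The four steps above can be sketched as a chain of async stages. This is a minimal illustration under stated assumptions, not the project's actual code: it does not call Pipecat, Cloud Speech-to-Text, or Gemini. The names `run_pipeline`, `SignFrame`, `demo`, and the `fake_*` stand-ins are hypothetical and only show how data moves from one step to the next.

```python
import asyncio
from dataclasses import dataclass
from typing import Awaitable, Callable, Iterable, List


@dataclass
class SignFrame:
    """One displayable result: the ASL-grammar text and its image bytes."""
    gloss: str
    image: bytes


async def run_pipeline(
    audio_chunks: Iterable[bytes],
    transcribe: Callable[[bytes], Awaitable[str]],
    to_asl: Callable[[str], Awaitable[str]],
    generate_image: Callable[[str], Awaitable[bytes]],
    display: Callable[[SignFrame], Awaitable[None]],
) -> None:
    """Push each audio chunk through the four steps, in order."""
    for chunk in audio_chunks:
        text = await transcribe(chunk)        # step 1: audio -> text
        if not text.strip():
            continue                          # skip silence / empty transcripts
        gloss = await to_asl(text)            # step 2: text -> ASL grammar
        image = await generate_image(gloss)   # step 3: ASL grammar -> image
        await display(SignFrame(gloss, image))  # step 4: show to the user


# Hypothetical stand-ins so the sketch runs without any cloud credentials.
async def fake_transcribe(chunk: bytes) -> str:
    # Assumption: pretend the audio bytes are already UTF-8 text.
    return chunk.decode("utf-8")


async def fake_to_asl(text: str) -> str:
    # Placeholder only: upper-casing is NOT a real ASL translation.
    return text.upper()


async def fake_generate_image(gloss: str) -> bytes:
    # Placeholder only: returns a tag instead of real image data.
    return f"<image:{gloss}>".encode("utf-8")


def demo() -> List[SignFrame]:
    """Run the pipeline on three fake chunks and collect what was displayed."""
    shown: List[SignFrame] = []

    async def display(frame: SignFrame) -> None:
        shown.append(frame)

    asyncio.run(
        run_pipeline(
            [b"hello world", b"  ", b"thank you"],
            fake_transcribe,
            fake_to_asl,
            fake_generate_image,
            display,
        )
    )
    return shown
```

Calling `demo()` returns the frames that would have been displayed. In a real deployment each `fake_*` function would be replaced by a call to the corresponding service, and the display step would push the image to the web client.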

4. Tools used

  1. Pipecat
  2. Gemini and Google Cloud
    • Cloud Speech-to-Text API
    • Generative Language API

5. What we did new during the hackathon

We started this project from scratch. We got the idea on our way to the hackathon this morning, so all the code was written today. Both technologies were new to us, so we had to learn them on the fly!

6. Feedback

This was our first time using GCP, and it took a while to get everything set up.

About

Real-time voice-to-sign-language web app that helps deaf people communicate with others in day-to-day life.
