Sahar issues

Results 15 issues of


                                            Sahar

Optimize inference using JAX

Follow [these steps](https://colab.research.google.com/github/borisdayma/dalle-mini/blob/main/dev/inference/inference_pipeline.ipynb#scrollTo=lYyOyZGH9dpb) and modify the backend accordingly.

enhancement

Generate and show rankings with CLIP

Add the backend the ability to generate rankings using CLIP, then send these back along with the generated images so the user can see the score each image got. This...

enhancement

Add support for a light-mode command line

* Should look roughly as in the attached screenshot * Should be defined by a boolean parameter isDarkMode ![image](https://user-images.githubusercontent.com/6180201/124025196-75ffaa00-d9f0-11eb-8976-f7ae9459e816.png)

feature request

Add support for AWS transcribe

see https://docs.aws.amazon.com/transcribe/latest/dg/streaming.html https://aws.amazon.com/blogs/developer/transcribe-streaming-sdk-for-python-preview/

feature request

Add support for stop words being sent from client

Can be implemented either on the client-side or backend. If in the backend, consider using this code in the `listen_print_loop` method. ``` if re.search(r"\b(exit|quit)\b", transcript, re.I): print("Exiting..") break ```

feature request

Add support for client-side transcription using DeepSpeech

Consider using react-speech-recognition as well

feature request

Scaffold prompts - combining consecutive prompts for a smoother experience

enhancement

Allow users to select a seed image for Stable Diffusion generation

enhancement

Dockerized the fullstack app

+ ensure it works on Windows and Mac

Use pyannote-audio for speaker diarization

Logic will be to combine Whisper + pyannote.audio based on timestamps to output something along the lines of: ``` Person A: Hi Person B: Hello, how are you Person A:...

enhancement

help wanted