Not sure why it's not working
Thanks for this app, I've been looking for something like it for a long time! I still can't get it working and I'm looking for advice.
What I've done so far:
- I installed it through Flatpak.
- I'm on AMD with a basic GPU
*-display
description: VGA compatible controller
product: Lucienne [1002:164C]
vendor: Advanced Micro Devices, Inc. [AMD/ATI] [1002]
physical id: 0
bus info: pci@0000:05:00.0
version: c2
width: 64 bits
clock: 33MHz
capabilities: pm pciexpress msi msix vga_controller bus_master cap_list
configuration: driver=amdgpu latency=0
resources: iomemory:fc0-fbf iomemory:fc0-fbf irq:47 memory:fce0000000-fcefffffff memory:fcf0000000-fcf01fffff ioport:1000(size=256) memory:d0400000-d047ffff
I did install the Speech Note AMD add-on.
Question: Is it worth it or should I uninstall it?
- I installed the English (FasterWhisper Large-v3 Turbo) / en model
But now when I press Listen, I get a brief animation of the sound wave icon in the bottom left and then nothing; no sound is picked up...
https://github.com/user-attachments/assets/d901c67c-dd37-4162-9e76-d96e26527773
Thanks for your help 🙏
Hey. Thanks for the report.
Could you please start the app with the --verbose command-line option and paste the output here?
flatpak run net.mkiol.SpeechNote --verbose
I can see that the UI is full of color-related bugs. Did you enable "Custom graphical style" in the settings?
Thanks for looking into this @mkiol 🙂
> I can see that the UI is full of color-related bugs. Did you enable "Custom graphical style" in the settings?
Yes, I did. I thought that was the way to enable dark mode. Once I disabled it, the app started working correctly!
- I still see the warning asking if I want to enable hardware acceleration...?
I installed it, as mentioned in my first post 👆
- Do you think I should keep it installed even if my GPU is basic?
Here's the output after I disabled the custom graphical style:
flatpak run net.mkiol.SpeechNote --verbose
QSocketNotifier: Can only be used with threads started with QThread
qt.qpa.qgnomeplatform: Could not find color scheme ""
[I] 03:02:19.882953039.882 0x7fcf3df6fd00 init:49 - logging to stderr enabled
[D] 03:02:19.883065203.883 0x7fcf3df6fd00 () - app version: 4.7.1
[D] 03:02:19.883071489.883 0x7fcf3df6fd00 () - required addon version: 1.3
[D] 03:02:19.883592640.883 0x7fcf3df6fd00 parse_cpuinfo:121 - cpu flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba ibrs ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr rdpru wbnoinvd cppc arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif v_spec_ctrl umip rdpid overflow_recov succor smca
[D] 03:02:19.883895959.883 0x7fcf3df6fd00 parse_cpuinfo:129 - cpuinfo: processor-count=12, flags=[avx, avx2, fma, f16c, sse4.1, bmi2, ]
[D] 03:02:19.883979418.883 0x7fcf3df6fd00 () - translation: "en_US"
[D] 03:02:19.883994294.883 0x7fcf3df6fd00 () - starting standalone app
[D] 03:02:19.884949087.884 0x7fcf3df6fd00 () - dbus app service registration successul
[D] 03:02:19.885191993.885 0x7fcf3df6fd00 () - app: net.mkiol dsnote
[D] 03:02:19.885206241.885 0x7fcf3df6fd00 () - config location: "/home/bangbuild/.var/app/net.mkiol.SpeechNote/config"
[D] 03:02:19.885215879.885 0x7fcf3df6fd00 () - data location: "/home/bangbuild/.var/app/net.mkiol.SpeechNote/data/net.mkiol/dsnote"
[D] 03:02:19.885223491.885 0x7fcf3df6fd00 () - cache location: "/home/bangbuild/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote"
[D] 03:02:19.885230126.885 0x7fcf3df6fd00 () - settings file: "/home/bangbuild/.var/app/net.mkiol.SpeechNote/config/net.mkiol/dsnote/settings.conf"
[D] 03:02:19.885237739.885 0x7fcf3df6fd00 () - platform: "wayland"
[D] 03:02:19.886091402.886 0x7fcf3df6fd00 () - flatpak addon detected: amd "1.3.0"
[D] 03:02:19.886111097.886 0x7fcf3df6fd00 () - addon-flags 2
[D] 03:02:19.886175281.886 0x7fcf3df6fd00 () - amd gpu detected
[D] 03:02:19.886180030.886 0x7fcf3df6fd00 () - system-flags: 4
[D] 03:02:19.886194976.886 0x7fcf3df6fd00 () - enforcing num threads: 0
[D] 03:02:20.147651133.147 0x7fcf3df6fd00 () - starting service: app-standalone
[D] 03:02:20.152348899.152 0x7fcf3df6fd00 () - mbrola dir: "/app/bin"
[D] 03:02:20.152387032.152 0x7fcf3df6fd00 () - espeak dir: "/app/bin"
[D] 03:02:20.152572879.152 0x7fcf29ac9680 loop:88 - py executor loop started
[D] 03:02:20.152585729.152 0x7fcf29ac9680 set_env:84 - set env: PYTHONIOENCODING = utf-8
[D] 03:02:20.152593272.152 0x7fcf29ac9680 set_env:84 - set env: HF_HUB_DISABLE_TELEMETRY = 1
[D] 03:02:20.152598091.152 0x7fcf29ac9680 set_env:84 - set env: HF_HUB_OFFLINE = 1
[D] 03:02:20.152603399.152 0x7fcf29ac9680 set_env:84 - set env: HF_HUB_LOCAL_DIR_AUTO_SYMLINK_THRESHOLD = 100000000000
[D] 03:02:20.152643907.152 0x7fcf29ac9680 set_env:84 - set env: HF_HUB_CACHE = /home/bangbuild/.var/app/net.mkiol.SpeechNote/cache/net.mkiol/dsnote
[D] 03:02:20.155649921.155 0x7fcf3df6fd00 () - module already unpacked: "rhvoicedata"
[D] 03:02:20.155757476.155 0x7fcf3df6fd00 () - module already unpacked: "rhvoiceconfig"
[D] 03:02:20.159113742.159 0x7fcf3df6fd00 () - module already unpacked: "espeakdata"
[D] 03:02:20.159428025.159 0x7fcf3df6fd00 () - default stt model not found: "en_fasterwhisper_large3_turbo"
[D] 03:02:20.159439269.159 0x7fcf3df6fd00 () - default tts model not found: "en"
[D] 03:02:20.159445415.159 0x7fcf3df6fd00 () - default mnt lang not found: "en"
[D] 03:02:20.159450723.159 0x7fcf3df6fd00 () - new default mnt lang: "en"
[D] 03:02:20.159460361.159 0x7fcf3df6fd00 () - service refresh status, new state: busy
[D] 03:02:20.159466018.159 0x7fcf3df6fd00 () - service state changed: unknown => busy
[D] 03:02:20.159473072.159 0x7fcf3df6fd00 () - delaying features availability
[D] 03:02:20.161361775.161 0x7fcf3df6fd00 () - runtime prefix: "/app"
[D] 03:02:20.161637576.161 0x7fcf3df6fd00 () - available styles: ("Default", "Fusion", "Imagine", "Material", "org.kde.breeze", "org.kde.desktop", "Plasma", "Universal")
[D] 03:02:20.161742966.161 0x7fcf3df6fd00 () - style paths: ("/usr/lib/qml/QtQuick/Controls.2")
[D] 03:02:20.161763988.161 0x7fcf3df6fd00 () - import paths: ("/usr/lib/qml", "/app/bin", "qrc:/qt-project.org/imports")
[D] 03:02:20.161771531.161 0x7fcf3df6fd00 () - library paths: ("/usr/share/runtime/lib/plugins", "/usr/lib/plugins", "/app/bin")
[D] 03:02:20.161783823.161 0x7fcf3df6fd00 () - using auto qt style
[D] 03:02:20.161790527.161 0x7fcf3df6fd00 () - XDG_CURRENT_DESKTOP: GNOME
[D] 03:02:20.161797092.161 0x7fcf3df6fd00 () - switching to style: "org.kde.breeze"
[D] 03:02:20.162022957.162 0x7fcf3df6fd00 () - desktop file: "net.mkiol.SpeechNote"
[D] 03:02:20.163291195.163 0x7fcf2a2ca680 () - config version: 97 97
[D] 03:02:20.166339882.166 0x7fcf29ac9680 libs_availability:64 - checking: torch cuda
[D] 03:02:20.200419293.200 0x7fcf2a2ca680 () - models changed
[D] 03:02:22.695884270.695 0x7fcf3df6fd00 state_pa_callback:30 - pa authorizing
[D] 03:02:22.696148896.696 0x7fcf3df6fd00 state_pa_callback:33 - pa setting name
[D] 03:02:22.700232344.700 0x7fcf3df6fd00 state_pa_callback:36 - pa ready
[D] 03:02:22.700618283.700 0x7fcc1d5ff680 source_info_pa_callback:206 - pa source: alsa_output.pci-0000_05_00.6.analog-stereo.monitor Monitor of Family 17h/19h/1ah HD Audio Controller Analog Stereo
[D] 03:02:22.700677159.700 0x7fcc1d5ff680 source_info_pa_callback:206 - pa source: alsa_input.pci-0000_05_00.6.analog-stereo Family 17h/19h/1ah HD Audio Controller Analog Stereo
[D] 03:02:22.700710543.700 0x7fcc1d5ff680 source_info_pa_callback:206 - pa source: alsa_input.usb-Alpha_Imaging_Tech._Corp._Razer_Kiyo-02.analog-stereo Gaming Webcam [Kiyo] Analog Stereo
[D] 03:02:22.743382737.743 0x7fcf3df6fd00 () - starting app: app-standalone
[D] 03:02:22.743976174.743 0x7fcf3df6fd00 () - app service state: unknown => busy
[D] 03:02:22.744001875.744 0x7fcf3df6fd00 () - app busy: false => true
[D] 03:02:22.744011024.744 0x7fcf3df6fd00 () - app connected: false = > true
[W] 03:02:22.744245689.744 0x7fcf3df6fd00 () - hot keys are supported only under x11
[W] 03:02:22.783023215.783 0x7fcf3df6fd00 ():36 - file:///usr/lib/qml/QtQuick/Controls.2/org.kde.breeze/ScrollView.qml:36:25: QML ScrollBar: Binding loop detected for property "x"
[W] 03:02:22.795479518.795 0x7fcf3df6fd00 ():36 - file:///usr/lib/qml/QtQuick/Controls.2/org.kde.breeze/ScrollView.qml:36:25: QML ScrollBar: Binding loop detected for property "x"
[W] 03:02:22.805784238.805 0x7fcf3df6fd00 ():36 - file:///usr/lib/qml/QtQuick/Controls.2/org.kde.breeze/ScrollView.qml:36:25: QML ScrollBar: Binding loop detected for property "x"
[W] 03:02:22.823683368.823 0x7fcf3df6fd00 ():463 - qrc:/qml/main.qml:463:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
[W] 03:02:22.823727088.823 0x7fcf3df6fd00 ():454 - qrc:/qml/main.qml:454:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
[W] 03:02:22.838592896.838 0x7fcf3df6fd00 virtual QVariant ModelSource::item(int) const:81 - ModelSource: Invalid role -1 "color"
[W] 03:02:22.838670210.838 0x7fcf3df6fd00 virtual QVariant ModelSource::item(int) const:81 - ModelSource: Invalid role -1 "color"
[W] 03:02:22.847154810.847 0x7fcf3df6fd00 ():209 - qrc:/qml/ScrollTextArea.qml:209:17: QML MenuItem: Binding loop detected for property "__reserveSpaceForIcon"
[W] 03:02:22.847184632.847 0x7fcf3df6fd00 ():209 - qrc:/qml/ScrollTextArea.qml:209:17: QML MenuItem: Binding loop detected for property "__reserveSpaceForIcon"
[W] 03:02:22.856018995.856 0x7fcf3df6fd00 ():24 - qrc:/qml/Notepad.qml:24:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
[W] 03:02:22.863206164.863 0x7fcf3df6fd00 ():209 - qrc:/qml/ScrollTextArea.qml:209:17: QML MenuItem: Binding loop detected for property "__reserveSpaceForIcon"
[W] 03:02:22.863237662.863 0x7fcf3df6fd00 ():209 - qrc:/qml/ScrollTextArea.qml:209:17: QML MenuItem: Binding loop detected for property "__reserveSpaceForIcon"
[W] 03:02:22.881050120.881 0x7fcf3df6fd00 ():209 - qrc:/qml/ScrollTextArea.qml:209:17: QML MenuItem: Binding loop detected for property "__reserveSpaceForIcon"
[W] 03:02:22.881081549.881 0x7fcf3df6fd00 ():209 - qrc:/qml/ScrollTextArea.qml:209:17: QML MenuItem: Binding loop detected for property "__reserveSpaceForIcon"
[W] 03:02:22.889297681.889 0x7fcf3df6fd00 ():30 - qrc:/qml/Translator.qml:30:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }
[W] 03:02:22.915429427.915 0x7fcf3df6fd00 ():148 - qrc:/qml/MainToolBar.qml:148:29: QML MenuItem: Binding loop detected for property "__reserveSpaceForIcon"
[W] 03:02:22.915460925.915 0x7fcf3df6fd00 ():148 - qrc:/qml/MainToolBar.qml:148:29: QML MenuItem: Binding loop detected for property "__reserveSpaceForIcon"
[W] 03:02:22.917440211.917 0x7fcf3df6fd00 ():76 - qrc:/qml/MainToolBar.qml:76:29: QML MenuItem: Binding loop detected for property "__reserveSpaceForIcon"
[W] 03:02:22.917470731.917 0x7fcf3df6fd00 ():76 - qrc:/qml/MainToolBar.qml:76:29: QML MenuItem: Binding loop detected for property "__reserveSpaceForIcon"
[D] 03:02:22.923410406.923 0x7fcf3df6fd00 onCompleted:180 - default font pixel size: 16
[D] 03:02:22.930444205.930 0x7fcf3df6fd00 () - default tts model not found: "en"
[D] 03:02:22.930476192.930 0x7fcf3df6fd00 () - default mnt lang not found: "en"
[D] 03:02:22.930483246.930 0x7fcf3df6fd00 () - new default mnt lang: "en"
[D] 03:02:22.930492744.930 0x7fcf3df6fd00 () - service refresh status, new state: busy
[D] 03:02:22.930509576.930 0x7fcf3df6fd00 () - service refresh status, new state: busy
[D] 03:02:23.106317258.106 0x7fcf3df6fd00 () - trying features availability update: false
[D] 03:02:23.106550526.106 0x7fcf3df6fd00 () - stt models changed
[D] 03:02:23.108801144.108 0x7fcf3df6fd00 () - update listen
[D] 03:02:23.108839067.108 0x7fcf3df6fd00 () - app stt configured: false => true
[D] 03:02:23.126877180.126 0x7fcf3df6fd00 () - app active stt model: "" => "en_fasterwhisper_large3_turbo"
[D] 03:02:23.127114848.127 0x7fcf3df6fd00 () - update listen
[D] 03:02:23.127137267.127 0x7fcf3df6fd00 () - tts models changed
[D] 03:02:23.127313126.127 0x7fcf3df6fd00 () - update listen
[W] 03:02:23.127326396.127 0x7fcf3df6fd00 () - no available tts models for in mnt
[W] 03:02:23.127333589.127 0x7fcf3df6fd00 () - no available tts models for out mnt
[D] 03:02:23.127341272.127 0x7fcf3df6fd00 () - ttt models changed
[D] 03:02:23.129955550.129 0x7fcf3df6fd00 () - mnt langs changed
[D] 03:02:23.130279052.130 0x7fcf3df6fd00 () - update listen
[W] 03:02:23.130291903.130 0x7fcf3df6fd00 () - no available mnt langs
[W] 03:02:23.130297071.130 0x7fcf3df6fd00 () - no available mnt out langs
[D] 03:02:23.133253359.133 0x7fcf3df6fd00 () - [dbus app] State called
[D] 03:02:23.231245274.231 0x7fcf3df6fd00 () - [dbus app] ActiveSttModel called
[D] 03:02:23.707993707.707 0x7fcf3df6fd00 () - [dbus app] State called
[D] 03:02:23.724287199.724 0x7fcf3df6fd00 () - trying features availability update: false
[D] 03:02:24.240836674.240 0x7fcf3df6fd00 () - [dbus app] State called
[D] 03:02:24.727908265.727 0x7fcf3df6fd00 () - trying features availability update: false
[D] 03:02:24.744631906.744 0x7fcf3df6fd00 () - [dbus app] State called
[D] 03:02:25.245433834.245 0x7fcf3df6fd00 () - [dbus app] State called
[D] 03:02:25.731963181.731 0x7fcf3df6fd00 () - trying features availability update: false
[D] 03:02:25.749337878.749 0x7fcf3df6fd00 () - [dbus app] State called
[D] 03:02:26.248388340.248 0x7fcf3df6fd00 () - [dbus app] State called
[D] 03:02:26.695046735.695 0x7fcf29ac9680 libs_availability:76 - torch hip version: 6.2.41133-dd7f95766
[D] 03:02:26.695070970.695 0x7fcf29ac9680 libs_availability:86 - checking: coqui tts
[D] 03:02:26.696980694.696 0x7fcf29ac9680 libs_availability:94 - checking: whisperspeech tts
[D] 03:02:26.697730435.697 0x7fcf29ac9680 libs_availability:102 - checking: faster-whisper
[D] 03:02:26.726923019.726 0x7fcf3df6fd00 () - trying features availability update: false
[D] 03:02:26.743427640.743 0x7fcf3df6fd00 () - [dbus app] State called
[D] 03:02:27.243621046.243 0x7fcf3df6fd00 () - [dbus app] State called
[D] 03:02:27.662277500.662 0x7fcf29ac9680 libs_availability:110 - checking: ctranslate2-cuda
[D] 03:02:27.662308021.662 0x7fcf29ac9680 libs_availability:112 - ctranslate2 version: 4.3.1
[D] 03:02:27.662414737.662 0x7fcf29ac9680 libs_availability:117 - ctranslate2-cuda check py error: ValueError: This CTranslate2 package was not compiled with CUDA support
[D] 03:02:27.662421442.662 0x7fcf29ac9680 libs_availability:121 - checking: transformers
[D] 03:02:27.662430591.662 0x7fcf29ac9680 libs_availability:123 - checking: accelerate
[D] 03:02:27.726851886.726 0x7fcf3df6fd00 () - trying features availability update: false
[D] 03:02:27.743399388.743 0x7fcf3df6fd00 () - [dbus app] State called
[D] 03:02:27.758873369.758 0x7fcf29ac9680 libs_availability:131 - checking: unikud
[D] 03:02:27.759136738.759 0x7fcf29ac9680 libs_availability:142 - checking: mimic3 tts
[D] 03:02:28.243499766.243 0x7fcf3df6fd00 () - [dbus app] State called
[D] 03:02:28.732650865.732 0x7fcf3df6fd00 () - trying features availability update: false
[D] 03:02:28.850611640.850 0x7fcf29ac9680 libs_availability:150 - checking: gruut
[D] 03:02:28.850642649.850 0x7fcf29ac9680 libs_availability:154 - checking: gruut-de
[D] 03:02:28.851323247.851 0x7fcf29ac9680 libs_availability:162 - checking: gruut-es
[D] 03:02:28.851898874.851 0x7fcf29ac9680 libs_availability:170 - checking: gruut-fr
[D] 03:02:28.852297036.852 0x7fcf29ac9680 libs_availability:178 - checking: gruut-it
[D] 03:02:28.853043005.853 0x7fcf29ac9680 libs_availability:186 - checking: gruut-ru
[D] 03:02:28.853583502.853 0x7fcf29ac9680 libs_availability:194 - checking: gruut-fa
[D] 03:02:28.854137758.854 0x7fcf29ac9680 libs_availability:202 - checking: gruut-sw
[D] 03:02:28.854537387.854 0x7fcf29ac9680 libs_availability:210 - checking: gruut-nl
[D] 03:02:28.855033395.855 0x7fcf29ac9680 libs_availability:221 - checking: mecab
[D] 03:02:28.863230671.863 0x7fcf29ac9680 libs_availability:223 - checking: unidic-lite
[D] 03:02:28.864581251.864 0x7fcf29ac9680 libs_availability:230 - py libs availability: [coqui-tts=true, faster-whisper=true, ctranslate2-cuda=false, mimic3-tts=true, whisperspeech-tts=true, transformers=true, unikud=true, gruut_de=true, gruut_es=true, gruut_fa=true, gruut_fr=true, gruut_nl=true, gruut_it=true, gruut_ru=true, gruut_sw=true, mecab=true, torch-cuda=false, torch-hip=true]
[D] 03:02:29.733585647.733 0x7fcf3df6fd00 () - trying features availability update: true
[D] 03:02:29.733626154.733 0x7fcf3df6fd00 () - features availability ready
[W] 03:02:29.734000221.734 0x7fcf3df6fd00 has_lib:1278 - failed to open libcudart.so: libcudart.so: cannot open shared object file: No such file or directory
[W] 03:02:29.734143325.734 0x7fcf3df6fd00 has_lib:1278 - failed to open libcudnn.so: libcudnn.so: cannot open shared object file: No such file or directory
[W] 03:02:29.734314015.734 0x7fcf3df6fd00 has_lib:1278 - failed to open libcudnn.so.9: libcudnn.so.9: cannot open shared object file: No such file or directory
[W] 03:02:29.734482052.734 0x7fcf3df6fd00 has_lib:1278 - failed to open libcudnn.so.8: libcudnn.so.8: cannot open shared object file: No such file or directory
[W] 03:02:29.815329117.815 0x7fcf3df6fd00 try_open_lib:61 - failed to open whisper lib: libwhisper-cublas.so: cannot open shared object file: No such file or directory
[W] 03:02:29.815470893.815 0x7fcf3df6fd00 try_open_lib:61 - failed to open whisper lib: libwhisper-cublas.so: cannot open shared object file: No such file or directory
[D] 03:02:30.259501042.259 0x7fcf3df6fd00 () - updating models using availability: tts_coqui, tts_mimic3, tts_mimic3_de, tts_mimic3_es, tts_mimic3_fr, tts_mimic3_it, tts_mimic3_ru, tts_mimic3_sw, tts_mimic3_fa, tts_mimic3_nl, tts_rhvoice, tts_whisperspeech, stt_fasterwhisper, stt_ds, stt_vosk, stt_whispercpp, mnt_bergamot, ttt_hftc option_r,
[D] 03:02:30.259533029.259 0x7fcf3df6fd00 () - updating model using availability internal
[D] 03:02:30.261576709.261 0x7fcf3df6fd00 () - default tts model not found: "en"
[D] 03:02:30.261601502.261 0x7fcf3df6fd00 () - default mnt lang not found: "en"
[D] 03:02:30.261609115.261 0x7fcf3df6fd00 () - new default mnt lang: "en"
[D] 03:02:30.261651159.261 0x7fcf3df6fd00 () - service refresh status, new state: idle
[D] 03:02:30.261670155.261 0x7fcf3df6fd00 () - service state changed: busy => idle
[D] 03:02:30.261699698.261 0x7fcf3df6fd00 () - scan cuda: true
[D] 03:02:30.261705984.261 0x7fcf3df6fd00 () - scan hip: true
[D] 03:02:30.261711292.261 0x7fcf3df6fd00 () - scan opencl: true
[D] 03:02:30.261716530.261 0x7fcf3df6fd00 () - scan opencl_legacy: false
[D] 03:02:30.261723095.261 0x7fcf3df6fd00 () - scan openvino: true
[D] 03:02:30.261728822.261 0x7fcf3df6fd00 () - scan openvino_gpu: false
[D] 03:02:30.261735037.261 0x7fcf3df6fd00 () - scan vulkan: true
[D] 03:02:30.261739926.261 0x7fcf3df6fd00 () - scan vulkan_igpu: true
[D] 03:02:30.261746072.261 0x7fcf3df6fd00 () - scan vulkan_cpu: false
[D] 03:02:30.261752218.261 0x7fcf3df6fd00 () - hw feature flags: stt-whispercpp-hip, stt-whispercpp-openvino, stt-whispercpp-opencl, stt-whispercpp-vulkan, tts-coqui-hip tts-whisperspeech-hip
[D] 03:02:30.261773310.261 0x7fcf3df6fd00 add_cuda_devices:809 - scanning for cuda devices
[D] 03:02:30.261777570.261 0x7fcf3df6fd00 add_cuda_dev_devices:724 - scanning for cuda devices
[W] 03:02:30.261996102.261 0x7fcf3df6fd00 cuda_dev_api:328 - failed to open cuda lib: libcuda.so: cannot open shared object file: No such file or directory
[W] 03:02:30.262029486.262 0x7fcf3df6fd00 add_cuda_devices:816 - failed to open cuda lib
[D] 03:02:30.262042686.262 0x7fcf3df6fd00 add_vulkan_devices:1080 - scanning for vulkan devices
[D] 03:02:30.348147290.348 0x7fcf3df6fd00 add_vulkan_devices:1116 - vulkan extensions: VK_KHR_device_group_creation,VK_KHR_display,VK_KHR_external_fence_capabilities,VK_KHR_external_memory_capabilities,VK_KHR_external_semaphore_capabilities,VK_KHR_get_display_properties2,VK_KHR_get_physical_device_properties2,VK_KHR_get_surface_capabilities2,VK_KHR_surface,VK_KHR_surface_protected_capabilities,VK_KHR_wayland_surface,VK_KHR_xcb_surface,VK_KHR_xlib_surface,VK_EXT_acquire_drm_display,VK_EXT_acquire_xlib_display,VK_EXT_debug_report,VK_EXT_debug_utils,VK_EXT_direct_mode_display,VK_EXT_display_surface_counter,VK_EXT_headless_surface,VK_EXT_surface_maintenance1,VK_EXT_swapchain_colorspace,VK_KHR_portability_enumeration,VK_LUNARG_direct_driver_loading,
[D] 03:02:30.357987709.357 0x7fcf3df6fd00 add_vulkan_devices:1176 - vulkan device: 0 id=5708, name=AMD Radeon Graphics (RADV RENOIR), type=integrated-gpu
[D] 03:02:30.358011594.358 0x7fcf3df6fd00 add_vulkan_devices:1176 - vulkan device: 1 id=0, name=llvmpipe (LLVM 17.0.6, 256 bits), type=cpu
[D] 03:02:30.358016343.358 0x7fcf3df6fd00 add_vulkan_devices:1206 - vulkan cpu device => skipping
[D] 03:02:30.359860278.359 0x7fcf3df6fd00 add_hip_devices:832 - scanning for hip devices
[D] 03:02:30.359931306.359 0x7fcf3df6fd00 add_hip_devices:842 - hip version: driver=60241134, runtime=60241134
[D] 03:02:30.359937941.359 0x7fcf3df6fd00 add_hip_devices:851 - hip number of devices: 1
[D] 03:02:30.359946182.359 0x7fcf3df6fd00 add_hip_devices:860 - hip device: 0, name=AMD Radeon Graphics, gcn-arch=912, gcn-arch-name=gfx90c:xnack+
[D] 03:02:30.359997026.359 0x7fcf3df6fd00 add_openvino_devices:1005 - scanning for openvino devices
[D] 03:02:30.361593585.361 0x7fcf3df6fd00 add_openvino_devices:1017 - openvino version: build=2024.1.0-15008-f4afc983258-releases/2024/1, description=OpenVINO Runtime
[D] 03:02:30.703594397.703 0x7fcf3df6fd00 add_openvino_devices:1044 - openvino number of devices: 2
[D] 03:02:30.703641050.703 0x7fcf3df6fd00 add_openvino_devices:1050 - openvino device: 0, name=CPU, full-name=AMD Ryzen 5 5500U with Radeon Graphics
[D] 03:02:30.703715920.703 0x7fcf3df6fd00 add_openvino_devices:1050 - openvino device: 1, name=GPU, full-name=gfx90c:xnack+ (iGPU)
[D] 03:02:30.703722205.703 0x7fcf3df6fd00 add_openvino_devices:1059 - openvino gpu device => skipping
[D] 03:02:30.704453717.704 0x7fcf3df6fd00 add_opencl_devices:872 - scanning for opencl devices
[D] 03:02:30.704764229.704 0x7fcf3df6fd00 add_opencl_devices:889 - opencl number of platforms: 2
[D] 03:02:30.704776242.704 0x7fcf3df6fd00 add_opencl_devices:914 - opencl platform: 0, name=AMD Accelerated Parallel Processing, vendor=Advanced Micro Devices, Inc.
[D] 03:02:30.704787137.704 0x7fcf3df6fd00 add_opencl_devices:928 - opencl number of devices: 1
[D] 03:02:30.704794679.704 0x7fcf3df6fd00 add_opencl_devices:952 - opencl device: 0, platform name=AMD Accelerated Parallel Processing, device name=gfx90c:xnack+, types=[GPU, ]
[D] 03:02:30.704804108.704 0x7fcf3df6fd00 add_opencl_devices:914 - opencl platform: 1, name=Clover, vendor=Mesa
[D] 03:02:30.704810883.704 0x7fcf3df6fd00 add_opencl_devices:928 - opencl number of devices: 1
[D] 03:02:30.704820241.704 0x7fcf3df6fd00 add_opencl_devices:952 - opencl device: 0, platform name=Clover, device name=AMD Radeon Graphics (radeonsi, renoir, LLVM 17.0.6, DRM 3.59, 6.12.9-200.fc41.x86_64), types=[GPU, ]
[D] 03:02:30.704825898.704 0x7fcf3df6fd00 add_opencl_devices:971 - opencl clover device => skipping
[D] 03:02:30.704978011.704 0x7fcf3df6fd00 () - amd gpu detected
[D] 03:02:30.704986113.704 0x7fcf3df6fd00 () - hw accel detected
[D] 03:02:30.704992259.704 0x7fcf3df6fd00 () - system-flags: 5
[D] 03:02:30.705452509.705 0x7fcf3df6fd00 () - service refresh status, new state: idle
[D] 03:02:30.716223206.716 0x7fcf3df6fd00 () - app service state: busy => idle
[W] 03:02:30.720738200.720 0x7fcf3df6fd00 () - no available mnt langs
[W] 03:02:30.720765856.720 0x7fcf3df6fd00 () - no available mnt out langs
[W] 03:02:30.720772072.720 0x7fcf3df6fd00 () - no available tts models for in mnt
[W] 03:02:30.720776472.720 0x7fcf3df6fd00 () - no available tts models for out mnt
[W] 03:02:30.720781710.720 0x7fcf3df6fd00 () - invalid task, reseting task state
[D] 03:02:30.724868929.724 0x7fcf3df6fd00 () - app busy: true => false
[D] 03:02:30.725516353.725 0x7fcf3df6fd00 () - stt models changed
[D] 03:02:30.725599743.725 0x7fcf3df6fd00 () - update listen
[D] 03:02:30.725623069.725 0x7fcf3df6fd00 () - tts models changed
[D] 03:02:30.725650307.725 0x7fcf3df6fd00 () - update listen
[W] 03:02:30.725822534.725 0x7fcf3df6fd00 () - no available tts models for in mnt
[W] 03:02:30.725830706.725 0x7fcf3df6fd00 () - no available tts models for out mnt
[D] 03:02:30.725835036.725 0x7fcf3df6fd00 () - ttt models changed
[D] 03:02:30.728420470.728 0x7fcf3df6fd00 () - mnt langs changed
[D] 03:02:30.728450012.728 0x7fcf3df6fd00 () - update listen
[W] 03:02:30.728462025.728 0x7fcf3df6fd00 () - no available mnt langs
[W] 03:02:30.728466285.728 0x7fcf3df6fd00 () - no available mnt out langs
[D] 03:02:30.734879759.734 0x7fcf3df6fd00 () - [dbus app] State called
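For anyone skimming a log of this length, here is a minimal Python sketch (not part of Speech Note) that pulls out the warnings and the detected devices. The line layout it assumes, `[LEVEL] timestamp thread-id location - message`, is inferred only from the output above, and `LINE_RE`, `SAMPLE` and `summarize` are hypothetical names chosen for illustration.

```python
import re
from collections import Counter

# Assumed layout, inferred from the output above:
# [LEVEL] timestamp thread-id location - message
LINE_RE = re.compile(r"^\[(?P<level>[A-Z])\] \S+ \S+ (?P<where>.*?) - (?P<msg>.*)$")

# Three lines copied from the log above, used as a tiny sample.
SAMPLE = """\
[W] 03:02:30.261996102.261 0x7fcf3df6fd00 cuda_dev_api:328 - failed to open cuda lib: libcuda.so: cannot open shared object file: No such file or directory
[D] 03:02:30.357987709.357 0x7fcf3df6fd00 add_vulkan_devices:1176 - vulkan device: 0 id=5708, name=AMD Radeon Graphics (RADV RENOIR), type=integrated-gpu
[D] 03:02:30.359946182.359 0x7fcf3df6fd00 add_hip_devices:860 - hip device: 0, name=AMD Radeon Graphics, gcn-arch=912, gcn-arch-name=gfx90c:xnack+
"""


def summarize(log_text):
    """Count warning messages and collect '<backend> device: ...' lines."""
    warnings = Counter()
    devices = []
    for line in log_text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # e.g. Qt messages that lack the [LEVEL] prefix
        msg = match.group("msg")
        if match.group("level") == "W":
            warnings[msg] += 1
        elif " device: " in msg:
            devices.append(msg)
    return warnings, devices


if __name__ == "__main__":
    warnings, devices = summarize(SAMPLE)
    for msg, count in warnings.most_common():
        print(f"{count}x {msg}")
    for dev in devices:
        print(dev)
```

To run it on a full log, the output of the command above could be saved to a file and its contents passed to `summarize` instead of `SAMPLE`.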
I'm glad that something is finally working :)
> I installed it [AMD add-on], as mentioned in my first post
No, uninstall it. The add-on provides the AMD ROCm framework, which most likely will not work well with your GPU. The message "To speed up processing..." is a suggestion to enable hardware acceleration in the settings (BTW, you can dismiss this warning permanently by clicking the button on the right). Your integrated GPU reports that it supports Vulkan, so you can try enabling Vulkan for the WhisperCpp engine in Settings->Speech to Text->Engine options. Integrated GPUs usually don't work well, so chances are you'll have to turn off hardware acceleration and use only the CPU.
Vulkan can potentially speed up Speech-to-Text processing, but only if you use WhisperCpp models. FasterWhisper only works on NVIDIA GPUs or on the CPU; AMD GPUs are not supported.
The name FasterWhisper can be a bit misleading, as it implies that it is "faster" than other models, but this is not necessarily true. If Vulkan can run on your AMD card, WhisperCpp will provide the best performance.
Aha! This is great! Alright so I:
- uninstalled the add-on
- selected vulkan
- and downloaded WhisperCpp
And it works pretty great! Thanks a lot man 🙂
Last few questions:
- Is the other option openVINO superior to Vulkan?
Or should I deselect something in there?
- Would you agree with this:
In summary, if you need faster transcription and can accept potential trade-offs in accuracy or resource usage, WhisperCpp Large-v3 Turbo would be the better choice. If accuracy is your top priority and speed is less critical, then WhisperCpp Large-v3 would be more suitable.
- Could this option let me get Speech Note's output directly in the browser, for example (like MacWhisper)? If so, how am I supposed to use it?
- How do you get your flawless dark mode and what are those options for?
Thanks a lot @mkiol !
Glad it works :)
> Is the other option openVINO superior to Vulkan?
No, it isn't. OpenVINO in Speech Note can potentially provide better speed when using the CPU, but only under very specific conditions (e.g. when the sentence is long). With Vulkan available, OpenVINO is almost useless, and I plan to remove or hide it in the next version.
> Would you agree with this:
> In summary, if you need faster transcription and can accept potential trade-offs in accuracy or resource usage, WhisperCpp Large-v3 Turbo would be the better choice. If accuracy is your top priority and speed is less critical, then WhisperCpp Large-v3 would be more suitable.
I agree. A comparison of the accuracy of each model can be found here: https://github.com/openai/whisper/discussions/2363#discussion-7264254. The lower the WER (word error rate), the better the accuracy.
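For reference, WER is the word-level edit distance between a reference text and the transcript, divided by the number of reference words: (substitutions + deletions + insertions) / N. Below is a minimal Python sketch of that definition. The function name and the lowercase/whitespace normalization are illustrative assumptions; published benchmarks usually normalize punctuation and numbers more carefully, so the numbers will not match theirs exactly.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Word-level Levenshtein distance, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing in transcript
                curr[j - 1] + 1,     # insertion: extra word in transcript
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on a mat"))
```

Because insertions count as errors, WER can exceed 1.0 when the transcript is much longer than the reference.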
For non-English-speaking users, it is worth mentioning that "Turbo" does not have a "translation" capability, so you cannot speak Spanish (or another language) and receive text in English. Whisper models other than Turbo (both WhisperCpp and FasterWhisper) have this capability. BTW, in Speech Note it can be enabled with a switch next to the Listen button.
> Or should I deselect something in there?
No. If everything works, don't change the advanced settings. Speech Note has them because there are so many different hardware and operating system settings, and I wanted a "toolbox" to solve unforeseen compatibility problems :)
> Could this option let me get Speech Note's output directly in the browser, for example (like MacWhisper)? If so, how am I supposed to use it?
Exactly. You place the cursor in any text-accepting field (e.g., in a browser), then launch STT (e.g., using a global keyboard shortcut), say something, and the decoded text is automatically inserted. At the moment, this only works on X11 desktops. In the next version (4.8.0), support for Wayland will be added.
The easiest way to launch it is via a global keyboard shortcut, but on Wayland this option is hidden. It will be fully supported in the next version.
> How do you get your flawless dark mode and what are those options for?
Unfortunately, this is an unsolved problem. Speech Note's UI is written in Qt and looks great under KDE Plasma. The dark theme is broken under GNOME, and frankly I don't have any idea how to solve this problem. To be honest, other Qt applications (not all) are also affected by this issue. You might try setting "Fusion" as the "Custom graphical style".
That's great @mkiol! Thanks so much for all those details.
- So look, I don't have that switch in my version... and also, when I speak French to the Turbo version, it transcribes in English perfectly!
- Also, it would be great if you could write a brief explanation of how to set up the STT option when you release version 4.8.0 👍 (I'm on Wayland)
- The Fusion style is a bit better but not perfect. I don't mind keeping the light theme until there's a solution...
Thanks man!