Post by Stan BrownPost by Larry WolffThe PDFs I have come across perfectly as text so that's not the problem.
The text can be spoken to English on Windows also, so that's not a problem.
The problem is converting that spoken words to an audio file.
I searched far and wide, and while I found lots of online clickbait, I
can't yet find a freeware Windows 10 program to convert it to audio.
That's the sort of thing Audacity can do, if I'm not mistaken.
<https://www.audacityteam.org/>
Audacity is mainly an audio editor. It takes an audio input
file, and outputs an audio file on output. For example,
you can "normalize" or "compand" audio, interact with the
amplitude. Or the frequency response.
Historically, Audacity didn't even have FFMPEG in it. It
did not have multimedia conversion capability, like open
an MP4 video and extract an M4A audio stream. The input
formats originally were quite limited. The LAME MP3 module,
was their "feature" (not bundled, had to be acquired in a
brown paper bag). Most of the other formats would be
closer to PCM (pulse code modulation). MP3 is now "legal",
so the brown paper bag is optional.
I don't think working with SAPI would be a high runner.
Mainly because SAPI is Microsoft, and Audacity is FOSS
and SAPI-equivalent on Linux (assuming there is one),
would mean writing the software twice. And you know
how FOSS only accepts cross-platform standards, so they
"only have to write the code once". They hate having to support
custom standards on each platform.
Which is why LibreOffice, Firefox, and the like, might
be using OpenGL for their graphics, rather than
DirectX3D. As it is, the Firefox graphics person was
griping about having to support X11 and Wayland at the
same time (during the Wayland change-over interval,
which could be a while). X11 runs GLXGears at 20000 FPS,
while Wayland manages 12000 FPS (slower, gee, thanks).
If you wrote Firefox browser code, and you supported
OpenGL on Linux and DirectX on Windows, the graphics
person would have extra work to do.
On VLC, it means VLC can directly use a TV tuner to
play a TV channel (in Linux). The code does not exist on Windows,
as far as I know you'd need to use Media Center software
interface (on the assumption the TV tuner card is
Media Center compatible and comes with a Media Center
driver). I own two tuner cards, one ancient, and the
ancient one is not Media Center, and the newer one is
Media Center. That means, if VLC ever gets that working,
by writing Windows-specific [Media Center] code in VLC
for it, only the newer card would work.
*******
https://en.wikipedia.org/wiki/Audacity_%28audio_editor%29
"Importing, exporting and conversions
Audacity natively imports and exports WAV, AIFF, MP3, Ogg Vorbis,
and all file formats supported by libsndfile library. Due to
patent licensing concerns, the FFmpeg library necessary to import
and export proprietary formats such as M4A (AAC) and WMA is
not bundled with Audacity but has to be downloaded separately.[23]
In conjunction with batch processing features, Audacity can be used
to convert files from one format to another, or to digitize records,
tapes or MiniDiscs.
"
"Customizability and extensibility
Audacity supports ... audio effect plugins ...
In January 2024, Intel introduced some AI-powered capabilities for
Audacity as part of its OpenVINO plugin suite.[29][30]
"
That last item is more likely to be a lever to expand Audacity, than
perhaps waiting for them to be adventurous enough to include a
"small" version of FFMPEG.
If they were to include calls to SAPI,
they'd probably have heart failure :-) Their lawyer would tell them
to stop :-) By not being adventurous, they can "just edit audio files".
And stay out of court. It's legal chill that makes people less adventurous.
Paul