I made a Gtk GUI to open LaTeX Beamer file, so that you can extract some pages

thevoidzero@lemmy.world · 7 days ago

If there was an easy way to use rust or something on webassemly and use that instead of JS. I’d be so happy, but I can’t find how to do it without npm.

thevoidzero@lemmy.world · 7 days ago

I think that’s why Github suggests MIT as default. Unaware people will just put that. Most open source people just code things they want without thinking much on other aspects. We really need some sort of enforcement to stop companies banking on voluntary work done for the community.

thevoidzero@lemmy.world · 8 days ago

You can use your library for commercial projects that you have. Just have dual license that requires payment for commercial use or something similar. You don’t have to pay yourself

thevoidzero@lemmy.world · 16 days ago

Most open source tool have the same thing that it feels like it’s made by engineers. I think that’s because it’s true, most FOSS tools are made by engineers for engineers. Because most project start with someone needing something and then creating it and sharing it.

Chances of a programmer needing something and then making it is a lot higher, than an artist needing it and then making it as then there’d be a need to have the necessary skills to make the software. As someone not from CS field I’ve seen how much of redundant programs are present for CS related tasks while barely some exists for other fields because the overlap of programmer and that field is low specifically FOSS programmers. And a few programmers that field would have don’t have the high level software development skills, so most open source tools made by them are “works on my machine, or works for this specific task” even though with less than 1% more effort they could have made a generalized tool.

thevoidzero@lemmy.world · 18 days ago

Miniserve is a program for computers. Though you can run it in termux. It just opens a tiny server instance for file sharing, you can download/upload files to a directory directory. Will also show QR for the link.

thevoidzero@lemmy.world · 19 days ago

Don’t modify the config in /etc/, copy them in ~/.config/ and then modify them. You’ll always be able to just look at the /etc/ for defaults.

thevoidzero@lemmy.world · 2 months ago

Arch already just works, Majaro breaks more (at least for the one month I tried it while getting into Linux).

thevoidzero@lemmy.world · 2 months ago

Reading filesystem is not about which distribution you have but drivers on disc. If you have FAT the defaults should work, for NTFS you might have to install the ntfs driver. I don’t use mint but it’s the linux way so either it’s already there or you can install it. Once you have driver just mount it like a normal drive and it’s done.

thevoidzero@lemmy.world · 3 months ago

For the OCR, have you tried tesseract? For printed documents it can take image input and generate a pdf with selectable text. I don’t OCR much but it has been useful when I tried a few times.

You might be able to have a script that takes the scanner input into tesseract and output a pdf. It only works on a single image per run so I had to make script to run it on whole pdf by separating it and stitching it back together.

thevoidzero@lemmy.world · 3 months ago

Someone already talked about the XY problem, so I’ll say this.

Why sound notification instead of notification content? If your notification program (dunst in my case) have pattern matching or calling scripts based on patterns and the script has access to which app, notification title, contents etc. then it’s just about calling something in your bash script.

And any time you wanna add that functionality to something else, add one more line with a different pattern or add a condition in your script. Comparing text is lot more reliable than audio.

Of course your use case could be completely different, so maybe give some examples of use case so people can give you different ways to solve that instead of just the one you’re thinking of.

thevoidzero@lemmy.world · 4 months ago

I was thinking the same, smartphones would definitely do everything it can to make images sharp so it’s probably not going to be easily stackable.

Still it feels like something should be there to combine the pictures to make better drawing, as there’re softwares to generate 3D models from smartphone pictures.

thevoidzero@lemmy.world · edit-2 4 months ago

Did you solve it? Recently there was a problem with graphics thing and downgrading mess from 1.24 to 1.23 helped me. It was in arch with AMD graphics, but some people said Nvidia ones also had the problem.

Edit: mesa not mess

thevoidzero@lemmy.world · 4 months ago

Similar. But I do contribute by adding things I want to some projects I use if it’s simple enough.

And my pile of shit has like 40 stars, so maybe I have one or two other users besides me.

thevoidzero@lemmy.world · 4 months ago

And how do I find a job like that?

thevoidzero@lemmy.world · 5 months ago

Hi there, I did say it’s easily doable, but I didn’t have a script because I run things based on the image before OCR manually (like the negating the dark mode I tried in this script; when doing manually it’s just one command as I know whether it’s dark mode of not myself; similar for the threshold as well).

But here’s a one I made for you:

#!/usr/bin/env bash

# imagemagic has a cute little command for importing screen into a file
import -colorspace gray /tmp/screenshot.png
mogrify /tmp/screenshot.png -color-threshold "100-200"
# extra magic to invert if the average pixel is dark
details=`convert /tmp/screenshot.png -resize 1x1 txt:-`
total=`echo $details | awk -F, '{print $4}'`
value=`echo $details | awk '{print $7}'`
darkness=$(( ${value#_(%_)} * 100 / $total ))
if (( $darkness &lt; 50 )); then
   mogrify -negate /tmp/screenshot.png
fi

# now run the OCR
text=`tesseract /tmp/screenshot.png -`
echo $text | xclip -selection c
notify-send OCR-Screen "$text"

So the middle part is to accommodate images in dark mode. It negates it based on the threshold that you can change. Without that, you can just have import for screen capture, tesseract for running OCR. and optionally pipe it to xclip for clipboard or notify-send for notification.

In my use case, I have keybind to take a screenshot like this: import png:- | xclip -selection c -t image/png which gives me the cursor to select part of the screen and copies that to clipboard. I can save that as an image (through another bash script), or paste it directly to messenger applications. And when I need to do OCR, I just run tesseract in the terminal and copy the text from there.

thevoidzero@lemmy.world · edit-2 5 months ago

Not for handwritten text, but for printed fonts, getting OCR is as easy as just making a box in screen with current technology. So I don’t think we need AI things for that.

Personally I use tesseract. I have a simple bash script that when run let’s me select a rectangle in screen, save that image and run OCR in a temp folder and copy that text to clipboard. Done.

Edit: for extra flavor you can also use notify-send to send that text over a notification so you know what the OCR produced without having to paste it.

thevoidzero@lemmy.world · 5 months ago

About the malware thing. Won’t the Linux use increasing in organizations give incentive for attackers to make malwares targeting linux? It’s not like we’re malware free, it’s just that average user is informed enough and there is low use of linux making it not worth as much to target desktop users.

thevoidzero@lemmy.world · 5 months ago

Thank you. I’ve been using grk4-rs so I went over their examples and the book. Looking at the c examples might be good as well. And I’ll learn about the GObjects. So far I’ve been winging it based on examples I see and the properties of different classes.

thevoidzero@lemmy.world · 5 months ago

Thank you, I’ll go over them. I think I got a little bit of the basics now. But I need a little advanced knowledge. And most tutorials seems to be for gtk3 and many widgets are now deprecated.

Originally my plan was to make each slide edited separately when you right click and do edit. And be able to drag them up and down and everything, but my skills were not enough. After trying to make that for a week I gave up and made this simpler version. So listview and treeview models from that tutorial could give me more info.

thevoidzero@lemmy.world · 5 months ago

Yeah, the bindings were a great help. I also tried tauri, iced, beavy, and a lot of libraries. At the end just using gtk4 seemed to be the best for my use case. I went through the grk4-rs book, and then started experimenting. I don’t like GUI at all, but for simple cases it works well

thevoidzero@lemmy.world · 5 months ago

I made a Gtk GUI to open LaTeX Beamer file, so that you can extract some pages