Just wondering since I know a lot of people quietly use a screen-area-select -> tesseract OCR -> clipboard shortcut.

  • I separate subjects of interest into different Firefox windows, in different workspaces – so I have an extension title them and a startup script parse text to ask the compositor to put them in the correct workspace (lets me restart more conveniently).
  • My 2-in-1 automatically switches between different wallpapers depending on whether I use it in portrait or landscape (kind of just for looks, but I don’t know if anyone else adds a wallpaper change to their screen rotation keybind).
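
For anyone curious, the startup-script idea from the first bullet might look roughly like this. It is only a sketch, assuming Hyprland (`hyprctl`) and `jq` are installed. The `[work]` and `[music]` title prefixes, the workspace numbers, and the `firefox` window class are hypothetical placeholders, not the actual setup described above.

```shell
#!/bin/sh
# Map a window title to a workspace number.
# The prefixes and numbers here are hypothetical examples.
workspace_for_title() {
    case "$1" in
        "[work]"*)  echo 2 ;;
        "[music]"*) echo 3 ;;
        *)          return 1 ;;   # unknown title: leave the window alone
    esac
}

# Ask Hyprland for all windows, keep the Firefox ones, and move each
# to the workspace its title prefix maps to.
place_firefox_windows() {
    hyprctl clients -j |
        jq -r '.[] | select(.class == "firefox") | "\(.address)\t\(.title)"' |
        while IFS="$(printf '\t')" read -r addr title; do
            ws=$(workspace_for_title "$title") || continue
            # "silent" moves the window without switching workspace
            hyprctl dispatch movetoworkspacesilent "$ws,address:$addr" </dev/null
        done
}
```

Calling `place_firefox_windows` a few seconds after Firefox restores its session would then sort the windows. The window class may differ by Firefox build, so check `hyprctl clients` first.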
  • oldfart@lemm.ee
    3 days ago

    ChatGPT wrote a Python program that does select->Tesseract OCR for me, but it doesn’t always work right with two monitors. I’m too stupid to correct it. How have you done yours, what are you using for selecting the area?

    • fool@discuss.tchncs.deOP
      3 days ago

      pasting from my keybind config

      # snippet based on end4 dotfiles -- FIXME edge case where a
      #     preexisting tmp.png might be overwritten
      # English
      bind = Super+Shift,T,exec,grim -g "$(slurp $SLURP_ARGS)" "tmp.png" && tesseract -l eng "tmp.png" - | wl-copy && rm "tmp.png"
      # Korean
      bind = Super+Shift,K,exec,grim -g "$(slurp $SLURP_ARGS)" "tmp.png" && tesseract -l kor "tmp.png" - | wl-copy && rm "tmp.png"
      # Japanese
      bind = Super+Shift,J,exec,grim -g "$(slurp $SLURP_ARGS)" "tmp.png" && tesseract -l jpn "tmp.png" - | wl-copy && rm "tmp.png"
      

      I just use slurp to select part of the Wayland screen, have grim save that region to a temporary PNG, run tesseract on it and pipe the text into the clipboard with wl-copy, then delete the temporary PNG.

      edit: clarified
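
One way to sidestep the `tmp.png` overwrite mentioned in the FIXME is to skip the temporary file and stream the image through pipes. The sketch below assumes `grim` writes to stdout when given `-` as the output file and that `tesseract` accepts `stdin` and `stdout` as its input and output names. The function name `ocr_to_clipboard` is made up for illustration.

```shell
#!/bin/sh
# Usage: ocr_to_clipboard [tesseract-language]   (e.g. eng, kor, jpn)
ocr_to_clipboard() {
    lang=${1:-eng}
    # slurp prints the selected region; bail out if the selection is cancelled.
    # SLURP_ARGS is left unquoted on purpose so it can hold several flags.
    region=$(slurp $SLURP_ARGS) || return 1
    # grim streams the PNG to stdout, tesseract reads it from stdin and
    # prints the recognized text, wl-copy puts that text on the clipboard.
    grim -g "$region" - | tesseract -l "$lang" stdin stdout | wl-copy
}
```

Saved as a script that ends by calling `ocr_to_clipboard "$1"`, each keybind would only need to `exec` that script with a different language code.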