V1 of line extractor #9

Merged
ewellenr merged 7 commits from textextractor into main 2023-10-24 15:09:19 -04:00

7 Commits

Author SHA1 Message Date
6c80e5661c Merge pull request 'V1 of line extractor' (#8) from textextractor-test into textextractor
Reviewed-on: #8
2023-10-24 15:08:23 -04:00
2706935750 First implementation of line isolator.
Isolates and returns grayscale images of each line at original
resolution. (which is much less since it's a small selected
part of the original image).

Signed-off-by: Ethan Wellenreiter <ewellenreiter@gmail.com>
2023-10-24 14:36:13 -04:00
9250037998 First steps toward isolating lines as pictures.
Clustering but need to deskew before subclustering.
Looking to dewarp page now (done in the autocropper branch).

Signed-off-by: Ethan Wellenreiter <ewellenreiter@gmail.com>
2023-10-23 17:56:56 -04:00
df56a41ac2 Merge branch 'main' of ssh://ssh.git.ewellenr.ca:2222/ewellenr/receipt_indexer into textextractor 2023-10-21 19:31:35 -04:00
83306830ac Separating text extractor and text classifier.
Exactly what the title says.

Signed-off-by: Ethan Wellenreiter <ewellenreiter@gmail.com>
2023-10-21 19:27:46 -04:00
a4d75fc6bd Updating scripts to work with this branch
Just making textextractor one of the branches that can be chosen
in the scripts.

Signed-off-by: Ethan Wellenreiter <ewellenreiter@gmail.com>
2023-10-18 23:30:35 -04:00
4015328160 Initial prep for textextractor branch.
Making the docker file and an early Jupyter
Notebook for test work.

Signed-off-by: Ethan Wellenreiter <ewellenreiter@gmail.com>
2023-10-18 23:13:18 -04:00