Skip to content

Nurech/sist2_index_files

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 

Repository files navigation

sist2_index_files

This web UI based localhost program makes your pdf, image, powerpoint (etc.) files content searchable by plain text. Execute single *.bat file to orchestrate the launching of docker containers which do all the magic. From whereever the batfile is launched all folders and files are recursivley scanned down. My advice is to make a folder (e.g.: ~/myFolder) with contents you want to scan. Launch the bat file in ~/myFolder.

sist2: https://github.com/simon987/sist2

demo of sist2: https://sist2.simon987.net/

Additional conf of sist2: https://github.com/simon987/sist2/blob/master/docs/USAGE.md

The script run_setup.bat automates running sist2 on Windows 10/11 to index your local files and make content searchable in web UI.

  1. Ensure docker is running.
  2. Then, running bat file will write docker-compose.yml at current dir.
  3. Then bat file will run docker-compose.yml with docker.
  4. Then docker will pull and run 4 images.
  5. The 4 images will index all files at current run dir and open web UI where you can see the files.

Alternatively, edit and run docker-compose.yml locally.

1. Download and install Docker 4.22.1
2. Save run_setup.bat to desktop
3. Edit run_setup.bat "SCAN_DIR" to specify what directory needs to be scanned & indexed
    ├ default dir is %cd% (the location where run_setup.bat currently is)
    └ e.g.: drop the bat file to whatever current directory you want to be scanned and run it.
4. Run run_setup.bat
    ├ creates file 'docker-compose.yml' at run location
    ├ creates folder 'my_index' at run location to store index data
    └ runs docker-compose.yml to start containers

Tip: If something does not work, run the containers manually one by one.
    ├ make sure elastic is running and ready to accept incoming messages
    ├ for sist2 containers run order is: Scan (wait for exit) -> Index (wait for exit) -> Web
    └ or just close all docker containers and re-run run_setup.bat

Tip: In UI go to settings -> Highlight context size in characters
to increase the length of text preview

Know that docker pull for sist2 is ~292.78MiB
Elsasticsearch is ~1.24GiG.
The created index for ~40 files is ~7MiB.
Total of ~1.5GiB.

image image image

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors