Export Samsung Notes to txt, jpg, Google Keep

2 minute read

If you buy a new Handy which is not a Samsung and you wanna restore your notes on this one using Google Keep or either export them all in txt files and get the pictures.

I built a small batch file using ghostscript, iconv, Gow (with sed), xpdf-tools with pdfimages, ImageMagick with mogrify and a python script: txt-to-google-keep-notes.py from sliceofbytes.

The procedure is as follow:

Export the notes from your handy in pdf (put them on Google Drive e.g.)
get it into txt file via ghostscript Follow what is selected on the left window and the content of the file on the right window
remove last line and first spaces
delete accents and special characters
separate title and content
extract pictures and convert them in jpg
remove all what is not needed
Import txt in your Google keep via txt-to-google-keep-notes.py
- change the part for your username and password (you do this at your own risk, I take no responsabilities that someone is stiling your password by doing this)
- run it with python installed and python your-python-script.py
check your Google Keep
Import manually your pictures if needed (only big manual step, but pretty fast)

All the source are present in the batch file below:

@echo off
SETLOCAL ENABLEDELAYEDEXPANSION

	
for %%a in (*.pdf) do (

	REM Extract text from pdf in txt file
	gswin64c.exe -sDEVICE=txtwrite -o %%~na.txt %%a
		REM Source : https://stackoverflow.com/questions/3650957/how-to-extract-text-from-a-pdf
	
	REM Remove last line and remove all double spaces, remove first character if space
	sed "$ d" %%~na.txt | sed "s/  //g" | sed "s/^ //" > %%~na_clean1.txt
		REM rm space with sed : https://stackoverflow.com/questions/6534556/how-to-remove-and-all-of-the-escape-sequences-in-a-file-using-linux-shell-sc and  https://askubuntu.com/questions/537956/sed-to-delete-blank-spaces
		REM Remove first character if : https://superuser.com/questions/157344/delete-the-first-known-character-in-a-string-with-sed
		REM Other tips found
			REM use sed for special characters using coding hex in the table : https://en.wikipedia.org/wiki/ASCII#ASCII_control_characters
				REM e.g. replace SOH with LF sed "s/\x01/\x0A/g" 
				REM do not use ' but "
	
	
	REM special character like accent (I could not use sed with the cmd from windows)
	type %%~na_clean1.txt | iconv.exe -f utf-8 -t ascii//TRANSLIT | tr -d "'^`"> %%~na_noaccent.txt
		REM iconv to discociate the accent with letter : https://stackoverflow.com/questions/10207354/how-to-remove-all-of-the-diacritics-from-a-file - download : https://sourceforge.net/projects/gnuwin32/files/libiconv/1.9.2-1/
		REM only iconv functions on cmd, all of this does not: 
			REM sed -e 's/[àâ]/a/g;s/[ọõ]/o/g;s/[í,ì]/i/g;s/[ê,ệ]/e/g' 
			REM sed -i 'y/āáǎàēéěèīíǐìōóǒòūúǔùǖǘǚǜĀÁǍÀĒÉĚÈĪÍǏÌŌÓǑÒŪÚǓÙǕǗǙǛ/aaaaeeeeiiiioooouuuuüüüüAAAAEEEEIIIIOOOOUUUUÜÜÜÜ/' file
	
	REM Get first line to change filename
	sed -n 1p %%~na_noaccent.txt > %%~na_title
	set /p fn=<%%~na_title
	
	REM Remove first line 
	sed "1d" %%~na_noaccent.txt > %%~na_final.txt
	
	type %%~na_final.txt > !fn!.txt
	
	rm %%~na.txt
	rm %%~na_clean1.txt
	rm %%~na_noaccent.txt
	rm %%~na_final.txt
	rm %%~na_title
	
	
	REM EXTRACT PICTURES PDF
	pdfimages.exe -j "%%a" "!fn!"
		REM Tools here : https://www.xpdfreader.com/download.html
		REM https://stackoverflow.com/questions/17065274/how-to-extract-images-from-pdf-using-ghostscript-or-imagemagick
	magick mogrify -format jpg *.ppm
	del /S *.ppm
	
	
)

REM import in keep : https://gist.github.com/sliceofbytes/f5eab8911c761ff6760362beb17e6477

Share on

X Facebook LinkedIn Bluesky

Dorian Gravier

Export Samsung Notes to txt, jpg, Google Keep

Share on

You May Also Enjoy

Compare 2 files and get the output colored from icdiff in html

Fix chrome to open http or local link, file in a new tab when you use scoop

Html table with images to xlsx with python

Add margin to epub manga images for the ebook Sony PRS-T2