Kindle Word Wise databases

kll.en.en.klld/WordWise.kll.en.en.db

Find in Kindle device at path /system/kll/

Or install Kindle Android app and extract Word Wise file at: /data/data/com.amazon.kindle/databases/wordwise/WordWise.kll.en.en.db

android_metadata

locale

en_US

lemmas

id

lemma

1

a

2

from a to z

3

from A to Z

4

from point a to point b

5

from (point) A to (point) B

metadata

key

value

maxTermLength

3

termTerminatorList

, ; . “ ‘ ! ?

definitionLanguage

en

id

cn-kll.en.en

lemmaLanguage

en

version

2016-03-09

revision

52

tokenSeparator

encoding

1

pos_types

id

label

0

noun

1

verb

2

adjective

3

adverb

4

article

5

number

6

conjunction

7

other

8

preposition

9

pronoun

10

particle

11

punctuation

senses

id

display_lemma_id

term_id

term_lemma_id

pos_type

source_id

sense_number

synset_id

corpus_count

full_def

short_def

example_sentence

1

910

878

910

0

1

1.0

34284

0

YSBuYW1lIG9yIHRpdGxl

YSBuYW1lIG9yIHRpdGxl

YW4gaG9ub3JhcnkgYXBwZWxsYXRpb24=

Some lemmas have multiple senses:

sqlite> SELECT * FROM lemmas WHERE id = 910;
id|lemma
910|appellation
sqlite> SELECT count(*) FROM lemmas;
74623
sqlite> SELECT count(*) FROM senses;
84770
>>> import base64
>>> base64.b64decode("YSBuYW1lIG9yIHRpdGxl")
b'a name or title'
>>> base64.b64decode("YW4gaG9ub3JhcnkgYXBwZWxsYXRpb24=")
b'an honorary appellation'

Base64 - Wikipedia

base64 — Base16, Base32, Base64, Base85 Data Encodings — Python 3.9.0 documentation

codecs — Codec registry and base classes — Python 3.9.0 documentation

sources

id

label

0

1

Merriam-Webster

2

3

现代英汉词典

4

5

kll.en.zh.klld/WordWise.kll.en.zh.db

android_metadata

Same as en.

lemmas

Same as en.

metadata

key

value

maxTermLength

3

termTerminatorList

, ; . “ ‘ ! ?

definitionLanguage

zh

id

cn-kll.en.zh

lemmaLanguage

en

version

2016-04-07

revision

52

tokenSeparator

encoding

1

pos_types

Same as en.

senses

id

display_lemma_id

term_id

term_lemma_id

pos_type

source_id

sense_number

synset_id

corpus_count

full_def

short_def

example_sentence

1

910

878

910

0

3

1.0

34284

0

5ZCN56ew77yb56ew5Y+377yb56ew5ZG8

5ZCN56ew77yb56ew5Y+377yb56ew5ZG8

>>> import base64
>>> base64.b64decode("5ZCN56ew77yb56ew5Y+377yb56ew5ZG8").decode("utf-8")
'名称;称号;称呼'

sources

Same as en.

LanguageLayer.en.ASIN.kll

Kindle device path: /documents/book_name.sdr

Android app path: /data/user/0/com.amazon.kindle/databases/WordWise.en.ASIN.acr.db

metadata

key

value

acr

CR!AX4P53SCH15WF68KNBX4NWWVZXKG

targetLanguages

en

sidecarRevision

9

bookRevision

8d271dc3

sourceLanguage

en

enDictionaryVersion

2016-09-14

enDictionaryRevision

57

enDictionaryId

kll.en.en

sidecarFormat

1.0

acr: Palm DB name, first 32 bytes of MOBI file. asset_id in KFX metadata.

bookRevision: Unique-ID of MOBI header.

glosses

start

end

difficulty

sense_id

low_confidence

2625

1

113403

0

2644

1

114411

0

2682

2

106210

0

2763

2

33584

0

2812

2

10189

0

difficulty = 1 -> Fewer Hints

difficulty = 5 -> More Hints

start: bytes offsets for MOBI and AZW3, Unicode character offsets for KFX.