DescriptionZipf-heot-0 Hebrew - Books of the Torah.svg
English: Zipf law plot (frequency as function of frequency rank) for the first five books (Torah, Pentateuch) of the Hebrew Bible. The original text is the Hebrew language version (the Masoretic text), with vowel points but without cantillation marks. That texts is a 10th century compilation of texts written probably around ~500 BCE, based on even earlier texts. The file was obtained from the Sacred Texts site, maintained by John B. Hare, and was converted to an ad-hoc single-byte encoding designed to look vaguely phonetic under an ISO-Latin-1 font.
The books and the respective word frequency files are:
The word frequency files '*/*/*/gud.wfr' are available at the UNICAMP website. The original annotated full texts are in the companion files */*/org/main.src. The extracted texts -- one word per line, without punctuation -- are in */*/*/gud.tlw.
to share – to copy, distribute and transmit the work
to remix – to adapt the work
Under the following conditions:
attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
share alike – If you remix, transform, or build upon the material, you must distribute your contributions under the same or compatible license as the original.