Jump to content

Talk:Unicity distance

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia

Meanings and abrevation of.. FER Ec/Io CAI Finger RFI

Please improve sentence structure

[edit]

This paragraph gives me parsing problems, Should the second sentence be joined to the first as as it is read, since if? I cannot tell.

Another way to increase the unicity distance is to increase the number of possible valid sequences in the files as it is read. Since if for at least the first several blocks any bit pattern can effectively be part of a valid message then the unicity distance has not been reached. This is possible on long files when certain bijective string sorting permutations are used, such as the many variants of bijective BWT transforms. — Preceding unsigned comment added by 143.111.80.33 (talk) 17:15, 11 March 2013 (UTC)[reply]

Broken character, questionable example

[edit]

Basically the bigger the unicity distance the better. For a one time pad, given the unbounded entropy of the key space, we have U = , which is consistent with the one-time pad being theoretically unbreakable.

The  is broken for me (Ubuntu, Firefox, with Japanese/Korean support). --Sydius (talk) 00:55, 19 August 2008 (UTC)[reply]

Basically the bigger the unicity distance the better. For a one time pad of unlimited size, given the unbounded entropy of the key space, we have , which is consistent with the one-time pad being theoretically unbreakable.
I improved the expression here, but the example is not really on target. I would not speak in terms of infinity, which will seem too theoretical even to the theoretically-minded, but rather point out that a one-time pad is used only once, for a message much shorter than the unicity length. A calculation with a ten-thousand word message in English would be great here. For this example: How long a key to get a unicity length of a million? How many key candidates with this key length (surely much greater than one)? It might not be enough for a conviction, but might still justify an arrest. 178.38.151.183 (talk) 17:33, 30 November 2014 (UTC)[reply]

Logic of permutation method escapes me

[edit]

Another way to increase the unicity distance is to increase the number of possible valid sequences in the files as it is read. Since if for at least the first several blocks any bit pattern can effectively be part of a valid message then the unicity distance has not been reached. This is possible on long files when certain bijective string sorting permutations are used, such as the many variants of bijective Burrows-Wheeler transforms.[dubiousdiscuss]

I don't quite get this.

First of all, it is not very precisely stated (the first two sentences).

Second, if a permutation is used, it does not change the entropy of the message space since the probabilities are only permuted. Unless there is some implicit framework assumption about the length of the text window for entropy measurements, the length of the permutations versus the length of the messages, the length of code blocks, or something. But then this should be made clear.

And/or: if someone can provide a citation it would help a lot or solve the problem. 178.38.151.183 (talk) 18:54, 30 November 2014 (UTC)[reply]

Removed. Four years is long enough time to provide a citation. Suffusion of Yellow (talk) 02:17, 2 July 2018 (UTC)[reply]

Can we have some examples?

[edit]

There are a few moderately "famous" short encrypted messages. Not all have been solved. It would be interesting if someone who knows about this could analyse what would be unicity distance for these. If I may presume to list some examples:

The message in the introduction to Robert Harris' book "Enigma" (which I believe has been solved) The possinbly encrypted message on the scrap of paper found in the pocket of "Somerton Man".

No doubt there are others. 2001:8003:E40F:9601:998C:AA9D:2DFB:522C (talk) 01:06, 24 December 2024 (UTC)[reply]

To calculate the Unicity distance, you need to know the key space of the cipher. For many of these famous undecrypted messages, the cipher is unknown, and so the key space is unknown. David Malone (talk) 20:20, 24 December 2024 (UTC)[reply]

Unicity distance of substitution cipher - a consideration not included.

[edit]

Without disputing the mathematics of this, surely this is missing an important concept? In every language, there are (in general) some letters that always occur together For example, in English, and disregarding "modern" proper nouns, a Q is ALWAYS followed by a U. This also implies that a Q cannot occur at the end of a word.

Also, again generalising, some letters can never occur in adjacent pairs. You wont ever see a ZK in Engligh. As an extension of this, some letters can never be doubles. For example, in English, you would never find a double X or double H.

Surely this means, then, that in different languages, the Unicity distance of substitution cipher depens heavliy on the language, and such rules? 2001:8003:E40F:9601:998C:AA9D:2DFB:522C (talk) 01:46, 24 December 2024 (UTC)[reply]

Yes - you need to know the plaintext redundancy, which depends on the language being used. David Malone (talk) 20:21, 24 December 2024 (UTC)[reply]