Do you know of any tool that can do it?

Let’s imagine I have the following lines in a txt file:

234545453535
828272726266
356565656565
245454545454
245454545454
245454545454
245454545454
234545453535
828272726266
111111111111
111111111111
000009000000
111111111111

I guess you already know what I to express.

What I wish is a tool that could read that txt file (for example) and say that W,X,Y,Z (for example) were the entries most repeated.

I’m talking about thousands of entries, so doing it manually or by some sort of graph would be suicide, I think.

I know that with sheet processor I can use a filter, but it will only be possible to see it entrie by entrie.

Are you familiar with any tool that could do it?

Regards

Is this a one-off task or will you need to repeat it?

A quick search for cryptanalysis tools will likely provide more results.

Ganzúa download | SourceForge.net provides frequency analysys for characters, bigrams and trigrams.

http://www.cryptograms.org/ may prove interesting too.

No problem, I did mean the frequency of the task itself.

I assume this needs to be a “free” solution? (related to my first question really, since free rules out using some snazzy, although expensive, data analysis tool on trial)

You mentioned thousands of records… will the data set to be analysed grow? Any processing time-frame restrictions?

I deleted some stuff on seeing gibran’s post (repeats). :slight_smile:

That type of information isn’t really proper frequency analysis. I suspect that sticking the data into a MySQL database & running a fairly straight forward SQL query can give those answers. gibran, what do you think?

Not sure about the whole task at hand but IMHO your suggestion will likely prove more efficient.
AFAIK ganzua is meant to address cryptograms and it looks like it cannot export frequency counts.

OK, my SQL is rusty & I’ve not personnally used MySQL (although I know it’s free). But, I do know that SQL itself is quite capable of performing this task with a SELECT utilising a combination of COUNT & GROUP BY… with maybe needing an Outer Join to exclude COUNT returns of 1 (if required).