It looks like they know what they're doing. I was especially curious about deduplication. The way they do it sounds perfectly reasonable:
> MEGA indeed uses deduplication, but it does so based on the entire file post-encryption rather than on blocks pre-encryption. If the same file is uploaded twice, encrypted with the same random 128-bit key, only one copy is stored on the server. Or, if (and this is much more likely!) a file is copied between folders or user accounts through the file manager or the API, all copies point to the same physical file.
Yes, but the first part makes no sense. If the 128-bit key is indeed chosen at random for each file (as it should be), the probability that the same key will be chosen again for a second upload of the same file is effectively zero (1/2^128).
Exactly. Read: there is no real deduplication of data. So if a file is reported, there's no way to track down all the other copies, or to automatically ban a specific file hash.
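To make that concrete, here's a minimal sketch (Python with the pyca `cryptography` package; AES-CTR and SHA-256 are stand-ins for illustration, not MEGA's actual file format): two uploads of the same file under independently random keys produce unrelated ciphertexts, so a server that dedupes on the ciphertext stores both copies and has no single hash it could blacklist.

```python
# Toy model of whole-file, post-encryption dedup with a fresh random key per upload.
import os
import hashlib
from cryptography.hazmat.primitives.ciphers import Cipher, algorithms, modes

def encrypt_random_key(plaintext: bytes) -> bytes:
    """Encrypt under a fresh random 128-bit key (AES-CTR as a stand-in)."""
    key = os.urandom(16)           # new random key on every upload
    nonce = bytes(16)              # fixed nonce is fine for a key used exactly once
    enc = Cipher(algorithms.AES(key), modes.CTR(nonce)).encryptor()
    return enc.update(plaintext) + enc.finalize()

store: dict[str, bytes] = {}       # "server": dedup table keyed by ciphertext hash

def upload(ciphertext: bytes) -> str:
    h = hashlib.sha256(ciphertext).hexdigest()
    store.setdefault(h, ciphertext)  # dedup only fires on an identical ciphertext
    return h

movie = b"the same ripped movie, bit for bit identical"
h1 = upload(encrypt_random_key(movie))
h2 = upload(encrypt_random_key(movie))
print(h1 == h2)     # False: the two uploads share nothing the server can see
print(len(store))   # 2: both copies stored, and no single hash to ban
```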
Then it would be easy for law enforcement to force Mega to remove all known copies of certain copyrighted material, as they could now prove that Mega is hosting that particular bit of copyrighted material. E.g.:
If they find a ripped version of a movie/ebook/whatever, they can just encrypt it using Mega's scheme (which, in this hypothetical, derives the key from the data) and get the single canonical encrypted version of the file out. They then tell Mega to remove any stored files that match that ciphertext.
If all files are encrypted with a random key there's no way for law enforcement to do this.
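Roughly what that takedown workflow would look like under such a hypothetical convergent scheme (a sketch only; the key derivation and AES-CTR details are assumptions for illustration, not MEGA's actual design, which as noted elsewhere in the thread uses random keys):

```python
# Hypothetical convergent scheme: the per-file key is derived from the file itself,
# so anyone holding the plaintext can recompute the exact ciphertext to match against.
import hashlib
from cryptography.hazmat.primitives.ciphers import Cipher, algorithms, modes

def convergent_encrypt(plaintext: bytes) -> bytes:
    key = hashlib.sha256(plaintext).digest()[:16]   # 128-bit key derived from the data
    enc = Cipher(algorithms.AES(key), modes.CTR(bytes(16))).encryptor()
    return enc.update(plaintext) + enc.finalize()   # deterministic: same file -> same ciphertext

ripped_copy = b"bit-for-bit contents of the infringing file"
fingerprint = hashlib.sha256(convergent_encrypt(ripped_copy)).hexdigest()
print(fingerprint)   # "remove every stored object whose ciphertext hashes to this"
```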
That has a good practical benefit (deduplication of the files that benefit most from it), but it doesn't actually solve the security problem at all; it just makes the trade-off one way for large files and the other way for smaller ones. If you have a legitimate reason to want privacy against data-confirmation attacks, then you need it regardless of file size.
The whole thing with deduplication is a little bit overblown anyway. You don't want a hundred copies of the same big file, but is that what really happens? Nobody wants to upload the same file a hundred times, especially if the file is very large. Once there is already a copy, passing around a link to it is much easier than uploading it again. So the most common cause is two totally unrelated people uploading the same bit-for-bit identical file, which does happen, but not so often that storing the duplicates becomes prohibitive.
And in many cases file-level deduplication is difficult or impossible anyway, because users make changes to the files (like editing embedded metadata, or pointlessly wrapping a single already-compressed file in a .rar archive). So the benefit you get from deduplication is not nothing, but whether it's a reasonable trade-off to make against privacy depends on the situation.
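A trivial illustration of why whole-file dedup is so fragile: change one byte of metadata and the digests share nothing (SHA-256 here just as an example hash; the file contents are made up):

```python
import hashlib

original = b"ID3 Title=Foo " + b"\x00" * 1000   # stand-in for a tagged media file
retagged = b"ID3 Title=Bar " + b"\x00" * 1000   # same payload, one tag edited

print(hashlib.sha256(original).hexdigest())
print(hashlib.sha256(retagged).hexdigest())     # completely unrelated digest -> no dedup
```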
They don't seem to do that, though. Note that they claim the key is random and that deduplication is "much more likely" to happen when files are copied. If they derived the key from the data in a deterministic way, they could always dedup, and the earlier statement (that deduplication of copied files is more likely) could not be true.
Based on all the analyses published so far, it does not look like that at all. In your view, what makes it appear that their crypto was implemented in anything resembling a proper fashion?
So it basically dedupes whatever you copy to a different folder in your own account. I guess this is the best they can do without knowing anything about the files (though it's not really that useful). To get true deduplication you need convergent encryption, which reveals more information about what you are storing (e.g. if I store the same file as you, I will know what your file is).
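A minimal sketch of that trade-off, assuming a toy convergent scheme (key = SHA-256 of the plaintext, AES-CTR; these details are illustrative, not anyone's actual design): dedup now works across unrelated users, but a dedup hit is exactly the confirmation leak described above.

```python
import hashlib
from cryptography.hazmat.primitives.ciphers import Cipher, algorithms, modes

def convergent_encrypt(plaintext: bytes) -> bytes:
    key = hashlib.sha256(plaintext).digest()[:16]   # key is a pure function of the data
    enc = Cipher(algorithms.AES(key), modes.CTR(bytes(16))).encryptor()
    return enc.update(plaintext) + enc.finalize()

seen: set[str] = set()                              # server-side dedup table

def upload(plaintext: bytes) -> bool:
    """Return True if the server already held this exact file (dedup hit)."""
    h = hashlib.sha256(convergent_encrypt(plaintext)).hexdigest()
    hit = h in seen
    seen.add(h)
    return hit

print(upload(b"some-linux-distro.iso"))   # False: first copy gets stored
print(upload(b"some-linux-distro.iso"))   # True: a second, unrelated uploader dedups --
                                          # and thereby confirms the first user has this file
```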