>>6,7 Just normalize the data. Convert PDFs to plain text files. Run audio through a filter that mangles anything a human can't perceive. Run similar filters on images: blur pixels, add noise. Decompile binaries, eliminate dead code, and recompile. After recompiling, have the code decrypt itself multiple times, in 256 different ways, before it executes.
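
Rough sketch of the "add noise" step for images, to show why it works. Everything here is an assumption for illustration: the function names (embed_lsb, extract_lsb, add_noise) are made up, pixels are a flat list of 8-bit values, and the hidden data is assumed to sit in the lowest bit of each pixel. Real payloads can hide elsewhere, so treat this as a toy, not a complete filter.

```python
import random

def embed_lsb(pixels, payload):
    """Hide payload bits in the lowest bit of each 8-bit pixel value."""
    bits = [(byte >> i) & 1 for byte in payload for i in range(8)]
    if len(bits) > len(pixels):
        raise ValueError("payload too large for cover data")
    out = list(pixels)
    for i, bit in enumerate(bits):
        out[i] = (out[i] & ~1) | bit  # clear the lowest bit, then set it
    return out

def extract_lsb(pixels, n_bytes):
    """Read n_bytes back out of the lowest bits (inverse of embed_lsb)."""
    data = bytearray()
    for b in range(n_bytes):
        byte = 0
        for i in range(8):
            byte |= (pixels[b * 8 + i] & 1) << i
        data.append(byte)
    return bytes(data)

def add_noise(pixels, rng):
    """Replace each pixel's lowest bit with a random one.

    Each value moves by at most 1 out of 255, which a viewer should not
    notice, but anything stored in that bit is overwritten.
    """
    return [(p & ~1) | rng.getrandbits(1) for p in pixels]

# Toy demonstration: a flat gray "image" carrying a hidden message.
cover = [128] * 256
secret = b"hidden message"
stego = embed_lsb(cover, secret)
cleaned = add_noise(stego, random.Random(0))  # seeded only for repeatability
recovered_before = extract_lsb(stego, len(secret))
recovered_after = extract_lsb(cleaned, len(secret))
```

Before the noise pass, recovered_before matches secret. After it, recovered_after is random bits. Same idea applies to the audio filter: perturb whatever the listener can't perceive.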