We already are pretty sure that alien meddling was instrumental in our transition to humanity a mere 200,000 years ago. Thus the idea that earth was seeded with artificial DNA early on is not unreasdonable and surely inspires detailed mapping and comparison of our genome to that of our so called close relatives.
Our Alien DNA
Here we show that the terrestrial code displays a thorough precision-type orderliness matching the criteria to be considered an informational signal. Simple arrangements of the code reveal an ensemble of arithmetical and ideographical patterns of the same symbolic language. Accurate and systematic, these underlying patterns appear as a product of precision logic and nontrivial computing rather than of stochastic processes (the null hypothesis that they are due to chance coupled with presumable evolutionary pathways is rejected with P-value < 10–13).
The signal displays readily recognizable hallmarks of artificiality.
A little background will help here… Prime numbers are those that cannot be made by multiplying together any other whole numbers: 1, 3, 5, 7, 11, 13 and on up, as high as you want to go. No known formula on natural process generates them. If you see primes, you know that mind is not far behind.
A widely accepted idea for interstellar message construction is to send digital pulses that repeat with a number that is the product of two primes multiplied together. This would suggest that a two-dimensional picture is being sent. For example, if you received a message that was repeating a sequence of 143 pulses, you, or your machine, would say, "Oh, 143 is 11 times 13, and these are both prime numbers. Let's make an 11-by-13 array and see if there's a picture encoded here." This technique - using primes to create easily decoded 2-D images - is a pillar of SETI theory.
Now back to the message in that virus, bacteriophage φX174. It's the first organism for which the entire genome was decoded. A remarkable feature discovered was the presence of "overlapping genes". These are sequences of nucleotides that can be read in two different frames, to encode for two completely different proteins. In other words, the DNA sequence CAATGGAACAACTCA, can be read as the three-letter "words" CAA TGG AAC AAC TCA, and this will instruct a cell to start building a protein by putting together the five amino acids specified by these triplets. However, starting at a different letter, the same sequence can also be read as ATG GAA CAA CTC, and so on, which will build a completely different protein. It is as if you could write an English sentence criticizing a bickering couple, "CAN YOU TWO NAG", that also contains a message about the ninth inning of a baseball game, "ANY OUT WON", except that to make proteins you would have to continue in this overlapping mode for hundreds of words, with both sentences making complete sense.
But wait, there's more. This organism (φX174) contained not one, but three pairs of these overlapping genes. And, if you count the number of letters in these overlapping sequences, you find that they are 121, 91, and 533. Each of these is the product of two prime numbers (11x11, 7x13, and 13x41). How weird is that? Here was the widely accepted signature of an intelligent message, turning up in the strangest of places.
The human genome...consists of some 2.9 billion of those letters — the equivalent of about 750 megabytes of data — but only about 3 percent of it goes into composing the 22,000 or so genes that make us what we are.
The remaining 97 percent, so-called junk DNA, looks like gibberish. It’s the dark matter of inner space. We don’t know what it is saying to or about us, but within that sea of megabytes there is plenty of room for the imagination to roam, for trademark labels and much more. The King James Bible, to pick one obvious example, only amounts to about five megabytes.