For the unaware, Data Lineage is, in very short words, the study of data from its source to its eventual target. Much like tracing a family tree, we analyze the ancestry of the data we are dealing with.
Starting from its source, the data travels through different subsystems, sometimes going through transformations along the way, and thus possibly changing shape too.
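To make that journey a bit more concrete, here is a minimal Python sketch of a record that carries its own lineage as it passes through a few steps. Everything in it is made up for illustration: the `TrackedRecord` class, the `apply_step` helper, and the step names (`crm_export`, `staging_rename`, `warehouse_merge_name`) are assumptions, not part of any real lineage tool.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class TrackedRecord:
    """A piece of data plus the ordered list of steps it has passed through."""
    value: Dict[str, str]
    lineage: List[str] = field(default_factory=list)


def apply_step(record: TrackedRecord, name: str,
               transform: Callable[[Dict[str, str]], Dict[str, str]]) -> TrackedRecord:
    """Run one transformation and append its (hypothetical) name to the lineage."""
    return TrackedRecord(value=transform(record.value),
                         lineage=record.lineage + [name])


# The source system: where the data is born.
source = TrackedRecord({"first": "Ada", "last": "Lovelace"}, ["crm_export"])

# A subsystem that only renames fields.
renamed = apply_step(
    source, "staging_rename",
    lambda r: {"first_name": r["first"], "last_name": r["last"]},
)

# A subsystem that changes the shape of the data: two fields become one.
merged = apply_step(
    renamed, "warehouse_merge_name",
    lambda r: {"full_name": f'{r["first_name"]} {r["last_name"]}'},
)

print(merged.value)                 # {'full_name': 'Ada Lovelace'}
print(" -> ".join(merged.lineage))  # crm_export -> staging_rename -> warehouse_merge_name
```

Reading `merged.lineage` from right to left is the "family tree" walk: from the target back through each transformation to the original source.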
Informatica published a very interesting blog post on data lineage as early as 2007, and it can turn out to be fairly informative.