If you're like me, you may have taken these recommendations from the Hive wiki for enabling block compression in your sequence files.
Alas, we found that block compression wasn't actually happening. In looking at the Hadoop 0.20.203.0 source, the logic associated with the "io.seqfile.compressiong.type" setting is marked as deprecated.
We found it necessary to use the newer "mapred.output.compression.type" setting instead.
No comments:
Post a Comment