Training this week is starting to go by quickly. Today we covered these topics:
- Hive programming
- ngrams
- HCatalog
Even in a limited testing environment, it is easy to see the benefits of Hive. We also compared Hive with SQL. For a couple of the labs, we used ngrams to search email data. Towards the end of the session, we discussed HCatalog, which serves as the central schema repository, so tools such as Pig and MapReduce can share the table definitions held in the Hive metastore.
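To make the ngram idea concrete for myself, here is a minimal sketch in plain Python rather than Hive. Hive has built-in `ngrams` and `sentences` functions for this kind of analysis, but the code below is not the lab's query. The function name `top_ngrams` and the sample messages are my own assumptions for illustration. It slides a window of n words across each message and counts how often each word sequence appears.

```python
from collections import Counter
import re


def top_ngrams(texts, n=2, k=3):
    """Return the k most frequent n-grams (tuples of lowercase words) across texts."""
    counts = Counter()
    for text in texts:
        # Lowercase and split into simple word tokens.
        words = re.findall(r"[a-z0-9']+", text.lower())
        # Slide a window of n words across the message.
        for i in range(len(words) - n + 1):
            counts[tuple(words[i:i + n])] += 1
    return counts.most_common(k)


# Hypothetical sample messages, not the email data used in the lab.
emails = [
    "Please review the quarterly report",
    "The quarterly report is attached",
    "Thanks for the quarterly report",
]

top = top_ngrams(emails, n=2, k=2)
for gram, count in top:
    print(" ".join(gram), count)
```

With n set to 2 this finds the most common two-word phrases. Raising n gives longer phrases, which is closer to how we searched for recurring wording in the email data.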
As I mentioned earlier, this week is going by quickly, and I know we are just scratching the surface of some of these topics. As with almost all IT training, there will be a lot of work to do on my own to get the most out of this class.