Category Archives: Technology

Remove Duplicate Text Lines from single or two files in Linux

Remove Duplicate Text Lines from single or two files in Linux

How to remove duplicate text lines or get only unique text lines in linux? It is very simply achievable by “Sort” and “uniq” command.

Continue reading

Category: Technology

Hive Installation

There are several ways you can install Hadoop and Hive. An easy way to install a complete Hadoop system, including Hive, is to download a preconfigured virtual machine (VM) that runs in VMWare1 or VirtualBox2.

1. http://vmware.com.

2. https://www.virtualbox.org/

Continue reading

Category: Technology | Tags: , , ,

Introduction of Hive

Hive, Originally developed by Facebook, It provides us data warehousing facilities on top of an existing Hadoop cluster.
Along with that it provides an SQL like interface which makes your work easier, in case you are coming from an SQL background. You can create tables in Hive and store data there. Along with that you can even map your existing HBase tables to Hive and operate on them.

Continue reading

MySQL,Cassandra and MongoDB

In point of view all of them MySQL,Cassandra and MongoDB are good. There are several ways to achieve best out of them with all of the technologies. It is more a question of how you use them. Your ideal solution may use a combination of these, with some consideration for usage patterns.

Continue reading

Category: Technology | Tags: , ,