Neo4j is an open-source graph database, implemented in Java.The developers describe Neo4j as “embedded, disk-based, fully transactional Java persistence engine […]
Penulis: jamur
Pig UDF
Pig provides extensive support for user defined functions (UDFs) as a way to specify custom processing. Pig UDFs can currently […]
Hive Architecture
Command line interface: It’s the default and the most common way of accessing hive. Hiveserver : Runs hive as a […]
Pig Overview
Hive Vs Pig Feature Hive Pig Language SQL-like PigLatin Schemas/Types Yes (explicit) Yes (implicit) Partitions Yes No Server Optional (Thrift) […]
Hive Complex Data Types with Examples
There are three complex types in hive, arrays: It is an ordered collection of elements.The elements in the array must […]
Hive Internal & External Table
A Hive table is a logical concept that’s physically comprised of a number of files in HDFS. Tables can either […]
Hive Services
Cli —The command line interface to Hive (the shell). This is the default service. Hiveserver –Runs Hive as a server […]
Hive Shell is run on two modes
The shell is the primary way that we will interact with Hive, by issuing commands in HiveQL. HiveQL is Hive’s […]
Aggregate Functions in Hive
The following are built-in aggregate functions are supported in Hive: count(*), count(expr), count(DISTINCT expr[, expr_.]) count(*) – Returns the total […]
Hive Built-In Functions
Functions in Hive are categorized as below. Numeric and Mathematical Functions: These functions mainly used to perform mathematical calculations. Date […]