Manage Bigdata with Ambari: Part-1

Installing and managing a big data stack (Hadoop ecosystem projects, Apache Spark, Kafka, etc.) is time-consuming and tedious work. It requires many configuration file changes, node access management across the cluster, and running a series of commands on each node in the cluster. Sometimes you have to start services in a specific order and some specific…


Create Password-less login in Linux & Mac OS

Sometimes on a Linux cluster we need to automate work across various nodes, which requires login permission on each node to execute commands. OpenSSH, one of the most popular tools on Linux, Unix, and Windows systems, is the premier connectivity tool for remote login with the SSH protocol. It encrypts all traffic…


Scala 101

Scala is a general-purpose programming language that supports both the functional and the object-oriented programming paradigms. Scala has a strong static type system. The name Scala is short for “Scalable Language”*. Scala programs run on top of the Java Virtual Machine. Download Scala: You can download the current version of Scala from https://www.scala-lang.org . Configuring Scala Path and Environment: For…


Data Center Concept

Applications like Facebook, Google Search, Twitter, and Amazon.com get millions of hits daily. These applications generate terabytes to petabytes of data a day, which must be processed and stored, with a response prepared for the user in under a second for a good user experience. Handling this much data requires a specialized distributed system that can…
