MongoDB

How to setup & optimize MongoDB in Ubuntu in the public cloud

Businesses imply lots of data and that makes the problem of handling and managing it harder. Traditionally, the industry has been using RDBMS systems over the decades now, but with the advent of Big Data in the 21st century, NoSQL (Not only SQL) databases came into the picture for large scale unstructured and semi-structured data. In this post, I am …

Docker Ubuntu

How to install & operate Docker on Ubuntu in the public cloud

Docker is one of the most popular products in organizations these days. It makes the process of managing applications in containers very easy. Docker provides portability, performance, agility, scalability and isolation to the applications, since it uses containers, which are more portable and require less resources than virtual machines. In this tutorial, you will go through the steps required to …

RDS Licenses

CloudSigma How To Series: RDS Licenses

In this tutorial of CloudSigma How-to Series we will walk you through the Remote Desktop License feature needed for all Microsoft Windows servers licensed through the platform. We have now enabled the ability to purchase Microsoft RDS licenses via the Webapp. According to the Microsoft Services Provider License Agreement (SPLA), all users within the Windows Server operating system need to …

VPN server under Docker

Run your own VPN server under Docker with OpenVPN Access Server

In this tutorial, I will be setting up an Open VPN server under Docker with CloudSigma. With CloudSigma’s cloud locations spread all over the globe, you can create a server in any of them and access the content from there. Creating the machine First, I am creating a machine with the following resources: 20 GHz CPU 16 GB RAM 20 …

Installing Hadoop Tutorial

Installing Hadoop on a Single Node in Five Simple Steps

Welcome to our guide on installing Hadoop in five simple steps. To start with, the Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather …

Distributed operating system

Setting up the distributed operating system (DC/OS)

According to dcos.io, DC/OS is a distributed operating system based on the Apache Mesos distributed systems kernel. Having a distributed operating system allows users to enable the management of multiple machines as if they were a single computer. In addition, it automates resource management, schedules process placement, facilitates inter-process communication, and simplifies the installation and management of distributed services. Its …

GitLab repositories featured image

Host your own Git Repositories with GitLab

In this post, I am going to demonstrate the installation of GitLab. With GitLab, we can host our own repositories at a central place with the ease of the Git features. GitLab is the first single application for all stages of the DevOps lifecycle. Only GitLab enables Concurrent DevOps, unlocking organizations from the constraints of the toolchain. In addition, GitLab …

Realtime Twitter Twitter Data Ingestion using Flume

Realtime Twitter Data Ingestion using Flume

With more than 330 million active users, Twitter is one of the top platforms where people like to share their thoughts. More importantly, twitter data can be used for a variety of purposes such as research, consumer insights, demographic insights and many more. In addition, twitter data insights are especially useful for businesses as they allow for the analysis of …

cloudera tutorial featured image

Setting up a Big Data Cluster on Cloudera Tutorial

CDH is Cloudera’s 100% open source platform distribution, including Apache Hadoop and built specifically to meet enterprise demands. CDH delivers everything you need for enterprise use right out of the box. By integrating Hadoop with more than a dozen other critical open source projects, Cloudera has created a functionally advanced system that helps you perform end-to-end Big Data workflows. (Source). …