#

hdfs-cluster

Here are 12 public repositories matching this topic...

hrchlhck / k8s-bigdata

Apache Spark with HDFS cluster within Kubernetes

docker kubernetes big-data apache-spark hadoop hibench hdfs-cluster k8s-bigdata intel-hibench

Updated Jul 11, 2023
Python

flaviostutz / spark-scala-jupyter

Jupyter notebook server prepared for running Spark with Scala kernels on a remote Spark master

scala spark jupyter jupyter-notebook hdfs spark-sql hdfs-docker scala-spark hdfs-cluster

Updated Apr 25, 2020
Jupyter Notebook

MengmSun / hadoop-in-docker

Hadoop in docker cluster, created by docker-compose. Create Hadoop cluster in less than 5mins.

docker hadoop docker-compose hdfs hadoop-cluster hadoop-docker hdfs-docker hdfs-cluster

Updated Apr 17, 2022
Shell

whoami-anoint / EasyHadoop

Simplified Hadoop Setup and Configuration Automation

data-science big-data hdfs ec2-instance big-data-analytics apache-hadoop big-data-projects hdfs-cluster big-data-essentials

Updated Sep 2, 2023
Shell

GirishCodeAlchemy / News-sentiment-ML-ETL-pipeline

News Sentiment Analysis using ETL pipeline

machine-learning kafka spark jupyter-notebook python3 kafka-consumer hdfs newsapi kafka-producer hdfs-dfs hdfs-client hdfs-cluster

Updated Jan 9, 2024
Jupyter Notebook

amccurry / pack

Hdfs Block Storage System

docker docker-container driver mount hdfs block-storage volume volume-plugin docker-volume-driver hdfs-cluster

Updated Jul 15, 2022
Java

dininduviduneth / reddit-explore-uu

In this project we have used comments from reddit to play around with multiple functionalities of Apache Spark, HDFS and Docker.

docker spark docker-compose jupyter-notebook hdfs-cluster

Updated Feb 7, 2024
Jupyter Notebook

ShirshaDatta / Hadoop-CheatSheet

Your go-to-cheatsheet to learn apache-Hadoop.

hadoop dfs masternode hdfs-client slave-nodes redhat-enterprise-linux hdfs-cluster multitier-architecture hadoop-cheatcheet jdk-

Updated Jan 25, 2021
Shell

xiaojie-qian / Rail-tunnel-recommendation-SQL

Modern Big Data Analysis: recommend which pair of United States airports should be connected with a high-speed passenger rail tunnel.

sql big-data hive impala cloudera s3-storage shell-script hue hdfs-cluster

Updated Jun 5, 2022
Shell

bdbao / Hadoop-VM

This project sets up a Hadoop (v3.2.3) cluster on a virtual machine (Multipass) on macOS. It includes instructions for configuring HDFS, YARN, and uploading files via command-line and web interface.

data-storage multipass apache-hadoop hdfs-cluster

Updated Oct 6, 2024
Shell

ommore1523 / BigdataClusterOnDocker

Hadoop Spark cluster setup for version 3X

docker spark hadoop cluster pyspark hdfs-cluster

Updated Mar 15, 2024
Dockerfile

keramiozsoy / apache-spark-yarn-mode-aws-101

An example of installation Apache Spark on AWS

python aws scala spark apache-spark yarn hive hadoop jupyter jupyter-notebook hdfs spark-shell apache-hadoop hdfs-cluster

Updated Apr 17, 2024

Improve this page

Add a description, image, and links to the hdfs-cluster topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hdfs-cluster topic, visit your repo's landing page and select "manage topics."