Building a Data Pipeline with Kafka, Spark Streaming and Cassandra
https://www.baeldung.com/kafka-spark-data-pipeline
Book: Spark in Action (Korean edition, 스파크를 다루는 기술)
https://thebook.io/006908/part02/ch06/01/02-01/
Data Engineering Cookbook
https://github.com/andkret/Cookbook#introduction
Data Engineering Roadmap
https://github.com/hasbrain/data-engineer-roadmap
How To Connect Spark to Your Own Datasource
https://www.slideshare.net/mongodb/how-to-connect-spark-to-your-own-datasource
Explanation of Spark memory management
How to measure Spark performance
Assessing the state of a Linux system within 60 seconds
https://b.luavis.kr/server/linux-performance-analysis
Everything you need to know about MongoDB sharding
https://www.mongodb.com/presentations/webinar-everything-you-need-know-about-sharding?jmp=docs
Hadoop-Mongo integration
http://www.ikanow.com/how-well-does-mongodb-integrate-with-hadoop/
Measuring Hadoop performance
https://blog.cloudera.com/what-is-hadoop-metrics2/
Machine learning basics explained
https://github.com/PacktPublishing/Mastering-Machine-Learning-with-scikit-learn-Second-Edition
scikit-learn User Guide
https://scikit-learn.org/stable/user_guide.html
Machine learning tutorial provided by Google
Linear algebra
https://darkpgmr.tistory.com/103
A site for sharing Python notebooks
https://datascienceschool.net/view-notebook/17608f897087478bbeac096438c716f6/
An easy explanation of linear regression
https://brunch.co.kr/@itschloe1/9
Getting to know Docker and how it works
http://blog.drakejin.me/Docker-araboza-1/
https://tech.ssut.me/what-even-is-a-container/
Search results on how to measure Hadoop I/O time
https://stackoverflow.com/questions/42164449/calculate-time-taken-by-reducers-hadoop
https://github.com/linkedin/white-elephant
https://www.quora.com/MapReduce-Whats-the-best-way-to-measure-MR-job-runtime
YCSB workloads
https://github.com/brianfrankcooper/YCSB/wiki/Running-a-Workload
Q&A: Hadoop with MongoDB storage
https://stackoverflow.com/questions/52337696/hadoop-with-mongodb-storage
Q&A: How does Apache Spark know about HDFS data nodes?
https://stackoverflow.com/questions/28481693/how-does-apache-spark-know-about-hdfs-data-nodes