https://spark.apache.org/docs/latest/index.html
- pandas API on Spark for pandas workloads
- Downloads are pre-packaged for a handful of popular Hadoop versions
- Spark runs on both Windows and UNIX-like systems, and it should run on any platform that runs a supported version of Java
- it is necessary for applications to use the same version of Scala that Spark was compiled for
For example, when using Scala 2.13, use Spark compiled for 2.13
- use this class in the top-level Spark directory.
- with this approach, each appliction is given a maximum amount of resources it can use
and holds onto them for its whole duration.
- Resource allocation can be configured as follows, based on the cluster type.
- At a high level, Spark should relinquish executors when they are no longer used and acquire when they are needed.
- We need a set of heuristics to determine when to remove and request executors.
- By default, Spark's scheduler runs jobs in FIFO fashion.
- If the jobs at the head of the queue don't need to use the whole cluster,
later jobs can start torun right away, but if the jobs at the head of the queue are large,
then ddlater jobs may be delayed significantly.
- Under fair sharing, Spark assigns tasks between jobs in a "round robin" fashion,
so that all jobs get a roughly equal share of cluster resources.
- This feature is disabled by default and available on all coarse-grained cluster managers.
- Without any intervention, newly submitted jobs go into a default pool
- This is done as follows
- This setting is per-thread to make it easy to have a thread run multiple jobs on behalf of the same user.
- If you would like to clear the pool that a thread is associated with, simply call this.
- jobs run in FIFO order.
- each user's queries will run in order instead of later queries taking resources from that user's earlier ones.
- At a high level, every Spark application consists of a driver program that runs the user's main function and executes various parallel opperations on a cluster.
- ...the cluster that can be operated on in parallel.
- This guide shows each of these features in each of Spark's supported languages.
- it's easiest to follow along with if you launch Spark's interactive shell.
-
- It is not only Value but also Pointer, both of these together make up the node.
- We do it by just having the next value of A node be the B node.
- the same is true of the C node.
- if you look at how we're going to have to traverse this, we are going to have to start at head.
- that's what we are going to do down here with this print statement.
- the syntac is a little bit different than if you are going to use dictionaries.
-
- Our managers deal with all kinds of clients every day. So I can say that we maintain the highest level of service.
- In my work I follow the best practices to maintain clean and easy to understand python codes.
- In my previous project I carried out the responsibilities of both the Project Manager and the Team Leader.
carry out : to do something, to perform
both A and B
- As a QA specialist I worked with a test environment where I tested many aspects of the platform
to ensure that it works as desired.
- A cloud architect oversees application architecture
and deploys it in cloud environments like public cloud, private cloud and hybrid cloud.
to oversee : to watch over and control something to make sure that the work is good or satisfactory, to supervise
- I took a course where I learned how to design and write programs that are easy to maintain.
to design : to create, draw, or construct something
- I will set up all the necessary equipment in my home office to work remotely on this project
- I'm an IT Technician, so I install and configure different software on all computers in the office.
to install : to put a new program or piece of software into a computer
to configure : to chagne setting of software on a computer
- As a Jr Software Engineer, I assist and participate in the research, design, development and testing software and tools.
to assist : to help someone or something
- I am a web designer, so I know hot to provide the best UX for your website visitors.
- Project managers usually estimate new projects by analogy, using previous projecs and past experience.
to estimate : to give a general idea of the cost of work or the time you need to do the work
analogy : a comparison of two things based on their being alike in some way
- Sometimes I need to google my questions, for ex "how to execute the code inside of function in JS"
- Working on my project I improved my time management and organizational skills.
- press the F2 key on your keyboard
- The screen resolution is 1366x768
- I prefer work with desktop
- Workstation PCs have multiple processor cores.
- Some tablets have a long battery life
- The volume on my speakers won't ture up.
- My printer broke down, so I printed out these documentations at work.
- 'ram' is not countable, so only possible to say 'ram is' or 'ram was', not 'rams' or 'rams are'
- with a cable : wired mouse, wired connection(Ethernet)
- without a cable : wireless mouse, wireless connection(WiFi)
- ISP stands for Internet Service Provider.
- so many folers on my desktop
- start or shut down a computer.
- turn on or turn off a computer
- to crash / to freeze up : when a computer suddenly stops working
- to look up a word or address : to find something.
we can use 'nslookup' command in terminal to query to DNS server.
- It will take about two hours to key in all this data.
to key in : to enter info into computer
- a shortcut key : 단축키.
Use Ctrl + L shortcut to see the last saved version.
- 'perform' is used a lot more than my thought. for ex, the server performs instructions written in code.
- use only the numbers in given array.
- I assigned the number 33 to age variable.
- fraction : 분수
- numerator : 분자
- denominator : 분모
ex, 1/3 : one thrid, 2/3 : two thirds, 1/2 : a half(second), 1/4 : a quarter, 3/4 : three quarters
- decimal : 소수
- decimal point : 소수점
- floating point : 부동소수점
ex, 1.23 : one point two three, 15.1 : fifteen point one
'double' also has a point but 'float' and 'double' are different each.
let's look up how they consist differently.
- I created an array of strings.
- To debug is to investigate the program and fix bugs.
- Comment is a text written around code that is ignored by the computer.
It is used for writing extra info about your code to help you undertand it later.
so we can say, 'leave comments in your code.'
- 'Comment out' is to turn a piece of code into a comment with the help of special characters.
like, //, #, -- ... etc
you can comment out some lines to see how it works without them.
- Constant is a variable that never changes its value.
for ex, val a = 1, final int a= 1
we can say "In Java, a constant is assigned using the final keyword"
"the PI constant has the value of 3.14"
- If you try to divide a number by zero, your program will crash.
A program crashes when it stops running because of an error.
- An 'executable' is a program which is ready to be run.
Short for executable file, executable program
A common filename extension .exe means that it is an executable file.
it sounds '엨즤큐터블'
- To declare(선언) in programming means to say that something exists
usually a variable, a function, or a class.
I've only declared a function, but I haven't written it yet.
- To implement(구현) means to write and complete something in code
for example, to implement a function or a class
I declared a function and implemented it. It works well!
- To instantiate(인스턴스화) means to create an object from a class.
I instantiated another object of the Student class.
it sounds '인스탠시에이트'
- A loop is a piece of code that runs itself many times.
It can also be used as a verb - to loop or to iterate
I used a "for" loop to run this code for every value in the array.
I iterate throught every element in the list.
- He read some data values from another source over the internet.
- Syntax is the grammatical rules of a programming language.
Syntax determines if code is written correctly or not.
- find any typing mistakes if you got a syntax error.