Statistics question

Author
Discussion

Dr Jekyll

Original Poster:

23,820 posts

268 months

Saturday 18th January 2020
I'm looking at a book on statistics for beginners.

It refers to decision trees without actually explaining them, but that's OK, I know what they are. Then it goes on to talk about 'Random Forests'. Not something I've come across, but from context I can figure out roughly what they are. But after a quick introduction saying you have to decide whether to use a decision tree or a random forest, there is this paragraph.

Book said:
If there are M input variable amounts then m<M is going to be specified from the beginning, and it will be held as a constant. The reason that this is so important is that it means that each tree that you have is randomly picked from their own variable using M.
What on earth does this mean?

Chester35

505 posts

62 months

Sunday 19th January 2020

It means that it is not a statistics book for beginners.

A statistics book for beginners should let you see the wood rather than the trees.

smile

At least 90% of the time, of course.


V8LM

5,265 posts

216 months

Sunday 19th January 2020
Also surprised that machine learning is covered in a beginner's book on statistics.


WatchfulEye

505 posts

135 months

Tuesday 21st January 2020
It means that in a random forest, each tree uses only some of the information available.

So, if you have 5 (M) variables in a classification task (e.g. leaf length, leaf width, stem length, stem diameter, petal count), then each individual tree is developed using fewer than 5 (m) of them.

For example, if m is 3: tree 1 (leaf length, stem length, petal count); tree 2 (leaf width, stem length, stem diameter); tree 3 (stem length, stem diameter, petal count); tree 4 (leaf length, leaf width, stem diameter).
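The idea can be sketched in a few lines of Python. This is only a toy illustration of the book's per-tree reading of m < M (variable names are made up for the example); in Breiman's full algorithm the m candidate features are typically re-drawn at each split within a tree, not once per tree.

```python
import random

# The M available input variables for the classification task
features = ["leaf_length", "leaf_width", "stem_length",
            "stem_diameter", "petal_count"]  # M = 5

m = 3  # fixed in advance and held constant for every tree (m < M)

random.seed(0)  # just to make this example repeatable

def feature_subset(features, m):
    """Randomly pick the m features one tree is allowed to use."""
    return random.sample(features, m)

# Each tree in the forest gets its own random subset of the features
forest_features = [feature_subset(features, m) for _ in range(4)]

for i, subset in enumerate(forest_features, start=1):
    print(f"tree {i}: {subset}")
```

Because each tree only ever sees its own random subset, the trees end up making different mistakes, and averaging their votes is what makes the forest more robust than any single tree.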