CIS 241, Dr. Ladd
spacebar
to go to the next slide, esc
/menu to navigate
It groups or reduces the number of features (dimensions) in your data into a more manageable number of columns.
SVD is a linear algebra method for refactoring a matrix (dataset) into a smaller matrix by finding multiple lines of best fit.
It’s closely related to Factor Analysis.
n_components
: the number of components the model will producewhiten
: an adjustment of results, good for when you will use components in another modelsvd_solver
: the exact SVD method you’ll use, usually can be left as ‘auto’random_state
.fit_transform()
to fit the model and get components in the same function.The resulting data will have the same number of rows but a new number of columns.
Put this data into a new dataframe to do something with it!
.components_
and .explained_variance_
attributes can help you understand your results.penguins
dataset..components_
to assess results.Good luck! 🐧🐧🐧