CIS 241, Dr. Ladd
spacebar to go to the next slide, esc/menu to navigate
It groups or reduces the number of features (dimensions) in your data into a more manageable number of columns.
SVD is a linear algebra method for refactoring a matrix (dataset) into a smaller matrix by finding multiple lines of best fit.
It’s closely related to Factor Analysis.
n_components: the number of components the model will producewhiten: an adjustment of results, good for when you will use components in another modelsvd_solver: the exact SVD method you’ll use, usually can be left as ‘auto’random_state.fit_transform() to fit the model and get components in the same function.The resulting data will have the same number of rows but a new number of columns.
Put this data into a new dataframe to do something with it!
.components_ and .explained_variance_ attributes can help you understand your results.penguins dataset..components_ to assess results.Good luck! 🐧🐧🐧