인공지능/머신러닝

[ML Algorithm] StratifiedKFold / StratifiedShuffleSplit 차이

RosyPark 2020. 6. 3. 13:38

 

1. StratifiedKFold : 두 배열을 섞지 만 각 행을 레이블로 유지합니다.

2. StratifiedKFOld + Shuffle : 교차 검증 전에 두 배열을 섞습니다. 따라서 각 행은 더 이상 해당 레이블에 연결되지 않습니다. 그렇기 때문에 정확도가 1에 비해 나쁩니다.

3. StratifiedShuffleSplit : 배열이 이미 2만큼 섞여서 더 이상 행과 레이블 사이에 링크가 없기 때문에 정확도는 여전히 나쁘고 2와 같습니다. 그러나 "독립형"으로 실행했을 때 정확도는 1만큼 좋았습니다. 따라서 기본적으로 1과 3은 동일합니다.

 

 

 

 

1. stackoverflow.com/questions/37635460/stratifiedkfold-vs-stratifiedshufflesplit-vs-stratifiedkfold-shuffle

 

StratifiedKFold vs StratifiedShuffleSplit vs StratifiedKFold + Shuffle

What is the difference between: StratifiedKFold, StratifiedShuffleSplit, StratifiedKFold + Shuffle? When should I use each one? When I get a better accuracy score? Why I do not get similar results? I

stackoverflow.com

2. scikit-learn.org/stable/modules/generated/sklearn.model_selection.StratifiedKFold.html

 

sklearn.model_selection.StratifiedKFold — scikit-learn 0.23.1 documentation

 

scikit-learn.org