今回はまず 前回 の部分を説明した後、グループ分けについて説明する。 データの特性を考慮して、グループ毎の集計を行なうと、 今までは判らなかったデータの特徴を把握することができる。
/* Lesson 6-01 */ /* File Name = les0601.sas 11/11/04 */ data gakusei; infile 'all04b.prn' firstobs=2; input sex $ shintyou taijyuu kyoui jitaku $ kodukai carryer $ tsuuwa; proc print data=gakusei(obs=5); run; proc means data=gakusei; run; proc univariate data=gakusei plot; var shintyou taijyuu kyoui kodukai; run; proc chart data=gakusei; : ヒストグラム hbar shintyou taijyuu kyoui kodukai; : 指定した変量について計算 run; : : proc sort data=gakusei; : 並べ替え(ソート) by sex; : 性別ごとに run; : : proc means data=gakusei; : 平均の計算 by sex; : 性別ごとに run; : proc univariate data=gakusei plot; : 基礎統計量の計算 var shintyou taijyuu kyoui kodukai; : 指定した変量について計算 by sex; : 性別ごとに run; : proc chart data=gakusei; : ヒストグラム hbar shintyou taijyuu kyoui kodukai; : 指定した変量について計算 by sex; : 性別ごとに run; : proc chart data=gakusei; : ヒストグラム hbar shintyou taijyuu kyoui kodukai/group=sex; : 性別ごとに併置して run; :
SAS システム 2 17:55 Tuesday, November 9, 2004 Variable N Mean Std Dev Minimum Maximum --------------------------------------------------------------------- SHINTYOU 303 167.7584158 8.2069217 145.0000000 186.0000000 TAIJYUU 272 58.7084559 9.4277698 35.0000000 100.0000000 KYOUI 102 86.5196078 7.6827316 56.0000000 112.0000000 KODUKAI 292 49279.11 49464.64 0 300000.00 TSUUWA 95 7281.56 4734.60 200.0000000 30000.00 --------------------------------------------------------------------- SAS システム 3 17:55 Tuesday, November 9, 2004 Univariate Procedure Variable=SHINTYOU Moments N 303 Sum Wgts 303 Mean 167.7584 Sum 50830.8 Std Dev 8.206922 Variance 67.35356 Skewness -0.35873 Kurtosis -0.40208 USS 8547635 CSS 20340.78 CV 4.892107 Std Mean 0.471475 T:Mean=0 355.8159 Pr>|T| 0.0001 Num ^= 0 303 Num > 0 303 M(Sign) 151.5 Pr>=|M| 0.0001 Sgn Rank 23028 Pr>=|S| 0.0001 SAS システム 4 17:55 Tuesday, November 9, 2004 Univariate Procedure Variable=SHINTYOU Quantiles(Def=5) 100% Max 186 99% 183 75% Q3 173.8 95% 180 50% Med 169 90% 178 25% Q1 162 10% 156 0% Min 145 5% 153 1% 148 Range 41 Q3-Q1 11.8 Mode 170 SAS システム 7 17:55 Tuesday, November 9, 2004 Univariate Procedure Variable=SHINTYOU Histogram # Boxplot 187.5+* 2 | .********* 18 | .*********************** 45 | .***************************************** 81 +-----+ 167.5+****************************** 59 *--+--* .************************ 48 +-----+ .*************** 29 | .******** 15 | 147.5+*** 6 | ----+----+----+----+----+----+----+----+- * may represent up to 2 counts SAS システム 8 17:55 Tuesday, November 9, 2004 Univariate Procedure Variable=SHINTYOU Normal Probability Plot 187.5+ +++* | ******+*** | ******** | ********* 167.5+ ******++ | ******+ | ****** | +****** 147.5+**+** +----+----+----+----+----+----+----+----+----+----+ -2 -1 0 +1 +2 SAS システム 21 17:55 Tuesday, November 9, 2004 Univariate Procedure Variable=KODUKAI Moments N 292 Sum Wgts 292 Mean 49279.11 Sum 14389500 Std Dev 49464.64 Variance 2.4468E9 Skewness 1.705219 Kurtosis 4.109965 USS 1.421E12 CSS 7.12E11 CV 100.3765 Std Mean 2894.699 T:Mean=0 17.02391 Pr>|T| 0.0001 Num ^= 0 242 Num > 0 242 M(Sign) 121 Pr>=|M| 0.0001 Sgn Rank 14701.5 Pr>=|S| 0.0001 SAS システム 22 17:55 Tuesday, November 9, 2004 Univariate Procedure Variable=KODUKAI Quantiles(Def=5) 100% Max 300000 99% 200000 75% Q3 70000 95% 150000 50% Med 30000 90% 120000 25% Q1 20000 10% 0 0% Min 0 5% 0 1% 0 Range 300000 Q3-Q1 50000 Mode 0 SAS システム 25 17:55 Tuesday, November 9, 2004 Univariate Procedure Variable=KODUKAI Histogram # Boxplot 325000+* 2 * . .* 2 0 175000+***** 18 0 .******** 32 | .**************** 64 +-----+ 25000+******************************************** 174 *--+--* ----+----+----+----+----+----+----+----+---- * may represent up to 4 counts SAS システム 26 17:55 Tuesday, November 9, 2004 Univariate Procedure Variable=KODUKAI Normal Probability Plot 325000+ * | | * * 175000+ ********++++ | ******++++++ | +********+ 25000+** ************************* +----+----+----+----+----+----+----+----+----+----+ -2 -1 0 +1 +2 SAS システム 31 17:55 Tuesday, November 9, 2004 KODUKAI Cum. Cum. Midpoint Freq Freq Percent Percent | 0 |************* 67 67 22.95 22.95 30000 |********************* 104 171 35.62 58.56 60000 |*********** 54 225 18.49 77.05 90000 |******* 33 258 11.30 88.36 120000 |** 12 270 4.11 92.47 150000 |*** 16 286 5.48 97.95 180000 | 2 288 0.68 98.63 210000 | 2 290 0.68 99.32 240000 | 0 290 0.00 99.32 270000 | 0 290 0.00 99.32 300000 | 2 292 0.68 100.00 | ----+---+---+---+---+- 20 40 60 80 100 SAS システム 33 17:55 Tuesday, November 9, 2004 --------------------------------- SEX=F -------------------------------- Variable N Mean Std Dev Minimum Maximum --------------------------------------------------------------------- SHINTYOU 101 159.0267327 5.4951231 145.0000000 171.0000000 TAIJYUU 70 48.5314286 4.8016767 35.0000000 59.0000000 KYOUI 38 83.1842105 4.0527286 70.0000000 90.0000000 KODUKAI 98 49209.18 46883.49 0 300000.00 TSUUWA 44 6993.18 4654.30 200.0000000 25000.00 --------------------------------------------------------------------- SAS システム 34 17:55 Tuesday, November 9, 2004 --------------------------------- SEX=M -------------------------------- Variable N Mean Std Dev Minimum Maximum --------------------------------------------------------------------- SHINTYOU 201 172.1447761 5.3634583 156.0000000 186.0000000 TAIJYUU 201 62.2462687 7.9777628 46.0000000 100.0000000 KYOUI 64 88.5000000 8.6189161 56.0000000 112.0000000 KODUKAI 192 49187.50 50935.09 0 300000.00 TSUUWA 50 7480.96 4871.02 500.0000000 30000.00 --------------------------------------------------------------------- SAS システム 53 17:55 Tuesday, November 9, 2004 -------------------------------- SEX=F --------------------------------- Univariate Procedure Variable=SHINTYOU Moments N 101 Sum Wgts 101 Mean 159.0267 Sum 16061.7 Std Dev 5.495123 Variance 30.19638 Skewness -0.2286 Kurtosis -0.36121 USS 2557259 CSS 3019.638 CV 3.455471 Std Mean 0.546785 T:Mean=0 290.8395 Pr>|T| 0.0001 Num ^= 0 101 Num > 0 101 M(Sign) 50.5 Pr>=|M| 0.0001 Sgn Rank 2575.5 Pr>=|S| 0.0001 SAS システム 55 17:55 Tuesday, November 9, 2004 -------------------------------- SEX=F --------------------------------- Univariate Procedure Variable=SHINTYOU Quantiles(Def=5) 100% Max 171 99% 170 75% Q3 163 95% 167 50% Med 160 90% 166 25% Q1 156 10% 152 0% Min 145 5% 149 1% 146.7 Range 26 Q3-Q1 7 Mode 156 SAS システム 58 17:55 Tuesday, November 9, 2004 -------------------------------- SEX=F --------------------------------- Univariate Procedure Variable=SHINTYOU Stem Leaf # Boxplot 17 001 3 | 16 555566666667777 15 | 16 00000000000000111222222222333344444 35 +-----+ 15 555666666666666777788889999 27 +--+--+ 15 012222333333444 15 | 14 578899 6 0 ----+----+----+----+----+----+----+ Multiply Stem.Leaf by 10**+1 SAS システム 59 17:55 Tuesday, November 9, 2004 -------------------------------- SEX=F --------------------------------- Univariate Procedure Variable=SHINTYOU Normal Probability Plot 172.5+ +*+++* | *******+*+* | **********+ | *********+ | +******** 147.5+*+++*+*+** +----+----+----+----+----+----+----+----+----+----+ -2 -1 0 +1 +2 SAS システム 81 17:55 Tuesday, November 9, 2004 -------------------------------- SEX=M --------------------------------- Univariate Procedure Variable=SHINTYOU Moments N 201 Sum Wgts 201 Mean 172.1448 Sum 34601.1 Std Dev 5.363458 Variance 28.76669 Skewness -0.06893 Kurtosis 0.036975 USS 5962152 CSS 5753.337 CV 3.115667 Std Mean 0.378309 T:Mean=0 455.0373 Pr>|T| 0.0001 Num ^= 0 201 Num > 0 201 M(Sign) 100.5 Pr>=|M| 0.0001 Sgn Rank 10150.5 Pr>=|S| 0.0001 SAS システム 83 17:55 Tuesday, November 9, 2004 -------------------------------- SEX=M --------------------------------- Univariate Procedure Variable=SHINTYOU Quantiles(Def=5) 100% Max 186 99% 184 75% Q3 175 95% 180.5 50% Med 172 90% 179.9 25% Q1 168.6 10% 166 0% Min 156 5% 163 1% 160 Range 30 Q3-Q1 6.4 Mode 170 SAS システム 86 17:55 Tuesday, November 9, 2004 -------------------------------- SEX=M --------------------------------- Univariate Procedure Variable=SHINTYOU Histogram # Boxplot 187.5+* 2 0 .********* 18 | .*********************** 45 +-----+ 172.5+**************************************** 79 *--+--* .********************* 42 +-----+ .******* 14 | 157.5+* 1 0 ----+----+----+----+----+----+----+----+ * may represent up to 2 counts SAS システム 87 17:55 Tuesday, November 9, 2004 -------------------------------- SEX=M --------------------------------- Univariate Procedure Variable=SHINTYOU Normal Probability Plot 187.5+ ** | ******+***+ | *********+ 172.5+ ************ | *********++ | * ********+ 157.5+*++ +----+----+----+----+----+----+----+----+----+----+ -2 -1 0 +1 +2 SAS システム 109 17:55 Tuesday, November 9, 2004 Univariate Procedure Schematic Plots Variable=SHINTYOU 200 + | | 0 180 + | | | *--+--* | *--+--* | +-----+ 160 + *--+--* | | +-----+ 0 | 0 140 + ------------+-----------+-----------+----------- SEX F M SAS システム 110 17:55 Tuesday, November 9, 2004 Univariate Procedure Schematic Plots Variable=TAIJYUU | 100 + * | 0 | *--+--* | *--+--* 50 + *--+--* +-----+ | 0 | 0 + ------------+-----------+-----------+----------- SEX F M SAS システム 112 17:55 Tuesday, November 9, 2004 Univariate Procedure Schematic Plots Variable=KODUKAI 300000 + * * | | 200000 + * 0 | 0 | | 0 | 100000 + +-----+ | | | *--+--* +-----+ +-----+ | +-----+ *--+--* *--+--* 0 + | +-----+ ------------+-----------+-----------+----------- SEX F M SAS システム 116 17:55 Tuesday, November 9, 2004 -------------------------------- SEX=F --------------------------------- SHINTYOU Cum. Cum. Midpoint Freq Freq Percent Percent | 146 |** 2 2 1.98 1.98 150 |******* 7 9 6.93 8.91 154 |*************** 15 24 14.85 23.76 158 |************************* 25 49 24.75 48.51 162 |***************************** 29 78 28.71 77.23 166 |******************** 20 98 19.80 97.03 170 |*** 3 101 2.97 100.00 | -----+----+----+----+----+---- 5 10 15 20 25 Frequency SAS システム 120 17:55 Tuesday, November 9, 2004 -------------------------------- SEX=M --------------------------------- SHINTYOU Cum. Cum. Midpoint Freq Freq Percent Percent | 156 |* 1 1 0.50 0.50 159 |*** 5 6 2.49 2.99 162 |**** 7 13 3.48 6.47 165 |***** 10 23 4.98 11.44 168 |***************** 34 57 16.92 28.36 171 |*************************** 53 110 26.37 54.73 174 |********************* 42 152 20.90 75.62 177 |*********** 22 174 10.95 86.57 180 |********* 18 192 8.96 95.52 183 |**** 7 199 3.48 99.00 186 |* 2 201 1.00 100.00 | -----+----+----+----+----+-- 10 20 30 40 50 Frequency SAS システム 127 17:55 Tuesday, November 9, 2004 SEX SHINTYOU Cum. Cum. Midpoint Freq Freq Percent Percent | F 146 |* 2 3 0.66 0.99 150 |*** 7 10 2.31 3.30 154 |****** 15 25 4.95 8.25 158 |********** 25 50 8.25 16.50 162 |************ 29 79 9.57 26.07 166 |******** 20 99 6.60 32.67 170 |* 3 102 0.99 33.66 174 | 0 102 0.00 33.66 178 | 0 102 0.00 33.66 182 | 0 102 0.00 33.66 186 | 0 102 0.00 33.66 | M 146 | 0 102 0.00 33.66 150 | 0 102 0.00 33.66 154 | 0 102 0.00 33.66 158 | 1 103 0.33 33.99 162 |***** 12 115 3.96 37.95 166 |********* 22 137 7.26 45.21 170 |************************ 59 196 19.47 64.69 174 |*********************** 58 254 19.14 83.83 178 |************ 29 283 9.57 93.40 182 |******* 17 300 5.61 99.01 186 |* 3 303 0.99 100.00 | ----+---+---+---+---+---+ 10 20 30 40 50 60 Frequency SAS システム 135 17:55 Tuesday, November 9, 2004 SEX KODUKAI Cum. Cum. Midpoint Freq Freq Percent Percent | F 0 |*** 16 18 5.48 6.16 30000 |******** 38 56 13.01 19.18 60000 |****** 28 84 9.59 28.77 90000 |* 7 91 2.40 31.16 120000 |* 4 95 1.37 32.53 150000 | 2 97 0.68 33.22 180000 | 1 98 0.34 33.56 210000 | 1 99 0.34 33.90 240000 | 0 99 0.00 33.90 270000 | 0 99 0.00 33.90 300000 | 1 100 0.34 34.25 | M 0 |********** 51 151 17.47 51.71 30000 |************* 65 216 22.26 73.97 60000 |***** 26 242 8.90 82.88 90000 |***** 25 267 8.56 91.44 120000 |** 8 275 2.74 94.18 150000 |*** 14 289 4.79 98.97 180000 | 1 290 0.34 99.32 210000 | 1 291 0.34 99.66 240000 | 0 291 0.00 99.66 270000 | 0 291 0.00 99.66 300000 | 1 292 0.34 100.00 | ----+---+---+- 20 40 60 Frequency
注意1: 電子メールでの場合は、添付ファイルは使わないこと。
提出用メールアドレスは「hayashi@peter.rd.dnc.ac.jp」である。
また、提出日時はメールヘッダーから判断する。私からは受領確認メールを出すので、それを受け取った段階で提出作業完了とする。
注意2: 紙で提出する場合は、事務所の受付終了時刻に注意すること。提出日は事務室の受領印で判断する。
注意3: 連絡ページ
に受領した者の学籍番号を掲載するので、確認に使ってほしい。
data mon2004; infile 'd:\home\mon_all8d.csv' dlm=',' firstobs=2 truncover;
data mon2004; infile 'd:\home\mon_all8d.txt' dlm='09'x firstobs=2 truncover;