skimr icon indicating copy to clipboard operation
skimr copied to clipboard

Truncated factor levels

Open jarbet opened this issue 4 months ago • 0 comments

By default, it seems skim truncates long factor levels. Is there an option to print the entire factor level?

suppressPackageStartupMessages(library(skimr));
suppressPackageStartupMessages(library(rchallenge));

data('german', package = 'rchallenge');

# notice the factor levels are truncated in the skim output
table(german$status);
#> 
#>                        no checking account 
#>                                        274 
#>                                 ... < 0 DM 
#>                                        269 
#>                           0<= ... < 200 DM 
#>                                         63 
#> ... >= 200 DM / salary for at least 1 year 
#>                                        394
table(german$credit_history);
#> 
#>             delay in paying off in the past 
#>                                          40 
#>    critical account/other credits elsewhere 
#>                                          49 
#> no credits taken/all credits paid back duly 
#>                                         530 
#>    existing credits paid back duly till now 
#>                                          88 
#>     all credits at this bank paid back duly 
#>                                         293
table(german$purpose);
#> 
#>              others           car (new)          car (used) furniture/equipment 
#>                 234                 103                 181                 280 
#>    radio/television domestic appliances             repairs           education 
#>                  12                  22                  50                   0 
#>            vacation          retraining            business 
#>                   9                  97                  12
skim(german)
Name german
Number of rows 1000
Number of columns 21
_______________________
Column type frequency:
factor 18
numeric 3
________________________
Group variables None

Data summary

Variable type: factor

skim_variable n_missing complete_rate ordered n_unique top_counts
status 0 1 FALSE 4 …: 394, no : 274, …: 269, 0<=: 63
credit_history 0 1 FALSE 5 no : 530, all: 293, exi: 88, cri: 49
purpose 0 1 FALSE 10 fur: 280, oth: 234, car: 181, car: 103
savings 0 1 FALSE 5 unk: 603, …: 183, …: 103, 100: 63
employment_duration 0 1 FALSE 5 1 <: 339, >= : 253, 4 <: 174, < 1: 172
installment_rate 0 1 TRUE 4 < 2: 476, 25 : 231, 20 : 157, >= : 136
personal_status_sex 0 1 FALSE 4 mal: 548, fem: 310, fem: 92, mal: 50
other_debtors 0 1 FALSE 3 non: 907, gua: 52, co-: 41
present_residence 0 1 TRUE 4 >= : 413, 1 <: 308, 4 <: 149, < 1: 130
property 0 1 FALSE 4 bui: 332, unk: 282, car: 232, rea: 154
other_installment_plans 0 1 FALSE 3 non: 814, ban: 139, sto: 47
housing 0 1 FALSE 3 ren: 714, for: 179, own: 107
number_credits 0 1 TRUE 4 1: 633, 2-3: 333, 4-5: 28, >= : 6
job 0 1 FALSE 4 ski: 630, uns: 200, man: 148, une: 22
people_liable 0 1 FALSE 2 0 t: 845, 3 o: 155
telephone 0 1 FALSE 2 no: 596, yes: 404
foreign_worker 0 1 FALSE 2 no: 963, yes: 37
credit_risk 0 1 FALSE 2 goo: 700, bad: 300

Variable type: numeric

skim_variable n_missing complete_rate mean sd p0 p25 p50 p75 p100 hist
duration 0 1 20.90 12.06 4 12.0 18.0 24.00 72 ▇▇▂▁▁
amount 0 1 3271.25 2822.75 250 1365.5 2319.5 3972.25 18424 ▇▂▁▁▁
age 0 1 35.54 11.35 19 27.0 33.0 42.00 75 ▇▆▃▁▁

Created on 2024-10-22 with reprex v2.1.1

jarbet avatar Oct 22 '24 15:10 jarbet