I have data of students from several schools. I want to show a histogram of the percentage of all students that passed the test in each school, using R.
My data looks like this (id,school,passed/failed):
432342 school1 passed
454233 school2 failed
543245 school1 failed
(The point is that I am only interested in the percent of students that passed, obviously those that didn't passed have failed. I want to have one column for each school that shows the percent of the students in that school that passed)
there are many ways to do that. one is:
df<-data.frame(ID=sample(100), school=factor(sample(3,100,TRUE),labels=c("School1","School2","School3")), result=factor(sample(2,100,TRUE),labels=c("passed","failed"))) p<-aggregate(df$result=="passed"~school, mean, data=df) barplot(p[,2]*100,names.arg=p[,1])