问题描述:
MelbCV.csv是墨尔本人行道监控数据的一个子集如下图,依据统计数据回答各个监控点一天的哪些时段是行人通过的高峰?所有监控点是否有一致的人流高峰时段?若有,是哪个(些)时段?哪个路口人流量最大?哪个路口人流量最小?依据什么得出的结论?人流量的最大值出现在哪个监控点的哪天的哪个时段?
下面是数据集:
下面是数据集的说明
colname含义类型唯一值个数众数Date日期character30所有值出现频次相同Year年份double1所有值出现频次相同,2012Month月份double1所有值出现频次相同,9Mdate日double30所有值出现频次相同Weekday_End表示周末或工作日,10代表工作日,20代表周末double2所有值出现频次相同,10,20Day表示一周里的第几天,周日是第一天double71,7Hour表示一天中的第几个小时double24所有值出现频次相同Town_Hall-West在Town_Hall-West的人行道通过人数double61335Collins Place-South在Collins Place-South的人行道通过人数double48510Australia on Collins在Australia on Collins的人行道通过人数double49816Bourke Street Mall-South在Bourke Street Mall-South的人行道通过人数double55212Bourke Street Mall-North在Bourke Street Mall-North的人行道通过人数double5484Melbourne Central在Melbourne Central的人行道通过人数double59832Flagstaff Station在Flagstaff Station的人行道通过人数double39210State Library在State Library的人行道通过人数double30824
下面是相关的部分代码:
alterMelbCV
alter[is.na(alter)]mean(MelbCV$State Library
,na.rm=TRUE)
t_peakgroup_by(alter,Hour)%>%summarise(count=n(),mean1=mean(Town_Hall-West
),
mean2=mean(Collins Place-South
),
mean3=mean(Australia on Collins
),
mean4=mean(Bourke Street Mall-South
),
mean5=mean(Bourke Street Mall-North
),
mean6=mean(Melbourne Central
),
mean7=mean(Flagstaff Station
),
mean8=mean(State Library
))
ggplot(t_peak)+ylab("人数")+
geom_line(aes(Hour,mean1,color="tomato"))+
geom_line(aes(Hour,mean2,color="orange"))+
geom_line(aes(Hour,mean3,color="mediumblue"))+
geom_line(aes(Hour,mean4,color="purple"))+
geom_line(aes(Hour,mean5,color="gold"))+
geom_line(aes(Hour,mean6,color="hotpink"))+
geom_line(aes(Hour,mean7,color="green"))+
geom_line(aes(Hour,mean8,color="black"))+
scale_x_continuous(breaks=seq(0:22))
做出的图和我设定的颜色没有一毛钱关系
原因分析:
解决方案:
转载自知乎@陆壹肆
根据上面的文章,将上面的代码做了一些修改
alterMelbCV
alter[is.na(alter)]mean(MelbCV$State Library
,na.rm=TRUE)
t_peakgroup_by(alter,Hour)%>%summarise(mean1=mean(Town_Hall-West
),
mean2=mean(Collins Place-South
),
mean3=mean(Australia on Collins
),
mean4=mean(Bourke Street Mall-South
),
mean5=mean(Bourke Street Mall-North
),
mean6=mean(Melbourne Central
),
mean7=mean(Flagstaff Station
),
mean8=mean(State Library
))
col_fillcolnames(MelbCV[,8:15])
max_numapply(t_peak,2,FUN = function(x) {
max(x)
})
max_hourapply(t_peak,2,FUN = function(x) {
which.max(x)-1
})
library(reshape2)
long_peak melt(t_peak, id="Hour")
ggplot(data=long_peak,aes(x=Hour, y=value, group=variable, color=variable))+
geom_line()
Original: https://blog.csdn.net/liquiior/article/details/122562494
Author: liquiior
Title: 数据分析(R语言)——ggplot绘制多线图结果图颜色不匹配
原创文章受到原创版权保护。转载请注明出处:https://www.johngo689.com/698373/
转载文章受原作者版权保护。转载请注明原作者出处!