-
Notifications
You must be signed in to change notification settings - Fork 38
大数据黑客马拉松wiki
欢迎来到实时大数据黑客松wiki
我们特别鼓励参赛选手可以使用自带数据参赛,如果没有自己的数据,我们很荣幸从合作伙伴得到如下数据,供黑客松使用:
**来源:**聚合数据
**数据源URL:**http://odata.juhe.cn/train/JeffersonServlet.cn/$metadata?key=e0b47155b2c3d9544934a82d58d55955
**余票查询URL:**http://odata.juhe.cn/train/yp?from=苏州北&to=北京南&date=2014-12-17&tt=G&key=e0b47155b2c3d9544934a82d58d55955
***参数描述:***key为客户申请的key;from为出发站(注:必须为准确的站名,如苏州北,不能输入只输入苏州);to为到达站(注:同from);date为日期,tt为车类型(注:默认查询所有)
**Key: ** [非公开] - 只提供给参赛选手. 需要这个数据的参赛选手请发email至[zhshang@microsoft.com]索取
**时效:**实时
余票查询返回样本:
{"d":{"results":[{"__metadata": {"id":"http://127.0.0.1:8080/train/JeffersonServlet.cn/Trains(387)","uri":"http://127.0.0.1:8080/train/JeffersonSe rvlet.cn/Trains(387)","type":"odata2_jpa2.Train"},"Arrive_time":"12:42","Day_difference":"0","End_station_name":"北京南","From_station_name":"苏州北","Gr_num":"--","Head":"G","Id":387,"Lishi":"05:08","Qt_num":"--","RunDate":"2014-12-17","Rw_num":"--","Rz_num":"--","Start_station_name":"上海虹桥","Start_time":"07:34","Swz_num":"21","To_station_name":"北京南","Train_class_name":"","Train_no":"G104","Tz_num":"--","Wz_num":"--","Yw_num":"--","Yz_num":"--","Ze_num":"532","Zy_num":"129"},{"__metadata":{"id":"http://127.0.0.1:8080/train/JeffersonServlet.cn/Trains(388)","uri":"http://127.0.0.1:8080/train/JeffersonServlet.cn/Trains(388)","type":"odata2_jpa2.Train"},"Arrive_time":"13:33","Day_difference":"0","End_station_name":"北京南","From_station_name":"苏州北","Gr_num":"--","Head":"G","Id":388,"Lishi":"05:37","Qt_num":"--","RunDate":"2014-12-17","Rw_num":"--","Rz_num":"--","Start_station_name":"上海虹桥","Start_time":"07:56","Swz_num":"9","To_station_name":"北京南","Train_class_name":"","Train_no":"G108","Tz_num":"--","Wz_num":"--","Yw_num":"--","Yz_num":"--","Ze_num":"634","Zy_num":"5"}]}}
示范场景:火车票数据和目的地的结合,或者和地图数据的结合,乃至动态推荐转车路线
###2. 全国飞机航班状态实时数据
**来源:**聚合数据
**数据源URL:**http://odata.juhe.cn/plane/JeffersonServlet.cn/$metadata?key=5594f94fe6c794861fd3204ea2eff915
**城市列表查询URL:**http://odata.juhe.cn/plane/flight/cities?key=5594f94fe6c794861fd3204ea2eff915
**机场简介查询URL:**http://odata.juhe.cn/plane/flight/airports?key=5594f94fe6c794861fd3204ea2eff915
**航班查询URL:**http://odata.juhe.cn/plane/flight/airNumber?name=TV9802&date=2014-12-17&key=5594f94fe6c794861fd3204ea2eff915
**航线查询URL:**http://odata.juhe.cn/plane/flight/airLine?start=%E6%97%A0%E9%94%A1&end=%E5%8C%97%E4%BA%AC&date=2014-12-17&key=5594f94fe6c794861fd3204ea2eff915
***参数描述:***key为客户向聚合申请的key,name为航班号,start:出发城市,end:到达城市,date:日期
**Key: ** [非公开] - 只提供给参赛选手,如有需要的参赛选手请发送email至[zhshang@microsoft.com]索取
**时效:**实时
航班查询样本:
{"d":{"results":[{"__metadata":{"id":"http://127.0.0.1:8080/plane/JeffersonServlet.cn/Planes(4)","uri":"http://127.0.0.1:8080/plane/JeffersonServlet.cn/Planes(4)","type":"odata2_jpa2.Plane"},"Aactual":"","Aexpected":"","AirAge":"","AirLineTel":"4008089188","AirVoyage":"","Airmodel":"","AllJingTing":"","ArrAirport":"贡嘎机场","ArrCode":"LXA","ArrCode4":"ZULS","ArrDelay":"","ArrPoint":"29.296075181823,90.9138842361918","ArrTel":"089196222","ArrTemperature":"","ArrTerminal":"","ArrTime":"2014-12-17 21:25","ArrTrafficState":"","ArrWeather":"","BoardingGate":"","Dactual":"","DepAirport":"虹桥机场","DepCode":"SHA","DepCode4":"ZSSS","DepDelay":"","DepPoint":"31.1968058067475,121.338294093862","DepTel":"02196990","DepTemperature":"","DepTerminal":"T2","DepTime":"2014-12-17 14:40","DepTrafficState":"","DepWeather":"","Dexpected":"","FlyTime":"405","InfoContent":"","LeaveTime":"405","OnFlight":"0,0,0,0","OnTimeRate":"90.00%","OnTimeRateHistory":"","VZdeptime":"","ZDInfoContent":"","ZJTable":"","BagClaim":"","Company":"TV","End":"拉萨","FlyDate":"2014-12-17","Food":"1","Id":4,"NowPoint":"0,0","Start":"上海虹桥","Status":"计划","Title":"TV9802"}]}}
航线查询样本:
{"d":{"results":[{"__metadata":{"id":"http://127.0.0.1:8080/plane/JeffersonServlet.cn/City_citys(187)","uri":"http://127.0.0.1:8080/plane/JeffersonServlet.cn/City_citys(187)","type":"odata2_jpa2.City_city"},"Aactual":"","Aexpected":"","AirAge":"","AirLineTel":"400888666","AirVoyage":"","Airmodel":"","AllJingTing":"","ArrAirport":"首都机场","ArrCode":"PEK","ArrCode4":"ZBAA","ArrDelay":"","ArrPoint":"40.0768050970123,116.588355358","ArrTel":"01096158","ArrTemperature":"","ArrTerminal":"","ArrTime":"2014-12-17 06:25","ArrTrafficState":"","ArrWeather":"","BoardingGate":"","Dactual":"","DepAirport":"硕放机场","DepCode":"WUX","DepCode4":"ZSWX","DepDelay":"","DepPoint":"31.4958533394482,120.426617980897","DepTel":"051085322000","DepTemperature":"","DepTerminal":"","DepTime":"2014-12-17 04:30","DepTrafficState":"","DepWeather":"","Dexpected":"","FlyTime":"115","InfoContent":"","LeaveTime":"115","OnFlight":"0,0,0,0","OnTimeRate":"","OnTimeRateHistory":"","VZdeptime":"","ZDInfoContent":"","ZJTable":"","BagClaim":"","Company":"东海航空","End":"北京","FlyDate":"2014-12-17","Food":"0","Id":187,"NowPoint":"0,0","Start":"无锡","Status":"计划","Title":"DZ6218"}]}}
示范场景:飞机和火车信息结合,提供实时优化路线推荐,或者全国实时航班可视化
###3. 天猫店铺交易记录公开数据
**来源:**聚合数据
URL: trade_20141110.json.bz2 https://hackathon.blob.core.chinacloudapi.cn/alibabadata/trade_20141110.json.bz2
2014/12/4 18:33:26 326.36 MB
trade_20141111.json.bz2 https://hackathon.blob.core.chinacloudapi.cn/alibabadata/trade_20141111.json.bz2
2014/12/4 18:37:20 4.39 GB
trade_20141112.json.bz2 https://hackathon.blob.core.chinacloudapi.cn/alibabadata/trade_20141112.json.bz2
2014/12/4 18:34:19 443 MB
trade_20141113.json.bz2 https://hackathon.blob.core.chinacloudapi.cn/alibabadata/trade_20141113.json.bz2
2014/12/4 18:33:31 380.78 MB
trade_20141114.json.bz2 https://hackathon.blob.core.chinacloudapi.cn/alibabadata/trade_20141114.json.bz2
2014/12/4 18:33:24 348.18 MB
trade_20141115.json.bz2 https://hackathon.blob.core.chinacloudapi.cn/alibabadata/trade_20141115.json.bz2
2014/12/4 18:33:34 340.33 MB
trade_20141116.json.bz2 https://hackathon.blob.core.chinacloudapi.cn/alibabadata/trade_20141116.json.bz2
2014/12/4 18:32:42 358.39 MB
trade_20141117.json.bz2 https://hackathon.blob.core.chinacloudapi.cn/alibabadata/trade_20141117.json.bz2
2014/12/4 18:32:40 395.52 MB
**时效:**历史数据 11/10-11/17
样本: 【非公开】- 只提供给参赛选手
示范场景:类似阿里实时监控大屏;峰值实时预警;实时趋势可视化。。。
###4. 某城市出租汽车GPS历史信息 (仅供测试)
来源: 政府机构
**URL:**http://wuhanedz.chinacloudapp.cn/dataset/gps-shi-li
**数据接口:**http://wuhanedz.chinacloudapp.cn/datastore/odata3.0/d8d4fef8-4738-403c-bc00-763e543445d0
**时效:**历史数据带时间戳
样本:
[id,startdate,lon,lat,color] : {[1088,'2013-10.12',115,28,'white'].......}
示范场景:汽车实时位置,密度可视化,监控
###5. 武汉市政府招投标和采购数据
**来源:**武汉市政府
**URL:**http://www.wedz.gov.cn/publish/zkjjkfq/zwbd/zfcg/ptp_GroupList.html
**parsed data:https://hackathon.blob.core.chinacloudapi.cn/alibabadata/zbgg.tar.gz 324.05 KB
https://hackathon.blob.core.chinacloudapi.cn/alibabadata/cggg.tar.gz 558.34 KB
**时效:**历史数据
样本:[文本] - 需要parsing
示范场景:实时招投标和采购聚类分析,应用于商业情报或者股票分析,广告等
###6. 武汉环卫局空气质量定时发布数据
**来源:**武汉市环境保护局
**URL:**http://www.whepb.gov.cn/airInfoView.jspx
**parsed data: 由于数据源网站有自动的反爬虫机制,所以很难拿到全的数据
**时效:**历史数据 1/1/2014 - 11/25/2014
样本:
【日期, 监测点位, 二氧化硫, 二氧化氮, 可吸入颗粒物, 一氧化碳, O₃(8h), O₃(1h), 细颗粒物, 空气质量指数, 首要污染物, 空气质量指数级别, 指数类别 】: {【2014年11月25日, 城 区, 9, 59, 56, 31, 10, 10, 108, 108, PM2.5, 三级, 轻度污染 】}
示范场景:实时空气污染可视化,预警,趋势分析