我在一个名为labels的表中有用户的交通信息,其中包含开始和结束时间以及使用的出行方式类型:
CREATE TABLE labels
(
user_id integer NOT NULL,
session_id int,
start_timestamp timestamp with time zone NOT NULL,
end_timestamp timestamp with time zone NOT NULL,
travelmode text,
PRIMARY KEY (user_id, start_timestamp, end_timestamp)
)
INSERT INTO labels (user_id,session_id,start_timestamp,end_timestamp,travelmode)
VALUES (11,0,'2007-06-26 11:32:29+01','2007-06-26 11:40:29+01','bus'),
(11,0,'2008-03-28 14:52:54+00','2008-03-28 15:59:59+00','train'),
(11,0,'2008-03-28 16:00:00+00','2008-03-28 22:02:00+00','train'),
(11,0,'2008-03-29 01:27:50+00','2008-03-29 15:59:59+00','train'),
(11,0,'2008-03-29 16:00:00+00','2008-03-30 15:59:59+01','train'),
(11,0,'2008-03-30 16:00:00+01','2008-03-31 03:13:11+01','train'),
(11,0,'2008-03-31 04:17:59+01','2008-03-31 15:31:06+01','train'),
(11,0,'2008-03-31 16:00:08+01','2008-03-31 16:09:01+01','taxi'),
(11,0,'2008-03-31 17:26:04+01','2008-04-01 00:35:26+01','train')在另一个表trajectories中,是每个用户的GPS采样(原始GPS,间隔约1-5秒采样):
CREATE TABLE trajectories
(
user_id int,
session_id int NOT NULL,
"timestamp" timestamp with time zone NOT NULL,
lat double precision NOT NULL,
lon double precision NOT NULL,
alt double precision,
PRIMARY KEY (session_id, "timestamp")
)
INSERT INTO trajectories (user_id,session_id,timestamp,lat,lon,alt)
VALUES (11,1002008,'2008-03-30 16:00:39+01',41.147205,95.457762,-777),
(11,1002008,'2008-03-30 16:01:38+01',41.153458,95.444897,-777),
(11,1002008,'2008-03-30 16:02:37+01',41.154867,95.429467,-777),
(11,1002008,'2008-03-30 16:03:36+01',41.154075,95.413863,-777),
(11,1002008,'2008-03-30 16:04:35+01',41.152223,95.398515,-777),
(11,1002008,'2008-03-31 02:52:11+01',43.697033,87.57619,-777),
(11,1002008,'2008-03-31 02:51:12+01',43.69425,87.579275,-777),
(11,1002008,'2008-03-31 02:50:13+01',43.689312,87.587815,-777),
(11,1002008,'2008-03-31 02:43:56+01',43.656445,87.634753,-777),
(11,1002008,'2008-03-31 02:42:56+01',43.649275,87.638028,-777),
(11,1002008,'2008-03-31 02:42:04+01',43.64454,87.63572,-777)因为我只对latitude 41.00 - 42.00和longitude 87.5 - 95.30中的出行模式(每种出行模式的计数)的统计信息感兴趣,所以我必须与包含lat lon信息的trajectories表进行连接,如果timestamp字段与该用户的labels表的start/end_timestamp一致,则匹配该字段。
我该怎么做呢?我添加了这个DB<>fiddle。
发布于 2020-06-24 18:59:34
这是您所描述的查询:
select l.travelmode, count(*)
from trajectories t join
labels l
on t.user_id = l.user_id and
t.timestamp between l.start_timestamp and l.end_timestamp
where t.lat between 41.00 and 42.00 and
t.lon between 87.5 and 95.30
group by l.travelmode;但是,对于您的示例数据,这不会返回任何行,因为时间戳在不同的年份。
https://stackoverflow.com/questions/62552092
复制相似问题