HDST-GNN: Heterogeneous Dynamic Spatiotemporal Graph Neural Networks for Multi-Object Tracking in UAV Aerial Imagery
Phillip Jiang
We present HDST-GNN, a heterogeneous dynamic spatiotemporal graph neural network for multi-object tracking (MOT) in UAV aerial imagery. The model represents each frame as a heterogeneous graph with three node types — new detections, confirmed tracklets, and lost tracklets — and builds edges adaptively based on estimated camera altitude. An occlusion-gated temporal aggregation mechanism prevents occluded objects from corrupting neighboring trajectory representations. HDST-GNN achieves 94.51% MOTA and 97.24% IDF1 on standard UAV MOT benchmarks, reducing identity switches by 81% over the SORT baseline.
RelGT-AC: A Relational Graph Transformer for Autocomplete Tasks in Relational Databases
Phillip Jiang
We introduce RelGT-AC, a relational graph transformer designed for autocomplete prediction tasks in relational databases. The model constructs a heterogeneous graph over database tables and rows, encodes text-heavy features via a pretrained language model backbone, and applies graph transformer attention across relational edges. RelGT-AC achieves up to +10 AUROC over GraphSAGE baselines on standard relational database ML benchmarks, demonstrating that graph transformer architectures generalize effectively to structured tabular domains.