publications
publications by categories in reversed chronological order.
2024
- GraphScope Flex: LEGO-like Graph Computing StackSIGMOD 2024 (to appear) / arXiv preprint arXiv:2312.12107, 2024
- RAGraph: A Region-Aware Framework for Geo-Distributed Graph ProcessingProceedings of the VLDB Endowment, 2024
- XGNN: Boosting Multi-GPU GNN Training via Global GNN Memory StoreProceedings of the VLDB Endowment, 2024
- A Graph-Native Query Optimization FrameworkarXiv preprint arXiv:2401.17786, 2024
- Ingress: an automated incremental graph processing systemThe VLDB Journal, 2024
2023
- LONGNN: Spectral GNNs with Learnable Orthonormal BasisarXiv preprint arXiv:2303.13750, 2023
- Layph: Making Change Propagation Constraint in Incremental Graph Processing by Layering GrapharXiv preprint arXiv:2304.07458, 2023
- FLASH: A Framework for Programming Distributed Graph Processing AlgorithmsIn 2023 IEEE 39th International Conference on Data Engineering (ICDE) , 2023
- Bridging the Gap between Relational {OLTP} and Graph-based {OLAP}In 2023 USENIX Annual Technical Conference (USENIX ATC 23) , 2023
- {GLogS}: Interactive Graph Pattern Matching Query At Large ScaleIn 2023 USENIX Annual Technical Conference (USENIX ATC 23) , 2023
- Legion: Automatically Pushing the Envelope of Multi-GPU System for Billion-Scale GNN TrainingarXiv preprint arXiv:2305.16588, 2023
- Efficient Multi-GPU Graph Processing with Remote Work StealingIn 2023 IEEE 39th International Conference on Data Engineering (ICDE) , 2023
- Vineyard: Optimizing Data Sharing in Data-Intensive AnalyticsProceedings of the ACM on Management of Data / SIGMOD 2023, 2023
- The Linked Data Benchmark Council (LDBC): Driving competition and collaboration in the graph data management spacearXiv preprint arXiv:2307.04350, 2023
- GraphAr: An Efficient Storage Scheme for Graph Data in Data LakesarXiv preprint arXiv:2312.09577, 2023
- Unicron: Economizing Self-Healing LLM Training at ScalearXiv preprint arXiv:2401.00134, 2023
2022
- Linking entities across relations and graphsIn 2022 IEEE 38th International Conference on Data Engineering (ICDE) , 2022
- GNNLab: a factored system for sample-based GNN training over GPUsIn Proceedings of the Seventeenth European Conference on Computer Systems , 2022
- Banyan: a scoped dataflow engine for graph query servicearXiv preprint arXiv:2202.12530, 2022
- DMCS: Density modularity based community searchIn Proceedings of the 2022 International Conference on Management of Data , 2022
- ABC: attributed bipartite co-clusteringProceedings of the VLDB Endowment, 2022
2021
- FlexGraph: a flexible and efficient distributed framework for GNN trainingIn Proceedings of the Sixteenth European Conference on Computer Systems , 2021
- Automating incremental graph processing with flexible memoizationProceedings of the VLDB Endowment, 2021
- Incrementalizing graph algorithmsIn Proceedings of the 2021 International Conference on Management of Data , 2021
- GraphScope: a one-stop large graph processing systemProceedings of the VLDB Endowment, 2021
- GraphScope: a unified engine for big graph processingProceedings of the VLDB Endowment, 2021
2020
- Adaptive asynchronous parallelization of graph algorithmsACM Transactions on Database Systems (TODS), 2020
- Application driven graph partitioningIn Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data , 2020
2018
- Parallelizing sequential graph computationsACM Transactions on Database Systems (TODS), 2018
- From Think Parallel to Think SequentialACM SIGMOD Record, 2018
2017
- GRAPE: Parallelizing sequential graph computationsProceedings of the VLDB Endowment, 2017
- GRAPE: Conducting Parallel Graph Computations without Developing Parallel Algorithms.IEEE Data Eng. Bull., 2017
2014
- Bounded conjunctive queriesProceedings of the VLDB Endowment, 2014
- Conflict resolution with data currency and consistencyJournal of Data and Information Quality (JDIQ), 2014
- The IUPHAR/BPS Guide to PHARMACOLOGY: an expert-driven knowledgebase of drug targets and their ligandsNucleic acids research, 2014
2013
- Inferring data currency and consistency for conflict resolutionIn 2013 IEEE 29th International Conference on Data Engineering (ICDE) , 2013
- Data quality problems beyond consistency and deduplicationIn Search of Elegance in the Theory and Practice of Computation: Essays Dedicated to Peter Buneman, 2013
- Determining the relative accuracy of attributesIn Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data , 2013
2012
- Incremental detection of inconsistencies in distributed dataIEEE Transactions on Knowledge and Data Engineering, 2012
- Towards certain fixes with editing rules and master dataThe VLDB journal, 2012
- Botzone: A game playing system for artificial intelligence educationIn Proceedings of the International Conference on Frontiers in Education: Computer Science and Computer Engineering (FECS) , 2012
2011
- Interaction between record matching and data repairingIn Proc. SIGMOD , 2011
- CerFix: A system for cleaning data with certain fixesProceedings of the VLDB Endowment, 2011
2010
- Towards certain fixes with editing rules and master dataProceedings of the VLDB Endowment, 2010