Publications
- Yifan Yang, Shimin Chen.
"LITS: An Optimized Learned Index for Strings".
(pdf, code).
PVLDB 17(11): 3415-3427, 2024
(PVLDB'24).
- Yuzhou Cai, Shimin Chen.
"CPU and GPU Hash Joins on Skewed Data"
(pdf).
HardBD & Active workshop, co-located at ICDE'24
(HardBD & Active'24).
- Yiyuan Chen, Shimin Chen.
"Cabin: a Compressed Adaptive Binned Scan Index"
(pdf, code).
Proceedings of the ACM on Management of Data 2(1): 57:1-57:26
(SIGMOD'24).
- Zehai Yang, Shimin Chen.
"MOST: Model-Based Compression with Outlier Storage for Time Series Data"
(pdf, code).
Proceedings of the ACM on Management of Data 1(4): 250:1-250:29
(SIGMOD'24).
- Zhuoxuan Liu, Shimin Chen.
"Pea Hash: A Performant Extendible Adaptive Hashing Index"
(pdf, code).
Proceedings of the ACM on Management of Data 1(1): 108:1-108:25
(SIGMOD'23).
- Gang Liu, Leying Chen, Shimin Chen.
"Zen+: a robust NUMA-aware OLTP engine optimized for non-volatile main memory"
(pdf).
VLDB Journal 32(1): 123-148
(VLDBJ'23).
- Zichun Huang, Shimin Chen.
"Density-optimized Intersection-free Mapping and Matrix Multiplication for Join-Project Operations".
(pdf, code).
PVLDB 15(10): 2244-2256, 2022
(PVLDB'22).
- Leon Lee, Siphrey Xie, Yunus Ma, Shimin Chen.
"Index Checkpoints for Instant Recovery in In-Memory Database Systems"
(pdf).
PVLDB 15(8): 1671-1683, 2022
(PVLDB'22).
- Jiacheng Wu, Yong Zhang, Shimin Chen, Yu Chen, Jin Wang, Chunxiao Xing.
"Updatable Learned Index with Precise Positions"
(pdf).
PVLDB 14(8): 1276-1288, 2021
(PVLDB'21).
- Gang Liu, Leying Chen, Shimin Chen.
"Zen: a High-Throughput Log-Free OLTP Engine for Non-Volatile Main Memory"
(pdf, code).
PVLDB, 14(5): 835 - 848, 2021.
(PVLDB'21).
- Shaonan Ma, Kang Chen, Shimin Chen, Mengxing Liu, Jianglang Zhu, Hongbo Kang, and Yongwei Wu.
"ROART: Range-query Optimized Persistent ART"
(pdf).
19th USENIX Conference on File and Storage Technologies
(FAST'21).
- Mengsu Ding, Shimin Chen, Nantia Makrynioti, Stefan Manegold.
"Progressive Join Algorithms Considering User Preference"
(pdf,talk).
11th Conference on Innovative Data Systems Research
(CIDR'21).
- Songjie Niu, Shimin Chen.
"TransGPerf: Exploiting Transfer Learning for Modeling Distributed Graph Computation Performance"
(pdf).
Journal of Computer Science and Technology, 36(4): 778-791, 2021
(JCST'21).
- Jihang Liu, Shimin Chen, Lujun Wang.
"LB+-Trees: Optimizing Persistent Index Performance on 3DXPoint Memory"
(pdf,talk,code).
PVLDB 13(7): 1078-1090.
(PVLDB'20).
- Jihang Liu, Shimin Chen.
"Initial Experience with 3D XPoint Main Memory"
(pdf).
HardBD & Active workshop, co-located at ICDE'19
(HardBD & Active'19).
Journal version in DAPD SI on Self-Managing and Hardware-Optimized Database Systems,
currently available online.
- Mengsu Ding, Shimin Chen.
"Efficient Partitioning and Query Processing of Spatio-Temporal Graphs with Trillion Edges"
(pdf).
In proceedings of the 35th International Conference on Data Engineering, 2019
(ICDE'19).
- Zhiyi Wang, Dongyan Zhou, Shimin Chen.
"STEED: An Analytical Database System for TrEE-structured Data"
(pdf).
In PVLDB Volume 10 Issue 12, 2017 (PVLDB'17 demo).
- Zhiyi Wang, Shimin Chen.
"Exploiting Common Patterns for Tree-Structured Data"
(pdf).
In proceedings of the 2017 SIGMOD International Conference on Management of Data (SIGMOD'17).
- Shimin Chen.
"Tree-Structured Big Data Management System"
(pdf).
Communications of CCF, Volume 13, Number 8, pp.22-26, 2017
(CCCF'17).
- Mengsu Ding, Shimin Chen.
"Helius: A Lightweight Big Data Processing System"
(pdf).
Journal of Computer Application, Volume 37, Number 2, pp.305-310, 2017
(JCA'17).
- Yanfeng Zhang, Shimin Chen, Ge Yu.
"Efficient Distributed Density Peaks for Clustering Large Data Sets in MapReduce"
(pdf).
IEEE Transactions on Knowledge and Data Engineering, Volume 28, Number 12, pp.3218-3230, Dec 2016
(TKDE'16).
- Dongxiao Song, Shimin Chen.
"Exploiting SIMD for Complex Numerical Predicates"
(pdf).
International Workshop on Big Data Management on Emerging Hardware, co-located at ICDE'16
(HardBD'16).
- Yanfeng Zhang, Shimin Chen, Qiang Wang, Ge Yu.
"i2MapReduce: Incremental MapReduce for Mining Evolving Big Data"
(pdf).
IEEE Transactions on Knowledge and Data Engineering, Volume 27, Number 7, pp.1906-1919, July 2015
(TKDE'15).
- Shimin Chen, Qin Jin.
"Persistent B+-Trees in Non-Volatile Main Memory"
(pdf).
VLDB 2015 (PVLDB Volume 8 Issue 7), March 2015.
(VLDB'15).
- Manos Athanassoulis, Shimin Chen, Anastasia Ailamaki,
Phillip B. Gibbons, Radu Stoica.
"Online Updates on Data Warehouses via Judicious Use of Solid-State Storage"
(pdf).
ACM Transaction on Database Systems, Volume 40, Number 1, March 2015
(TODS'15).
- Songjie Niu, Shimin Chen.
"Optimizing CPU Cache Performance for Pregel-Like Graph Computation"
(pdf).
International Workshop on Big Data Management on Emerging Hardware, co-located at ICDE'15
(HardBD'15).
- Shimin Chen.
"Big Data Analysis and Data Velocity"
(pdf).
Journal of Computer Research and Development (in Chinese), Volume 52, 2015
(JCRD'15).
- Zhao Cao, Shimin Chen,
Dongzhe Ma, Jianhua Feng, Min Wang.
"Efficient and Flexible Index Access in MapReduce"
(pdf).
17th International Conference on Extending Database Technology
(EDBT'14).
- Zhao Cao, Shimin Chen,
Feifei Li, Min Wang, X. Sean Wang.
"LogKV: Exploiting Key-Value Stores for Event Log Processing"
(pdf).
6th biennial Conference on Innovative Data Systems Research
(CIDR'13).
- Yanfeng Zhang, Shimin Chen.
"i2MapReduce: Incremental Iterative MapReduce"
(pdf).
2nd International Workshop on Cloud Intelligence (colocated with VLDB 2013)
(Cloud-I'13).
- Peng Jiang, Huiman Hou, Lijiang Chen,
Shimin Chen,
Conglei Yao, Chengkai Li, Min Wang.
"Wiki3C: Exploiting Wikipedia for Context-aware Concept Categorization"
(pdf).
6th ACM International Conference on Web Search and Data Mining
(WSDM'13).
- Lijiang Chen, Yibing Zhao,
Shimin Chen,
Hui Fang, Chengkai Li, Min Wang.
"iPLUG: Personalized List Recommendation in Twitter"
(pdf).
14th International Conference on Web Information Systems Engineering
(WISE'13).
- Michelle L. Goodstein, Shimin Chen,
Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry.
"Chrysalis Analysis: Incorporating Synchronization Arcs in Dataflow-Analysis-based Parallel Monitoring"
(pdf).
21st International Conference on Parallel Architectures and Compilation Techniques
(PACT'12).
- Sean M. Snyder, Shimin Chen,
Panos K. Chrysanthis, Alexandros Labrinidis.
"QMD: Exploiting Flash for Energy Efficient Disk Arrays"
(pdf).
Seventh International Workshop on Data Management on New Hardware
(DaMoN'11).
- Manos Athanassoulis, Shimin Chen, Anastasia Ailamaki,
Phillip B. Gibbons, Radu Stoica.
"MaSM: Efficient Online Updates in Data Warehouses"
(pdf).
In proceedings of the 2011 SIGMOD International Conference on Management of Data (SIGMOD'11).
- Shimin Chen, Phillip B. Gibbons,
Michael Kozuch, Todd C. Mowry.
"Log-Based Architectures:
Using Multicore to Help Software Behave Correctly"
(pdf).
ACM SIGOPS Operating Systems Review, Volume 45 Issue 1, January 2011
(OS Review'11).
- Shimin Chen, Phillip B. Gibbons, Suman Nath.
"Rethinking Database Algorithms for Phase Change Memory"
(pdf).
In proceedings of the 5th Biennial Conference on Innovative Data Systems Research
(CIDR'11).
- Manos Athanassoulis, Anastasia Ailamaki, Shimin Chen,
Phillip B. Gibbons, Radu Stoica.
"Flash in a DBMS: Where and How?"
(pdf).
IEEE Data Engineering Bulletin, Volume 33, Number 4, December 2010
(DEBull'10).
- Shimin Chen, Anastasia Ailamaki,
Manos Athanassoulis, Phillip B. Gibbons, Ryan Johnson, Ippokratis Pandis, Radu Stoica.
"TPC-E vs. TPC-C: Characterizing the New TPC-E Benchmark
via an I/O Comparison Study"
(pdf).
SIGMOD Record, Volume 39, Number 3, September 2010.
(SIGMOD Record'10).
- Shimin Chen, Phillip B. Gibbons, Suman Nath.
"PR-Join: A Non-Blocking Join Achieving Higher Early
Result Rate with Statistical Guarantees."(pdf)
In proceedings of
the 2010 SIGMOD International Conference on Management of Data (SIGMOD'10).
- Olatunji Ruwase, Shimin Chen,
Phillip B. Gibbons, Michael Kozuch, Todd C. Mowry. "Decoupled
Lifeguards: Enabling Path Optimizations for Dynamic Correctness Checking Tools."
(pdf)
In proceedings of ACM SIGPLAN 2010 Conference on Programming Language Design and Implementation
(PLDI'10).
- Michelle Goodstein, Evangelos Vlachos, Shimin Chen, Phillip Gibbons,
Michael Kozuch, Todd Mowry.
"Butterfly Analysis: Adapting
Dataflow Analysis to Dynamic Parallel Monitorin"(pdf). In proceedings
of
15th International Conference on Architectural Support for Programming
Languages and Operating Systems
(ASPLOS'10).
- Evangelos Vlachos, Michelle Goodstein, Michael Kozuch, Shimin Chen,
Babak Falsafi, Phillip B. Gibbons, Todd C. Mowry.
"ParaLog: Enabling and Accelerating
Online Parallel
Monitoring of Multithreaded Applications "(pdf). In proceedings of
15th International Conference on Architectural Support for Programming
Languages and Operating Systems
(ASPLOS'10).
- Shimin Chen.
"FlashLogging: Exploiting Flash
Devices for Synchronous Logging Performance"(pdf). In proceedings of
the 2009 SIGMOD International Conference on
Management of Data (SIGMOD'09).
- Shimin Chen,
Michael Kozuch, Theodoros Strigkos, Babak Falsafi, Phillip B. Gibbons,
Todd C. Mowry, Vijaya Ramachandran, Olatunji Ruwase, Michael Ryan,
Evangelos Vlachos. "Flexible Hardware
Acceleration for Instruction-Grain Lifeguards"(pdf
on ieee site).
IEEE Micro, Jan/Feb 2009 Special Issue: Micro's Top Picks in Computer
Architecture Conferences
(IEEE Micro
Top Picks).
- Olatunji Ruwase, Phillip B. Gibbons, Todd C. Mowry, Vijaya
Ramachandran, Shimin Chen,
Michael Kozuch, Michael Ryan. "Parallelizing
Dynamic Information Flow Tracking Lifeguards"(pdf). In proceedings of
the 20th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA'08).
- Shimin Chen,
Michael Kozuch, Theodoros Strigkos, Babak Falsafi, Phillip B. Gibbons,
Todd C. Mowry, Vijaya Ramachandran, Olatunji Ruwase, Michael Ryan,
Evangelos Vlachos. "Flexible Hardware
Acceleration for Instruction-Grain Program Monitoring"(pdf). In
proceedings of the 35th International Symposium on Computer
Architecture (ISCA'08)
(accepted in IEEE Micro
Special Issue "Top Picks from Computer Architecture Conferences").
- Shimin Chen, Steven
W. Schlosser. "Map-Reduce Meets Wider Varieties of Applications." (pdf). Intel Research Pittsburgh
Tech Report, IRP-TR-08-05, May, 2008.
- Guy E. Blelloch, Rezaul A. Chowdhury, Phillip B. Gibbons,
Vijaya Ramachandran, Shimin Chen,
Michael Kozuch. "Provably Good
Multicore Cache Performance for Divide-and-Conquer Algorithms"(pdf). In
proceedings of the 2008 ACM-SIAM Symposium on Discrete Algorithms
(SODA'08).
- Shimin Chen,
Anastasia Ailamaki,
Phillip
B. Gibbons, and Todd C. Mowry. "Improving Hash Join Performance
through Prefetching"(pdf).
ACM Transactions on Database Systems, Volume 32, Number 3, August 2007 (TODS'07).
(This is
the journal version of the ICDE 2004
paper. It includes a detailed performance study on an Itanium 2
machine, detailed analyses of the prefetching techniques, and
discussion on practical issues of implementing the prefetching
techniques in production DBMSs.)
- Shimin Chen,
Phillip B. Gibbons, Michael Kozuch, Vasileios
Liaskovitis, Anastasia Ailamaki, Guy E. Blelloch, Babak Falsafi, Limor
Fix, Nikos Hardavellas, Todd C. Mowry, Chris Wilkerson. "Scheduling Threads for Constructive Cache
Sharing on CMPs" (ps, pdf). In
proceedings of the 19th ACM Symposium on Parallelism in Algorithms and
Architectures
(SPAA'07).
- Shimin
Chen, Babak Falsafi, Phillip B. Gibbons, Michael Kozuch, Todd C.
Mowry,
Radu Teodorescu, Anastasia Ailamaki, Limor Fix, Gregory R. Ganger, Bin
Lin, Steven W. Schlosser. "Log-Based
Architectures for General-Purpose Monitoring of Deployed Code" (pdf). In proceedings
of Workshop on Architectural and System Support for Improving Software
Dependability (ASID) held
with ASPLOS, 2006.
- Vasileios Liaskovitis, Shimin
Chen, Phillip B. Gibbons,
Anastasia Ailamaki, Guy E. Blelloch, Babak Falsafi, Limor Fix, Michael
Kozuch, Todd C. Mowry, Chris Wilkerson. "Parallel Depth First vs. Work Stealing
Schedulers on CMP Architectures" (pdf). In proceedings of the
18th ACM Symposium on Parallelism in Algorithms and Architectures
(SPAA'06),
brief announcements track.
- Shimin Chen. "Redesigning
Database
Systems in Light of CPU Cache Prefetching"
(ps,
pdf).
Ph.D.
Thesis, Carnegie Mellon University, Computer Science Department,
December 2005.
- Shimin Chen,
Anastasia Ailamaki,
Phillip B. Gibbons, and Todd C. Mowry. "Inspector Joins"
(ps, pdf). In proceedings
of the 31st VLDB Conference, 2005 (VLDB'05).
- Shimin Chen,
Phillip B. Gibbons, and
Suman Nath. "Database-Centric Programming for Wide-Area Sensor
Systems" (ps, pdf). In proceedings of
the International Conference on Distributed Computing in Sensor
Systems, 2005 (DCOSS'05).
- Shimin Chen,
Anastasia Ailamaki,
Phillip
B. Gibbons, and Todd C. Mowry. "Improving Hash Join Performance
through Prefetching"(ps, pdf). In proceedings
of the 20th International Conference on Data Engineering, 2004
(ICDE'04).
The
full paper was published as CMU Technical Report CMU-CS-03-157,
October 2003. (Best Paper Award)
- Roger Barga, Shimin Chen,
and David
Lomet. "Improving Logging and Recovery Performance in Phoenix/App"
(pdf). In
proceedings of the 20th International Conference on Data Engineering,
2004 (ICDE'04).
- Shimin Chen,
Phillip B. Gibbons, Todd
C.
Mowry, and Gary Valentin. "Fractal Prefetching B+-Trees:
Optimizing Both Cache and Disk Performance"(ps,pdf).
In proceedings of the 2002 SIGMOD International Conference on
Management of Data (SIGMOD'02).
The full paper was published as CMU Technical Report
CMU-CS-02-115, March 2002.
- Shimin Chen,
Phillip B. Gibbons, and
Todd
C. Mowry. "Improving Index Performance through Prefetching"(ps, pdf). In proceedings of the 2001
SIGMOD International Conference on Management of Data (SIGMOD'01).
The full
paper was published as CMU Technical Report
CMU-CS-00-177, December 2000. (Runner-up for Best
Paper Award)
|
Services
- Co-Chair for HardBD & Active workshops at ICDE:
HardBD & Active'25,
HardBD & Active'24,
HardBD & Active'23,
HardBD & Active'22,
HardBD & Active'21,
HardBD & Active'20,
HardBD & Active'19,
HardBD & Active'18,
HardBD'16,
HardBD'15.
- Industrial & Application Co-Chair for EDBT'25.
- Associate Editor for PVLDB'25.
- PC senior member for EDBT'24, Paestum, Italy, March, 25-28, 2024.
- Publicity Co-Chair for ICDE'21, Chania, Crete, Greece, April 19-22, 2021.
- PC Co-Chair for BIRTE'19, co-located at VLDB'19, Los Angeles, CA, USA, August 26, 2019.
- PC Co-Chair for BIRTE'18, co-located at VLDB'18, Rio de Janeiro, Brazil, 27 August, 2018.
- PC area chair for Modern Hardware and In-Memory Database Systems in ICDE'18, Paris, France in April, 16-20, 2018.
- Associate Editor for PVLDB'17.
- PC track chair for Distributed Data Management and Analytics track in ICDCS'16, Nara, Japan, June, 2016.
- PC area chair for DB Track - Cloud Computing and Big Data Analytics in CIKM'14, Shanghai, China, November, 2014.
- Industry Co-Chair for WAIM'14, Macau, China, June, 2014.
- Co-Chair for DAMON'12, Scottsdale, AZ, USA, May, 2012.
- Organizing committee member for NSF Workshop on Sustainable Energy Efficient Data Management (SEEDM), Arlington, Virginia, May, 2011.
- Panelist on an NSF proposal review panel, 2010.
- Poster Session Chair for ASPLOS'10, Pittsburgh, PA, March, 2010.
(Call for Student Posters)
- PC member:
EUROSYS'25,
SIGMOD'25,
SIGMOD'24,
ICDE'24,
EUROSYS'23,
SOCC'22,
DEBS'22,
SIGMOD'22,
PVLDB'22,
CIDR'22,
SOCC'21,
DASFAA'21,
CIDR'21,
PVLDB'21,
DEBS'20,
ICDE'20(demo track),
DASFAA'20,
CIDR'20,
SIGMOD'19,
Eurosys'17,
NDBC'16,
CCF Big Data'16,
SIGMOD'16,
ICDE'16,
PVLDB'16,
ICPP'15,
SIGMOD'15,
DaMoN'15,
WAIM'15,
PVLDB'14,
ICDCS'14,
ICDE'14,
PVLDB'13,
SIGMOD'13,
ICDE'13,
CIDR'13,
ICDE'12,
ASBD'11,
DAMON'11,
SPAA'11,
ICDE'11,
2011 Non-Volatile Memories Workshop,
ASPLOS'11,
VLDB'10,
SIGMOD'10(demo track),
DAMON'10,
ICDE'09,
DAMON'08,
SIGMOD'08,
ICDE'08,
DAMON'07,
DAMON'06.
|