• Bielecki W and Błaszyński P. (2021). Parallel Tiled Code for Computing General Linear Recurrence Equations. Electronics. 10.3390/electronics10172050. 10:17. (2050).

    https://www.mdpi.com/2079-9292/10/17/2050

  • Maleki S, Agarwal U, Burtscher M and Pingali K. BiPart. Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. (161-174).

    https://doi.org/10.1145/3437801.3441611

  • Huang Y, Di S, Yu X, Li G and Cappello F. cuSZp: An Ultra-fast GPU Error-bounded Lossy Compression Framework with Optimized End-to-End Performance. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. (1-13).

    https://doi.org/10.1145/3581784.3607048

  • Maximo A. (2021). GPU efficient 1D and 3D recursive filtering. Digital Signal Processing. 10.1016/j.dsp.2021.103076. 114. (103076). Online publication date: 1-Jul-2021.

    https://linkinghub.elsevier.com/retrieve/pii/S1051200421001159

  • Bahig H and Fathy K. (2020). An efficient parallel strategy for high-cost prefix operation. The Journal of Supercomputing. 10.1007/s11227-020-03473-x.

    http://link.springer.com/10.1007/s11227-020-03473-x

  • Xia Y, Jiang P and Agrawal G. Scaling out speculative execution of finite-state machines with parallel merge. Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. (160-172).

    https://doi.org/10.1145/3332466.3374524

  • Xia Y, Jiang P and Agrawal G. Enabling prefix sum parallelism pattern for recurrences with principled function reconstruction. Proceedings of the 28th International Conference on Compiler Construction. (17-28).

    https://doi.org/10.1145/3302516.3307354