Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3459637.3481906acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

'Could You Describe the Reason for the Transfer?': A Reinforcement Learning Based Voice-Enabled Bot Protecting Customers from Financial Frauds

Published: 30 October 2021 Publication History

Abstract

With the booming of the Internet finance and e-payment business, telecom and online fraud has become a serious problem which grows rapidly. In China, 351 billion RMB (approximately 0.3% of China's GDP) was lost in 2018 due to telecommunication and online fraud, influencing tens of millions of individual customers. Anti-fraud algorithms have been widely adopted by major Internet finance companies to detect and block transactions induced by scam. However, due to limited contextual information, most systems would probably mistakenly block the normal transactions, leading to poor user experience. On the other hand, if the transactions induced by scam are detected yet not fully explained to the users, the users will continue to pay, suffering from direct financial losses.
To address these problems, we design a voice-enabled bot that interacts with the customers who are involved with potential telecommunication and online frauds decided by the back-end system. The bot seeks additional information from the customers through natural conversations to confirm whether the customers are scammed and identify the actual fraud types. The details about the frauds are then provided to convince the customers that they are on the edge of being scammed. Our bot adopts offline reinforcement learning (RL) to learn dialogue policies from real-world human-human chat logs. During the conversations, our bot also identifies fraud types every turn based on the dialogue state.
The bot proposed outperforms baseline dialogue strategies by 2.8% in terms of task success rate, and 5% in terms of dialogue accuracy in offline evaluations. Furthermore, in the 8 months of real-world deployment, our bot lowers the dissatisfaction rate by 25% and increases the fraud prevention rate by 135% relatively, indicating a significant improvement in user experience as well as anti-fraud effectiveness. More importantly, we help prevent millions of users from being deceived, and avoid trillions of financial losses.

Supplementary Material

MP4 File (CIKM21-afp0248.mp4)
This is a short video presentation of our CIKM 2021 paper. In this talk, we present the reinforcement learning based voice-enabled bot which protects customers from financial frauds, share our experience about dialogue system and application in the anti-fraud domain. We first introduce the motivation, then describe the system framework and method, finally, we present the interesting experiments and online deployment in our real-world scenarios.

References

[1]
Lalit Bahl, Peter Brown, Peter De Souza, and Robert Mercer. 1986. Maximum mutual information estimation of hidden Markov model parameters for speech recognition. In ICASSP'86. IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 11. IEEE, 49--52. https://doi.org/10.1109/ICASSP.1986.1169179
[2]
Michael Bain and Claude Sammut. 1999. A Framework for Behavioural Cloning. In Machine Intelligence 15, Intelligent Agents [St. Catherine's College, Oxford, July 1995]. Oxford University, GBR, 103--129. https://dl.acm.org/doi/10.5555/647636.733043
[3]
CAICT. 2019. Mobile Electronic Finance and E-commerce Anti-fraud White Paper. http://stock.tianyancha.com/ResearchReport/eastmoney/42db4eea17567b7601771f50744d48bb.pdf. Accessed January 29, 2021.
[4]
Shaosheng Cao, XinXing Yang, Cen Chen, Jun Zhou, Xiaolong Li, and Yuan Qi. 2019. TitAnt: Online Real-Time Transaction Fraud Detection in Ant Financial. Proceedings of the VLDB Endowment, Vol. 12, 12 (Aug. 2019), 2082--2093. https://doi.org/10.14778/3352063.3352126
[5]
Rafael San Miguel Carrasco and Miguel-Ángel Sicilia-Urbán. 2020. Evaluation of Deep Neural Networks for Reduction of Credit Card Fraud Alerts. IEEE Access, Vol. 8 (2020), 186421--186432. https://doi.org/10.1109/ACCESS.2020.3026222
[6]
Hongshen Chen, Xiaorui Liu, Dawei Yin, and Jiliang Tang. 2017. A Survey on Dialogue Systems: Recent Advances and New Frontiers. SIGKDD Explor. Newsl., Vol. 19, 2 (Nov. 2017), 25--35. https://doi.org/10.1145/3166054.3166058
[7]
Yu Chen, Lingfei Wu, and Mohammed J. Zaki. 2019. Bidirectional Attentive Memory Networks for Question Answering over Knowledge Bases. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 2913--2923. https://doi.org/10.18653/v1/N19-1299
[8]
Shreyansh Daftry, J. Andrew Bagnell, and Martial Hebert. 2016. Learning Transferable Policies for Monocular Reactive MAV Control. In International Symposium on Experimental Robotics, ISER 2016, Tokyo, Japan, October 3-6, 2016 (Springer Proceedings in Advanced Robotics, Vol. 1), Dana Kulic, Yoshihiko Nakamura, Oussama Khatib, and Gentiane Venture (Eds.). Springer, 3--11. https://doi.org/10.1007/978-3-319-50115-4_1
[9]
Yinpei Dai, Huihua Yu, Yixuan Jiang, Chengguang Tang, Yongbin Li, and Jian Sun. 2020. A Survey on Dialog Management: Recent Advances and Challenges. CoRR, Vol. abs/2005.02233 (2020). arxiv: 2005.02233 https://arxiv.org/abs/2005.02233
[10]
Thomas Davenport, Abhijit Guha, Dhruv Grewal, and Timna Bressgott. 2020. How artificial intelligence will change the future of marketing. Journal of the Academy of Marketing Science, Vol. 48, 1 (2020), 24--42. https://link.springer.com/article/10.1007/s11747-019-00696-0
[11]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171--4186. https://doi.org/10.18653/v1/N19-1423
[12]
Bhuwan Dhingra, Lihong Li, Xiujun Li, Jianfeng Gao, Yun-Nung Chen, Faisal Ahmed, and Li Deng. 2017. Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Vancouver, Canada, 484--495. https://doi.org/10.18653/v1/P17--1045
[13]
Zhifu Gao, Shiliang Zhang, Ming Lei, and Ian McLoughlin. 2020. SAN-M: Memory Equipped Self-Attention for End-to-End Speech Recognition. In Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25--29 October 2020, Helen Meng, Bo Xu, and Thomas Fang Zheng (Eds.). ISCA, 6--10. https://doi.org/10.21437/Interspeech.2020-2471
[14]
M. Ga?ić and S. Young. 2014. Gaussian Processes for POMDP-Based Dialogue Manager Optimization. IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 22, 1 (2014), 28--40. https://doi.org/10.1109/TASL.2013.2282190
[15]
Tianxing He, Jingzhao Zhang, Zhiming Zhou, and James R. Glass. 2019. Quantifying Exposure Bias for Neural Language Generation. CoRR, Vol. abs/1905.10617 (2019). arxiv: 1905.10617 http://arxiv.org/abs/1905.10617
[16]
Yuncheng Hua, Yuan-Fang Li, Gholamreza Haffari, Guilin Qi, and Tongtong Wu. 2020. Few-Shot Complex Knowledge Base Question Answering via Meta Reinforcement Learning. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, 5827--5837. https://doi.org/10.18653/v1/2020.emnlp-main.469
[17]
Sanjeev Jha, Montserrat Guillen, and J. Christopher Westland. 2012. Employing Transaction Aggregation Strategy to Detect Credit Card Fraud. Expert Syst. Appl., Vol. 39, 16 (Nov. 2012), 12650--12657. https://doi.org/10.1016/j.eswa.2012.05.018
[18]
Michael Laskey, Jonathan Lee, Roy Fox, Anca Dragan, and Ken Goldberg. 2017. Dart: Noise injection for robust imitation learning. In Conference on robot learning. PMLR, 143--156.
[19]
Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan. 2016. A Diversity-Promoting Objective Function for Neural Conversation Models. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, San Diego, California, 110--119. https://doi.org/10.18653/v1/N16--1014
[20]
Xiujun Li, Yun-Nung Chen, Lihong Li, Jianfeng Gao, and Asli Celikyilmaz. 2017. End-to-End Task-Completion Neural Dialogue Systems. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Asian Federation of Natural Language Processing, Taipei, Taiwan, 733--743. https://www.aclweb.org/anthology/I17--1074
[21]
Michael F. McTear. 1998. Modelling spoken dialogues with state transition diagrams: experiences with the CSLU toolkit. In The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November - 4th December 1998. ISCA. http://www.isca-speech.org/archive/icslp_1998/i98_0545.html
[22]
Antons Mislevics, Janis Grundspenkis, and Raita Rollande. 2018. A Systematic Approach to Implementing Chatbots in Organizations - RTU Leo Showcase. In Joint Proceedings of the BIR 2018 Short Papers, Workshops and Doctoral Consortium co-located with 17th International Conference Perspectives in Business Informatics Research (BIR 2018), Stockholm, Sweden, September 24-26, 2018 (CEUR Workshop Proceedings, Vol. 2218), Jelena Zdravkovic, Janis Grabis, Selmin Nurcan, and Janis Stirna (Eds.). CEUR-WS.org, 356--365. http://ceur-ws.org/Vol-2218/paper36.pdf
[23]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing Atari With Deep Reinforcement Learning. In NIPS Deep Learning Workshop. https://arxiv.org/abs/1312.5602
[24]
Andrew Y. Ng and Stuart J. Russell. 2000. Algorithms for Inverse Reinforcement Learning. In Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29 - July 2, 2000, Pat Langley (Ed.). Morgan Kaufmann, 663--670. https://dl.acm.org/doi/10.5555/645529.657801
[25]
PBOC. 2020. The overall operation of the payment system in 2019. http://www.gov.cn/xinwen/2020-03/17/5492272/files/62dddcc8e29849e89a65fb001f2b3445.pdf. Accessed January 29, 2021.
[26]
Baolin Peng, Xiujun Li, Jianfeng Gao, Jingjing Liu, and Kam-Fai Wong. 2018. Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Melbourne, Australia, 2182--2192. https://doi.org/10.18653/v1/P18-1203
[27]
Baolin Peng, Chenguang Zhu, Chunyuan Li, Xiujun Li, Jinchao Li, Michael Zeng, and Jianfeng Gao. 2020. Few-shot Natural Language Generation for Task-Oriented Dialog. In Findings of the Association for Computational Linguistics: EMNLP 2020. Association for Computational Linguistics, Online, 172--182. https://doi.org/10.18653/v1/2020.findings-emnlp.17
[28]
Susie Xi Rao, Shuai Zhang, Zhichao Han, Zitao Zhang, Wei Min, Zhiyao Chen, Yinan Shan, Yang Zhao, and Ce Zhang. 2020. xFraud: Explainable Fraud Transaction Detection on Heterogeneous Graphs. CoRR, Vol. abs/2011.12193 (2020). arxiv: 2011.12193 https://arxiv.org/abs/2011.12193
[29]
Stephane Ross and Drew Bagnell. 2010. Efficient Reductions for Imitation Learning. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. 9), Yee Whye Teh and Mike Titterington (Eds.). JMLR Workshop and Conference Proceedings, Chia Laguna Resort, Sardinia, Italy, 661--668. http://proceedings.mlr.press/v9/ross10a.html
[30]
Sté phane Ross, Geoffrey J. Gordon, and Drew Bagnell. 2011. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2011, Fort Lauderdale, USA, April 11-13, 2011 (JMLR Proceedings, Vol. 15), Geoffrey J. Gordon, David B. Dunson, and Miroslav Dud'i k (Eds.). JMLR.org, 627--635. http://proceedings.mlr.press/v15/ross11a/ross11a.pdf
[31]
Stuart Russell. 1998. Learning Agents for Uncertain Environments (Extended Abstract). In Proceedings of the Eleventh Annual Conference on Computational Learning Theory (Madison, Wisconsin, USA) (COLT' 98). Association for Computing Machinery, New York, NY, USA, 101--103. https://doi.org/10.1145/279943.279964
[32]
Iulian Vlad Serban, Alessandro Sordoni, Yoshua Bengio, Aaron C. Courville, and Joelle Pineau. 2016. Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, February 12-17, 2016, Phoenix, Arizona, USA, Dale Schuurmans and Michael P. Wellman (Eds.). AAAI Press, 3776--3784. http://www.aaai.org/ocs/index.php/AAAI/AAAI16/paper/view/11957
[33]
Lifeng Shang, Zhengdong Lu, and Hang Li. 2015. Neural Responding Machine for Short-Text Conversation. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, Beijing, China, 1577--1586. https://doi.org/10.3115/v1/P15--1152
[34]
Yuanlong Shao, Stephan Gouws, Denny Britz, Anna Goldie, Brian Strope, and Ray Kurzweil. 2017. Generating High-Quality and Informative Conversation Responses with Sequence-to-Sequence Models. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Copenhagen, Denmark, 2210--2219. https://doi.org/10.18653/v1/D17-1235
[35]
Weiyan Shi, Yu Li, Saurav Sahay, and Zhou Yu. 2020. Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion Dialogues via Reinforcement Learning and Human Demonstration. CoRR, Vol. abs/2012.15375 (2020). arxiv: 2012.15375 https://arxiv.org/abs/2012.15375
[36]
Pei-Hao Su, Paweł Budzianowski, Stefan Ultes, Milica Gavsić, and Steve Young. 2017. Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management. In Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue. Association for Computational Linguistics, Saarbrücken, Germany, 147--157. https://doi.org/10.18653/v1/W17-5518
[37]
Pei-Hao Su, Milica Gavsić, Nikola Mrkvsić, Lina M. Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, and Steve Young. 2016. On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Berlin, Germany, 2431--2441. https://doi.org/10.18653/v1/P16-1230
[38]
Hao Wang, Zhengdong Lu, Hang Li, and Enhong Chen. 2013. A Dataset for Research on Short-Text Conversations. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Seattle, Washington, USA, 935--945. https://www.aclweb.org/anthology/D13-1096
[39]
Pengfei Wang, Yu Fan, Long Xia, Wayne Xin Zhao, ShaoZhang Niu, and Jimmy Huang. 2020 a. KERL: A Knowledge-Guided Reinforcement Learning Model for Sequential Recommendation. In Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, SIGIR 2020, Virtual Event, China, July 25-30, 2020, Jimmy Huang, Yi Chang, Xueqi Cheng, Jaap Kamps, Vanessa Murdock, Ji-Rong Wen, and Yiqun Liu (Eds.). ACM, 209--218. https://doi.org/10.1145/3397271.3401134
[40]
Zihao Wang, Jia Liu, Hengbin Cui, Chunxiang Jin, Minghui Yang, Yafang Wang, Xiaolong Li, and Renxin Mao. 2020 b. Two-stage Behavior Cloning for Spoken Dialogue System in Debt Collection. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020, Christian Bessiere (Ed.). ijcai.org, 4633--4639. https://doi.org/10.24963/ijcai.2020/639
[41]
Tsung-Hsien Wen, David Vandyke, Nikola Mrkvs ić, Milica Gavs ić, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young. 2017. A Network-based End-to-End Trainable Task-oriented Dialogue System. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. Association for Computational Linguistics, Valencia, Spain, 438--449. https://www.aclweb.org/anthology/E17--1042
[42]
Haiqin Weng, Zhao Li, Shouling Ji, Chen Chu, Haifeng Lu, Tianyu Du, and Qinming He. 2018. Online E-Commerce Fraud: A Large-Scale Detection and Analysis. In 34th IEEE International Conference on Data Engineering, ICDE 2018, Paris, France, April 16-19, 2018. IEEE Computer Society, 1435--1440. https://doi.org/10.1109/ICDE.2018.00162
[43]
Jason D. Williams, Kavosh Asadi, and Geoffrey Zweig. 2017. Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Vancouver, Canada, 665--677. https://doi.org/10.18653/v1/P17-1062
[44]
Jason D. Williams and Steve Young. 2007. Partially observable Markov decision processes for spoken dialog systems. Computer Speech & Language, Vol. 21, 2 (2007), 393--422. https://doi.org/10.1016/j.csl.2006.06.008
[45]
Zhao Yan, Nan Duan, Peng Chen, Ming Zhou, Jianshe Zhou, and Zhoujun Li. 2017. Building Task-Oriented Dialogue Systems for Online Shopping. In Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, February 4-9, 2017, San Francisco, California, USA, Satinder P. Singh and Shaul Markovitch (Eds.). AAAI Press, 4618--4626. http://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14261
[46]
Sanghyun Yi and Kyomin Jung. 2017. A chatbot by combining finite state machine, information retrieval, and bot-initiative strategy. Proc. Alexa Prize (2017). http://alexaprize.s3.amazonaws.com/2017/technical-article/chattychat.pdf
[47]
Shiliang Zhang, Zhifu Gao, Haoneng Luo, Ming Lei, Jie Gao, Zhijie Yan, and Lei Xie. 2020. Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition. In Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25--29 October 2020, Helen Meng, Bo Xu, and Thomas Fang Zheng (Eds.). ISCA, 2142--2146. https://doi.org/10.21437/Interspeech.2020-1972
[48]
Zheng Zhang, Minlie Huang, Zhongzhou Zhao, Feng Ji, Haiqing Chen, and Xiaoyan Zhu. 2019. Memory-Augmented Dialogue Management for Task-Oriented Dialogue Systems. ACM Trans. Inf. Syst, Vol. 37, 3, Article 34 (July 2019), bibinfonumpages30 pages. https://doi.org/10.1145/3317612
[49]
Tiancheng Zhao and Maxine Eskenazi. 2016. Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning. In Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue. Association for Computational Linguistics, Los Angeles, 1--10. https://doi.org/10.18653/v1/W16-3601
[50]
Xiangyang Zhou, Lu Li, Daxiang Dong, Yi Liu, Ying Chen, Wayne Xin Zhao, Dianhai Yu, and Hua Wu. 2018. Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Melbourne, Australia, 1118--1127. https://doi.org/10.18653/v1/P18-1103

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management
October 2021
4966 pages
ISBN:9781450384469
DOI:10.1145/3459637
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 October 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. dialogue policy
  2. dialogue risk detection
  3. dialogue system
  4. outbound bot
  5. reinforcement learning

Qualifiers

  • Research-article

Conference

CIKM '21
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 303
    Total Downloads
  • Downloads (Last 12 months)46
  • Downloads (Last 6 weeks)4
Reflects downloads up to 26 Sep 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media