research-article

Dressing as a Whole: Outfit Compatibility Learning Based on Node-wise Graph Neural Networks

Authors:

Liang WangAuthors Info & Claims

WWW '19: The World Wide Web Conference

Pages 307 - 317

https://doi.org/10.1145/3308558.3313444

Published: 13 May 2019 Publication History

Abstract

With the rapid development of fashion market, the customers' demands of customers for fashion recommendation are rising. In this paper, we aim to investigate a practical problem of fashion recommendation by answering the question “which item should we select to match with the given fashion items and form a compatible outfit”. The key to this problem is to estimate the outfit compatibility. Previous works which focus on the compatibility of two items or represent an outfit as a sequence fail to make full use of the complex relations among items in an outfit. To remedy this, we propose to represent an outfit as a graph. In particular, we construct a Fashion Graph, where each node represents a category and each edge represents interaction between two categories. Accordingly, each outfit can be represented as a subgraph by putting items into their corresponding category nodes. To infer the outfit compatibility from such a graph, we propose Node-wise Graph Neural Networks (NGNN) which can better model node interactions and learn better node representations. In NGNN, the node interaction on each edge is different, which is determined by parameters correlated to the two connected nodes. An attention mechanism is utilized to calculate the outfit compatibility score with learned node representations. NGNN can not only be used to model outfit compatibility from visual or textual modality but also from multiple modalities. We conduct experiments on two tasks: (1) Fill-in-the-blank: suggesting an item that matches with existing components of outfit; (2) Compatibility prediction: predicting the compatibility scores of given outfits. Experimental results demonstrate the great superiority of our proposed method over others.

References

[1]

Sean Bell and Kavita Bala. 2015. Learning visual similarity for product design with convolutional neural networks. ACM Transactions on Graphics (TOG)34, 4 (2015), 98.

Digital Library

[2]

Long Chen and Yuhang He. 2018. Dress Fashionably: Learn Fashion Collocation With Deep Mixed-Category Metric Learning. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence. 2103-2110.

[3]

Qiang Cui, Shu Wu, Qiang Liu, Wen Zhong, and Liang Wang. 2018. MV-RNN: A Multi-View Recurrent Neural Network for Sequential Recommendation. IEEE Transactions on Knowledge and Data Engineering (2018).

[4]

Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, and Trevor Darrell. 2014. Decaf: A deep convolutional activation feature for generic visual recognition. In International conference on machine learning. 647-655.

Digital Library

[5]

David K Duvenaud, Dougal Maclaurin, Jorge Iparraguirre, Rafael Bombarell, Timothy Hirzel, Alán Aspuru-Guzik, and Ryan P Adams. 2015. Convolutional networks on graphs for learning molecular fingerprints. In Advances in neural information processing systems. 2224-2232.

Digital Library

[6]

Felix A Gers, Jürgen Schmidhuber, and Fred Cummins. 1999. Learning to forget: Continual prediction with LSTM. (1999).

Digital Library

[7]

Marco Gori, Gabriele Monfardini, and Franco Scarselli. 2005. A new model for learning in graph domains. In Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005., Vol. 2. IEEE, 729-734.

[8]

Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 855-864.

Digital Library

[9]

Xintong Han, Zuxuan Wu, Yu-Gang Jiang, and Larry S Davis. 2017. Learning fashion compatibility with bidirectional lstms. In Proceedings of the 25th ACM international conference on Multimedia. ACM, 1078-1086.

Digital Library

[10]

Ruining He, Chunbin Lin, Jianguo Wang, and Julian McAuley. 2016. Sherlock: sparse hierarchical embeddings for visually-aware one-class collaborative filtering. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence. AAAI Press, 3740-3746.

Digital Library

[11]

Ruining He, Charles Packer, and Julian McAuley. 2016. Learning compatibility across categories for heterogeneous item recommendation. In 2016 IEEE 16th International Conference on Data Mining (ICDM). IEEE, 937-942.

[12]

Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation9, 8 (1997), 1735-1780.

Digital Library

[13]

Yang Hu, Xi Yi, and Larry S Davis. 2015. Collaborative fashion recommendation: A functional tensor factorization approach. In Proceedings of the 23rd ACM international conference on Multimedia. ACM, 129-138.

Digital Library

[14]

Thomas N Kipf and Max Welling. 2016. Semi-Supervised Classification with Graph Convolutional Networks. arXiv preprint arXiv:1609.02907(2016).

[15]

Hanbit Lee, Jinseok Seol, and Sang-goo Lee. 2017. Style2Vec: Representation Learning for Fashion Items from Style Sets. arXiv preprint arXiv:1708.04014(2017).

[16]

Ruiyu Li, Makarand Tapaswi, Renjie Liao, Jiaya Jia, Raquel Urtasun, and Sanja Fidler. 2017. Situation recognition with graph neural networks. In Proceedings of the IEEE International Conference on Computer Vision. 4173-4182.

[17]

Yuncheng Li, Liangliang Cao, Jiang Zhu, and Jiebo Luo. 2017. Mining fashion outfit composition using an end-to-end deep learning approach on set data. IEEE Transactions on Multimedia19, 8 (2017), 1946-1955.

Digital Library

[18]

Yujia Li, Daniel Tarlow, Marc Brockschmidt, and Richard Zemel. 2015. Gated graph sequence neural networks. arXiv preprint arXiv:1511.05493(2015).

[19]

Zhongyang Li, Xiao Ding, and Ting Liu. 2018. Constructing Narrative Event Evolutionary Graph for Script Event Prediction. arXiv preprint arXiv:1805.05081(2018).

Digital Library

[20]

Qiang Liu, Shu Wu, and Liang Wang. 2017. Deepstyle: Learning user preferences for visual recommendation. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 841-844.

Digital Library

[21]

Qiang Liu, Shu Wu, Liang Wang, and Tieniu Tan. 2016. Predicting the next location: a recurrent model with spatial and temporal contexts. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence. AAAI Press, 194-200.

Digital Library

[22]

Kenneth Marino, Ruslan Salakhutdinov, and Abhinav Gupta. 2017. The More You Know: Using Knowledge Graphs for Image Classification. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 20-28.

[23]

Julian McAuley, Christopher Targett, Qinfeng Shi, and Anton Van Den Hengel. 2015. Image-based recommendations on styles and substitutes. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 43-52.

Digital Library

[24]

Tomáš Mikolov, Martin Karafiát, Lukáš Burget, Jan Cernock?, and Sanjeev Khudanpur. 2010. Recurrent neural network based language model. In Eleventh annual conference of the international speech communication association.

[25]

Tomáš Mikolov, Stefan Kombrink, Lukáš Burget, Jan Cernock?, and Sanjeev Khudanpur. 2011. Extensions of recurrent neural network language model. In 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 5528-5531.

[26]

Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems. 3111-3119.

Digital Library

[27]

Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 701-710.

Digital Library

[28]

Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian personalized ranking from implicit feedback. In Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence. AUAI Press, 452-461.

Digital Library

[29]

Franco Scarselli, Marco Gori, Ah Chung Tsoi, Markus Hagenbuchner, and Gabriele Monfardini. 2009. The graph neural network model. IEEE Transactions on Neural Networks20, 1 (2009), 61-80.

Digital Library

[30]

Ali Sharif Razavian, Hossein Azizpour, Josephine Sullivan, and Stefan Carlsson. 2014. CNN features off-the-shelf: an astounding baseline for recognition. In CVPR workshops.

Digital Library

[31]

Yong-Siang Shih, Kai-Yueh Chang, Hsuan-Tien Lin, and Min Sun. 2018. Compatibility family learning for item recommendation and generation. In Thirty-Second AAAI Conference on Artificial Intelligence. 2403-2410.

[32]

Xuemeng Song, Fuli Feng, Jinhuan Liu, Zekun Li, Liqiang Nie, and Jun Ma. 2017. Neurostylist: Neural compatibility modeling for clothing matching. In Proceedings of the 25th ACM international conference on Multimedia. ACM, 753-761.

Digital Library

[33]

Martin Sundermeyer, Ralf Schlüter, and Hermann Ney. 2012. LSTM neural networks for language modeling. In Thirteenth annual conference of the international speech communication association.

[34]

Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jon Shlens, and Zbigniew Wojna. 2016. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2818-2826.

[35]

Zhixing Tan, Mingxuan Wang, Jun Xie, Yidong Chen, and Xiaodong Shi. 2017. Deep Semantic Role Labeling with Self-Attention. arXiv preprint arXiv:1712.01586(2017).

[36]

Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015. Line: Large-scale information network embedding. In Proceedings of the 24th international conference on world wide web. International World Wide Web Conferences Steering Committee, 1067-1077.

Digital Library

[37]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. 5998-6008.

Digital Library

[38]

Andreas Veit, Balazs Kovacs, Sean Bell, Julian McAuley, Kavita Bala, and Serge Belongie. 2015. Learning visual clothing style with heterogeneous dyadic co-occurrences. In Proceedings of the IEEE International Conference on Computer Vision. 4642-4650.

Digital Library

[39]

Shu Wu, Yuyuan Tang, Yanqiao Zhu, Xing Xie, and Tieniu Tan. 2018. Session-based Recommendation with Graph Neural Networks. In Thirty-Third AAAI Conference on Artificial Intelligence.

Cited By

Sun KZhao ZLi MHuang G(2025)Multi-order attributes information fusion via hypergraph matching for popular fashion compatibility analysisExpert Systems with Applications10.1016/j.eswa.2024.125758263(125758)Online publication date: Mar-2025
https://doi.org/10.1016/j.eswa.2024.125758
Cui KLiu SFeng WDeng XGao LCheng MLu HYang L(2024)Correlation-aware Cross-modal Attention Network for Fashion Compatibility Modeling in UGC SystemsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/3698772Online publication date: 5-Oct-2024
https://doi.org/10.1145/3698772
Selwon KSzymański J(2024)A Review of Explainable Fashion Compatibility Modeling MethodsACM Computing Surveys10.1145/366461456:11(1-29)Online publication date: 28-Jun-2024
https://dl.acm.org/doi/10.1145/3664614
Show More Cited By

Recommendations

Hierarchical Fashion Graph Network for Personalized Outfit Recommendation
SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

Fashion outfit recommendation has attracted increasing attentions from online shopping services and fashion communities.Distinct from other scenarios (e.g., social networking or content sharing) which recommend a single item (e.g., a friend or picture) ...
Outfit Compatibility Prediction and Diagnosis with Multi-Layered Comparison Network
MM '19: Proceedings of the 27th ACM International Conference on Multimedia

Existing works about fashion outfit compatibility focus on predicting the overall compatibility of a set of fashion items with their information from different modalities. However, there are few works explore how to explain the prediction, which limits ...
OutfitNet: Fashion Outfit Recommendation with Attention-Based Multiple Instance Learning
WWW '20: Proceedings of The Web Conference 2020

Recommending fashion outfits to users presents several challenges. First of all, an outfit consists of multiple fashion items, and each user emphasizes different parts of an outfit when considering whether they like it or not. Secondly, a user’s liking ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '19: The World Wide Web Conference

May 2019

3620 pages

ISBN:9781450366748

DOI:10.1145/3308558

Editors:
Ling Liu
Georgia Tech, USA
,
Ryen White
Microsoft Research, USA

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

IW3C2: International World Wide Web Conference Committee

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

WWW '19

WWW '19: The Web Conference

May 13 - 17, 2019

CA, San Francisco, USA

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

88
Total Citations
View Citations
1,178
Total Downloads

Downloads (Last 12 months)99
Downloads (Last 6 weeks)16

Reflects downloads up to 22 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Sun KZhao ZLi MHuang G(2025)Multi-order attributes information fusion via hypergraph matching for popular fashion compatibility analysisExpert Systems with Applications10.1016/j.eswa.2024.125758263(125758)Online publication date: Mar-2025
https://doi.org/10.1016/j.eswa.2024.125758
Cui KLiu SFeng WDeng XGao LCheng MLu HYang L(2024)Correlation-aware Cross-modal Attention Network for Fashion Compatibility Modeling in UGC SystemsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/3698772Online publication date: 5-Oct-2024
https://doi.org/10.1145/3698772
Selwon KSzymański J(2024)A Review of Explainable Fashion Compatibility Modeling MethodsACM Computing Surveys10.1145/366461456:11(1-29)Online publication date: 28-Jun-2024
https://dl.acm.org/doi/10.1145/3664614
Jang JHwang EPark S(2024)Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00788(8051-8060)Online publication date: 3-Jan-2024
https://doi.org/10.1109/WACV57701.2024.00788
Pang KZou XWong W(2024)Learning Visual Body-shape-Aware Embeddings for Fashion Compatibility2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00787(8041-8050)Online publication date: 3-Jan-2024
https://doi.org/10.1109/WACV57701.2024.00787
Zhou DZhang HYang KLiu LYan HXu XZhang ZYan S(2024)Learning to Synthesize Compatible Fashion Items Using Semantic Alignment and Collocation Classification: An Outfit Generation FrameworkIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.3202842(1-15)Online publication date: 2024
https://doi.org/10.1109/TNNLS.2022.3202842
Cui ZLi ZWu SZhang XLiu QWang LAi M(2024)DyGCN: Efficient Dynamic Graph Embedding With Graph Convolutional NetworkIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.3185527(1-12)Online publication date: 2024
https://doi.org/10.1109/TNNLS.2022.3185527
Dong XSong XZheng NWu JDai HNie L(2024)TryonCM2: Try-on-Enhanced Fashion Compatibility Modeling FrameworkIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.317329535:1(246-257)Online publication date: Jan-2024
https://doi.org/10.1109/TNNLS.2022.3173295
Su ZChen YZhang FWang RZhou FLin G(2024)DMAP: Decoupling-Driven Multi-Level Attribute Parsing for Interpretable Outfit CollocationIEEE Transactions on Multimedia10.1109/TMM.2024.340254126(9988-10000)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TMM.2024.3402541
Xu RWang JLi Y(2024)Heterogeneous-Grained Multi-Modal Graph Network for Outfit RecommendationIEEE Transactions on Emerging Topics in Computational Intelligence10.1109/TETCI.2024.33581908:2(1788-1799)Online publication date: Apr-2024
https://doi.org/10.1109/TETCI.2024.3358190
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents