Imagine and Imitate: Cost-Effective Bidding under Partially Observable Price Landscapes
<p>An overlook on the frequency of performance ranking of Imagine and Imitate Bidding (IIBidder) compared with others in a series of experimental combinations concerning budget and mask. The vertical axis lists different methods, while the horizontal axis represents ranking frequency. IIBidder ranks 1st in most of the combinations.</p> "> Figure 2
<p>The Imagine and Imitate Bidding (IIBidder) framework.</p> "> Figure 3
<p>The Imagination module.</p> "> Figure 4
<p>The rankings of our method for certain mask and budget combinations for each dataset and metric. Each cell represents an experiment of a specific mask-budget combination, with a ranking number inside. The smaller the ranking, the better. Our method ranks the first in most of experiments.</p> "> Figure 5
<p>The overall performance of Imagine and Imitate Bidding (IIBidder) under different budget constraints on datasets and metrics.</p> "> Figure 6
<p>The overall performance of Imagine and Imitate Bidding (IIBidder) for different masking scales on different datasets and metrics.</p> "> Figure A1
<p>The real-time bidding (RTB) ecosystem and its process.</p> ">
Abstract
:1. Introduction
- Our work jointly optimizes bidding strategy and landscape prediction, leveraging market price distribution data to enhance bidding strategies.
- IIBidder introduces the Imagination module, enabling the model to infer hidden information and adapt to expert bidding samples, resulting in more accurate real-world bidding strategies, particularly in the presence of incomplete data.
- Experimental results demonstrate that our method outperforms current RTB methods in terms of click-through rates and winning rates considering cost constraints.
2. Related Work
2.1. Bidding Landscape Forecasting
2.2. Bidding Strategy Design
2.3. Imitation Learning and GAIL Optimization
3. Problem Statement
3.1. Problem Definition
3.2. Bidding Strategy as Markov Decision Process
3.3. Partially Observable Scenario
4. The IIBidder Model
4.1. IIBidder Agent
4.2. The Imagination Module
4.3. Expert Sample Transitions
4.4. Discriminator
4.5. The IIBidder Algorithm
Algorithm 1: The IIBidder algorithm. |
5. Experiments
- Under budget constraint, what is the performance of IIBidder in comparison to other classic algorithms? In terms of modeling dynamic price environments, what advantages does this algorithm have?
- In an incomplete data scenario, is IIBidder effective in addressing the challenge of missing values within the competitive bidding environment?
- Does the incorporation of the Expert sample module, Discriminator module, and Imagination module effectively stimulate favorable behavior in the agent? Under budget constraints and incomplete data landscapes, can these modules operate effectively?
5.1. Experimental Settings
5.1.1. Implementation Details
5.1.2. Datasets
5.1.3. Data Preprocessing
5.1.4. Evaluation Metrics
5.1.5. Budget, Masking and Hyper-Parameter Setting
5.1.6. Baseline Models
- Uniform: This strategy assumes that the bid price is uniformly distributed between the lowest bid and the highest bid , namely
- Gamma [6]: Similarly, the strategy is made upon the Gamma distribution, characterized by a shape parameter k and a scale parameter . The probability density function (PDF) of the Gamma distribution is given by:
- Normal: The logarithm assumes that market price follows the Gaussian distribution .
- Lin [19]: The bid for the ith ad display opportunity depends on the historical average click-through rate , the estimated click-through rate for that opportunity, and the base price .
- GMM [16]: a Gaussian mixture model to describe and discriminate the multimodal distribution of market price by utilizing the impression-level features.
- DLF [40]: A method combining deep recurrent neural networks and survival analysis to forecast the distribution of market prices.
5.2. Experimental Results and Discussions
5.2.1. Performance under Budget Constraints (Q1)
5.2.2. Analysis on Incomplete Data Scenario (Q2)
5.2.3. Ablation Studies on Different Module (Q3)
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A. Real-Time Bidding for Online Advertisement
- Advertiser: An individual consumer or organization that needs to display an advertisement who has a certain budget and bids actively in the bidding process to get the opportunity to display their ads.
- Publisher: A provider with online advertising resources, usually the owner of web pages, search engines, and mobile apps. Publishers earn profits by selling their ad positions.
- Supplier Side Platform (SSP): SSP represents the interests of publishers. SSP integrates media advertising resources for centralized management. It provides publishers with functions such as bottom price setting and automatic ad placement. Since SSPs connect to a large number of partners, they need to formulate a reasonable allocation strategy for each partner to obtain the same amount of advertisers.
- Demand Side Platform (DSP): DSP represents the interests of advertisers, who register on the DSP platform and initiate the requests for ad placement. DSP recommends the most valuable ad positions for advertisers by various means such as big data analysis and user behavior analysis. At the same time, DSP provides advertisers with functions such as user response prediction, budget management and control, and automatic real-time bidding.
- Ad Exchange (ADX): ADX is the hub of the RTB system, which connects many DSPs and SSPs and handles all ad bidding requests in the system. Each bidding process is conducted under the auspices of ADX, making sure that the whole process is fair and open. Meanwhile, ADX sells ad display opportunities to the bidder who has given the highest price according to the generalized second price (GSP) mechanism [42].
- Data Management Platform (DMP): After collecting information such as users’ cookies, click records, purchase records, and search contents, DMP tries to clean, transform, and refine the collected information. It adopts machine learning methods to mine user preferences and recommend accurate target customers to the advertisers.
- 1.
- When a user opens an app or web page developed by a third party (publisher), an ad request will be generated and sent to an SSP with the user’s information. The request contains the user’s cookie information and the ad context information (e.g., web page URL, ad position, ad position length and width, ad position reserve price, etc.).
- 2.
- Once the SSP receives the ad display request, it forwards the request to the ADX with an attachment.
- 3.
- ADX broadcasts the bid request, including the user’s cookie information and contextual information of the ad, to all advertisers on the DSP platform.
- 4.
- After receiving the bid request, an advertiser makes a bid, probably with a bidding robot. The ADX platform will compare these bid prices and determine a final winner. Meanwhile, the process needs to be completed within a specified time (usually 100 ms). When the time is out, the bidding opportunity is considered abandoned.
- 5.
- The winning advertiser only needs to pay the second-highest price. If all advertisers’ bid prices are lower than the reserved price defined by the publisher, the bidding process for that ad display is considered abandoned.
- 6.
- ADX notifies the winner and automatically deducts the cost from the winner’s budget. Then, the ADX platform sends back the winner’s ad content to the corresponding SSP.
- 7.
- The SSP shows the ad to the user and records the user’s feedback, such as clicks and conversions.
Appendix B. From Reinforcement Learning to Generative Adversarial Imitation Learning
Appendix C. Formula Derivation
Appendix D. Implementation Details
Feature | Description | Example |
---|---|---|
IP | user’s IP address | 121.225.158.3 |
adexchange | the platform of the advertiser participating in the bidding | 1 |
advertiser | participating advertiser ID | 1458 |
bidid | the unique identifier of the bid record | 72879b068fec2d3c2afd51 |
bidprice | advertiser’s bid price | 300 |
city | user’s city | 84 |
click | the number of clicks on the ad | 0 |
creative | Ad creative logo | 48f2e9ba1570a5e1dd653caa |
domain | user’s area | trqRTuqbjoFf1mKYUV |
hour | time | 0 |
ipinyouid | user’s ID | Vhk7ZAnyPIc9tbE |
keypage | URL of advertiser landing page | befa5efe83be5e7c5085b |
logtpye | log type | 1 |
payprice | the transaction price of the ad display opportunity | 55 |
region | user administrative area information | 80 |
slotformat | format of the opportunity | 1 |
slotheight | the height of the opportunity | 90 |
slotid | the slot ID of the opportunity | mm_34955955_11267874 |
slotprice | the reserved price of the opportunity | 0 |
slotvisibility | the visibility of the opportunity | 0 |
slotwidth | width of the advertising display opportunity | 728 |
timestamp | the timestamp of the bid | 20130606000105500.0 |
url | URL address of the ad display opportunity | de0cca5e4ff921ca803b |
useragent | the user’s browser information | windows_chrome |
usertag | the user’s tag | 100631304500037075861504 |
weekday | indicates what day of the week it is | 4 |
Advertiser ID | Category |
---|---|
1458 | E-commerce |
2259 | Milk powder |
2261 | Communication |
2821 | Footwear |
2997 | M-Commerce |
3358 | Software Development |
3386 | International E-Commerce |
3427 | Oil |
3476 | Tires |
Adv.ID | Impression | Clicks | Cost ($) | |||
---|---|---|---|---|---|---|
1458 | 3,083,056 | 2454 | 212,400.20 | 20.97% | 0.08% | 86.55 |
2259 | 835,556 | 280 | 77,754.90 | 27.97% | 0.03% | 277.70 |
2261 | 687,617 | 207 | 61,610.94 | 31.84% | 0.03% | 297.64 |
2821 | 1,322,561 | 843 | 118,082.30 | 24.99% | 0.06% | 140.07 |
2997 | 312,437 | 1386 | 19,689.07 | 30.69% | 0.44% | 14.21 |
3358 | 1,657,692 | 1358 | 160,943.10 | 46.44% | 0.08% | 118.51 |
3386 | 2,847,802 | 2076 | 219,066.90 | 20.21% | 0.07% | 105.52 |
3427 | 2,512,439 | 1926 | 210,239.90 | 29.35% | 0.08% | 109.16 |
3476 | 1,945,007 | 1027 | 156,088.50 | 23.78% | 0.05% | 151.98 |
total | 15,204,167 | 11,557 | 1,235,875.81 | 28.47% | 0.10% | 144.59 |
Adv.ID | Impression | Clicks | Cost ($) | |||
---|---|---|---|---|---|---|
1458 | 614,638 | 543 | 45,216.45 | 100% | 0.09% | 83.27 |
2259 | 417,197 | 131 | 43,497.56 | 100% | 0.03% | 332.04 |
2261 | 343,862 | 97 | 28,796.00 | 100% | 0.03% | 296.87 |
2821 | 661,964 | 394 | 68,257.10 | 100% | 0.06% | 173.24 |
2997 | 156,063 | 533 | 8617.15 | 100% | 0.34% | 16.17 |
3358 | 261,001 | 339 | 34,159.77 | 100% | 0.13% | 100.77 |
3386 | 545,421 | 496 | 45,715.53 | 100% | 0.09% | 92.17 |
3427 | 514,559 | 395 | 46,356.52 | 100% | 0.08% | 117.36 |
3476 | 514,560 | 302 | 43,627.58 | 100% | 0.06% | 144.46 |
total | 4,029,265 | 3230 | 364,243.66 | 100% | 0.1% | 150.71 |
References
- Yuan, S.; Wang, J.; Zhao, X. Real-time bidding for online advertising: Measurement and analysis. In Proceedings of the Seventh International Workshop on Data Mining for Online Advertising, Chicago, IL, USA, 11 August 2013; pp. 1–8. [Google Scholar]
- Muthukrishnan, S. Ad exchanges: Research issues. In International Workshop on Internet and Network Economics; Springer: Berlin/Heidelberg, Germany, 2009; pp. 1–12. [Google Scholar]
- Wu, W.C.-H.; Yeh, M.-Y.; Chen, M.-S. Predicting winning price in real time bidding with censored data. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Sydney, Australia, 10–13 August 2015; pp. 1305–1314. [Google Scholar]
- Cui, Y.; Zhang, R.; Li, W.; Mao, J. Bid landscape forecasting in online ad exchange marketplace. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, 21–24 August 2011; pp. 265–273. [Google Scholar]
- Zhang, W.; Yuan, S.; Wang, J. Optimal real-time bidding for display advertising. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, 24–27 August 2014; pp. 1077–1086. [Google Scholar]
- Zhu, W.-Y.; Shih, W.-Y.; Lee, Y.-H.; Peng, W.-C.; Huang, J.-L. A gamma-based regression for winning price estimation in real-time bidding advertising. In Proceedings of the 2017 IEEE International Conference on Big Data (Big Data), Boston, MA, USA, 11–14 December 2017; pp. 1610–1619. [Google Scholar]
- Wu, W.; Yeh, M.-Y.; Chen, M.-S. Deep censored learning of the winning price in the real time bidding. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19–23 August 2018; pp. 2526–2535. [Google Scholar]
- Wang, T.; Yang, H.; Yu, H.; Zhou, W.; Liu, Y.; Song, H. A revenue-maximizing bidding strategy for demand-side platforms. IEEE Access 2019, 7, 68692–68706. [Google Scholar] [CrossRef]
- Wang, T.; Yang, H.; Jiang, S.; Shi, Y.; Li, Q.; Tang, X.; Yu, H.; Song, H. Kaplan–meier markov network: Learning the distribution of market price by censored data in online advertising. Knowl.-Based Syst. 2022, 251, 109248. [Google Scholar] [CrossRef]
- Ren, K.; Zhang, W.; Chang, K.; Rong, Y.; Yu, Y.; Wang, J. Bidding machine: Learning to bid for directly optimizing profits in display advertising. IEEE Trans. Knowl. Data Eng. 2017, 30, 645–659. [Google Scholar] [CrossRef]
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. Adv. Neural Inf. Process. Syst. 2014, 27, 2672–2680. [Google Scholar]
- Wang, K.; Gou, C.; Duan, Y.; Lin, Y.; Zheng, X.; Wang, F.-Y. Generative adversarial networks: Introduction and outlook. IEEE/CAA J. Autom. Sin. 2017, 4, 588–598. [Google Scholar] [CrossRef]
- Schulman, J.; Wolski, F.; Dhariwal, P.; Radford, A.; Klimov, O. Proximal policy optimization algorithms. arXiv 2017, arXiv:1707.06347. [Google Scholar]
- Zhang, W.; Zhou, T.; Wang, J.; Xu, J. Bid-aware gradient descent for unbiased learning with censored data in display advertising. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 665–674. [Google Scholar]
- Wang, Y.; Ren, K.; Zhang, W.; Wang, J.; Yu, Y. Functional bid landscape forecasting for display advertising. In Proceedings of the Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2016, Riva del Garda, Italy, 19–23 September 2016; Proceedings, Part I 16. Springer: Berlin/Heidelberg, Germany, 2016; pp. 115–131. [Google Scholar]
- Wang, T.; Yang, H.; Liu, Y.; Yu, H.; Song, H. A multimodal approach for improving market price estimation in online advertising. Knowl.-Based Syst. 2023, 266, 110392. [Google Scholar] [CrossRef]
- Cai, H.; Ren, K.; Zhang, W.; Malialis, K.; Wang, J.; Yu, Y.; Guo, D. Real-time bidding by reinforcement learning in display advertising. In Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, Cambridge, UK, 6–10 February 2017; pp. 661–670. [Google Scholar]
- Perlich, C.; Dalessandro, B.; Hook, R.; Stitelman, O.; Raeder, T.; Provost, F. Bid optimizing and inventory scoring in targeted online advertising. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing, China, 12–16 August 2012; pp. 804–812. [Google Scholar]
- Lin, C.-C.; Chuang, K.-T.; Wu, W.C.-H.; Chen, M.-S. Combining powers of two predictors in optimizing real-time bidding strategy under constrained budget. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, Indianapolis, IN, USA, 24–28 October 2016; pp. 2143–2148. [Google Scholar]
- Chen, Y.; Berkhin, P.; Anderson, B.; Devanur, N.R. Real-time bidding algorithms for performance-based display ad allocation. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, 21–24 August 2011; pp. 1307–1315. [Google Scholar]
- Chapelle, O. Modeling delayed feedback in display advertising. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, 24–27 August 2014; pp. 1097–1105. [Google Scholar]
- Xu, J.; Lee, K.-c.; Li, W.; Qi, H.; Lu, Q. Smart pacing for effective online ad campaign optimization. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Sydney, Australia, 10–13 August 2015; pp. 2217–2226. [Google Scholar]
- Lee, K.-C.; Jalali, A.; Dasdan, A. Real time bid optimization with smooth budget delivery in online advertising. In Proceedings of the Seventh International Workshop on Data Mining for Online Advertising, Chicago, IL, USA, 11 August 2013; pp. 1–9. [Google Scholar]
- Afshar, R.R.; Zhang, Y.; Kaymak, U. Dynamic ad network ordering method using reinforcement learning. Int. J. Comput. Intell. Syst. 2022, 15, 27. [Google Scholar] [CrossRef]
- Sutton, R.S.; Barto, A.G. Reinforcement Learning: An Introduction; MIT Press: Cambridge, MA, USA, 2018. [Google Scholar]
- Wang, Y.; Liu, J.; Liu, Y.; Hao, J.; He, Y.; Hu, J.; Yan, W.P.; Li, M. Ladder: A human-level bidding agent for large-scale real-time online auctions. arXiv 2017, arXiv:1708.05565. [Google Scholar]
- Liu, M.; Liu, J.; Hu, Z.; Ge, Y.; Nie, X. Bid optimization using maximum entropy reinforcement learning. Neurocomputing 2022, 501, 529–543. [Google Scholar] [CrossRef]
- Wang, H.; Du, C.; Fang, P.; Yuan, S.; He, X.; Wang, L.; Zheng, B. Roi constrained bidding via curriculum-guided bayesian reinforcement learning. arXiv 2022, arXiv:2206.05240. [Google Scholar]
- Lu, J.; Yang, C.; Gao, X.; Wang, L.; Li, C.; Chen, G. Reinforcement learning with sequential information clustering in real-time bidding. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management, Beijing, China, 3–7 November 2019; pp. 1633–1641. [Google Scholar]
- Mnih, V.; Badia, A.P.; Mirza, M.; Graves, A.; Lillicrap, T.; Harley, T.; Silver, D.; Kavukcuoglu, K. Asynchronous methods for deep reinforcement learning. In International Conference on Machine Learning; PMLR: London, UK, 2016; pp. 1928–1937. [Google Scholar]
- Jin, J.; Song, C.; Li, H.; Gai, K.; Wang, J.; Zhang, W. Real-time bidding with multi-agent reinforcement learning in display advertising. In Proceedings of the 27th ACM International Conference on Information and knowledge Management, Torino, Italy, 22–26 October 2018; pp. 2193–2201. [Google Scholar]
- Lu, Y.; Lu, C.; Bandyopadhyay, N.; Kumar, M.; Gupta, G. Functional optimization reinforcement learning for real-time bidding. arXiv 2022, arXiv:2206.13939. [Google Scholar]
- Abbeel, P.; Ng, A.Y. Apprenticeship learning via inverse reinforcement learning. In Proceedings of the Twenty-First International Conference on Machine Learning, Banff, AB, Canada, 4–8 July 2004; p. 1. [Google Scholar]
- Hussein, A.; Gaber, M.M.; Elyan, E.; Jayne, C. Imitation learning: A survey of learning methods. ACM Comput. Surv. (CSUR) 2017, 50, 1–35. [Google Scholar] [CrossRef]
- Pomerleau, D.A. Efficient training of artificial neural networks for autonomous navigation. Neural Comput. 1991, 3, 88–97. [Google Scholar] [CrossRef] [PubMed]
- Ross, S.; Gordon, G.; Bagnell, D. A reduction of imitation learning and structured prediction to no-regret online learning. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, JMLR Workshop and Conference Proceedings, Ft. Lauderdale, FL, USA, 11–13 April 2011; pp. 627–635. [Google Scholar]
- Ho, J.; Gupta, J.; Ermon, S. Model-free imitation learning with policy optimization. In International Conference on Machine Learning; PMLR: London, UK, 2016; pp. 2760–2769. [Google Scholar]
- Pathak, D.; Agrawal, P.; Efros, A.A.; Darrell, T. Curiosity-driven exploration by self-supervised prediction. In International Conference on Machine Learning; PMLR: London, UK, 2017; pp. 2778–2787. [Google Scholar]
- Hafner, D.; Lillicrap, T.; Ba, J.; Norouzi, M. Dream to control: Learning behaviors by latent imagination. arXiv 2019, arXiv:1912.01603. [Google Scholar]
- Ren, K.; Qin, J.; Zheng, L.; Yang, Z.; Zhang, W.; Yu, Y. Deep landscape forecasting for real-time bidding advertising. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 4–8 August 2019; pp. 363–372. [Google Scholar]
- Wu, D.; Chen, X.; Yang, X.; Wang, H.; Tan, Q.; Zhang, X.; Xu, J.; Gai, K. Budget constrained bidding by model-free reinforcement learning in display advertising. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management, Torino, Italy, 22–26 October 2018; pp. 1443–1451. [Google Scholar]
- Ausubel, L.M. A generalized vickrey auction. Econo0 Metr. 1999, 23, 493–505. [Google Scholar]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Graves, A.; Antonoglou, I.; Wierstra, D.; Riedmiller, M. Playing atari with deep reinforcement learning. arXiv 2013, arXiv:1312.5602. [Google Scholar]
- Wang, S.; Jia, D.; Weng, X. Deep reinforcement learning for autonomous driving. arXiv 2018, arXiv:1811.11329. [Google Scholar]
- Zhou, F.; Luo, B.; Hu, T.; Chen, Z.; Wen, Y. A combinatorial recommendation system framework based on deep reinforcement learning. In Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), Orlando, FL, USA, 15–18 December 2021; pp. 5733–5740. [Google Scholar]
- Cui, B.; Sun, R.; Su, J. A novel deep reinforcement learning strategy in financial portfolio management. In Proceedings of the 2022 7th International Conference on Big Data Analytics (ICBDA), Guangzhou, China, 4–6 March 2022; pp. 341–348. [Google Scholar]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G.; et al. Human-level control through deep reinforcement learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef] [PubMed]
- Ross, S.; Bagnell, D. Efficient reductions for imitation learning. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, JMLR Workshop and Conference Proceedings, Sardinia, Italy, 13–15 May 2010; pp. 661–668. [Google Scholar]
Symbol | Description |
---|---|
N | the total number of bid requests |
x | whether to win the bidding request (1 if success, 0 otherwise) |
v | value evaluated for bid request |
b | bid price for bid request |
reserved price for bid request | |
B | total budget of an advertiser |
a positive number infinitely close to 0 | |
z | feature vector denoting bid request |
bid price function that wins | |
probability function of winning | |
estimated value of advertised bid request | |
prior distribution of the feature vector for bid request ad |
Budget | 1/32 | 1/16 | 1/8 | 1/4 | 1/2 | ||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Metric | CER | WRC | CER | WRC | CER | WRC | CER | WRC | CER | WRC | |
Lin | ipinyou | 0.669 | 5.313 | 1.166 | 5.062 | 1.734 | 4.978 | 2.105 | 5.052 | 2.177 | 5.135 |
yoyi | 1.420 | 5.105 | 5.826 | 4.817 | 15.900 | 4.743 | 19.999 | 4.740 | 19.999 | 4.740 | |
Normal | ipinyou | 0.055 | 2.550 | 0.128 | 2.537 | 0.271 | 2.573 | 0.486 | 2.502 | 0.724 | 2.585 |
yoyi | 0.180 | 1.429 | 0.231 | 1.433 | 0.425 | 1.432 | 1.705 | 1.433 | 4.751 | 1.430 | |
Uniform | ipinyou | 0.089 | 3.350 | 0.193 | 3.895 | 0.456 | 4.115 | 0.915 | 3.928 | 1.401 | 4.090 |
yoyi | 0.521 | 2.288 | 0.540 | 2.278 | 1.129 | 2.287 | 3.053 | 2.285 | 11.553 | 2.282 | |
Gamma | ipinyou | 0.112 | 4.162 | 0.208 | 3.975 | 0.471 | 4.143 | 0.859 | 3.887 | 1.379 | 3.997 |
yoyi | 0.462 | 2.283 | 0.497 | 2.285 | 1.010 | 2.283 | 2.862 | 2.287 | 11.560 | 2.280 | |
GMM | ipinyou | 0.173 | 1.718 | 0.405 | 1.725 | 1.141 | 1.677 | 2.334 | 1.687 | 3.718 | 1.687 |
yoyi | 0.118 | 1.427 | 0.202 | 1.529 | 0.171 | 1.353 | 0.166 | 14.3 | 0.166 | 1.592 | |
DLF | ipinyou | 0.555 | 2.816 | 1.015 | 2.828 | 2.306 | 27.725 | 5.211 | 2.615 | 6.272 | 2.611 |
yoyi | 0.208 | 1.482 | 0.244 | 1.466 | 0.247 | 1.481 | 0.212 | 1.475 | 0.243 | 1.472 | |
DRLB | ipinyou | 8.962 | 29.510 | 9.822 | 28.615 | 8.235 | 15.083 | 7.088 | 19.427 | 4.476 | 13.360 |
yoyi | 79.398 | 73.275 | 46.065 | 23.460 | 58.283 | 42.432 | 44.447 | 31.852 | 52.897 | 43.047 | |
IIBidder | ipinyou | 26.249 | 52.373 | 24.055 | 45.688 | 14.176 | 54.110 | 16.567 | 22.567 | 22.132 | 71.337 |
yoyi | 50.913 | 23.827 | 71.430 | 60.452 | 75.622 | 56.535 | 55.325 | 28.522 | 53.773 | 30.577 |
Masked | 0% | 10% | 20% | 30% | 40% | 50% | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Metric | CER | WRC | CER | WRC | CER | WRC | CER | WRC | CER | WRC | CER | WRC | |
Lin | ipinyou | 1.875 | 2.260 | 1.812 | 2.992 | 1.707 | 3.994 | 1.531 | 5.178 | 1.381 | 6.854 | 1.117 | 9.370 |
yoyi | 13.491 | 2.624 | 16.653 | 3.276 | 14.258 | 4.046 | 12.656 | 4.980 | 10.646 | 6.182 | 8.068 | 7.866 | |
Normal | ipinyou | 0.360 | 1.724 | 0.380 | 2.054 | 0.350 | 2.316 | 0.330 | 2.502 | 0.300 | 3.014 | 0.277 | 3.686 |
yoyi | 1.642 | 1.009 | 1.453 | 1.124 | 1.285 | 1.272 | 1.269 | 1.446 | 1.644 | 1.698 | 1.459 | 2.040 | |
Uniform | ipinyou | 0.475 | 2.454 | 0.571 | 2.798 | 0.621 | 3.496 | 0.668 | 3.884 | 0.676 | 5.106 | 0.654 | 5.516 |
yoyi | 3.490 | 1.566 | 3.468 | 1.762 | 3.387 | 2.004 | 3.232 | 2.316 | 2.835 | 2.734 | 3.744 | 3.322 | |
Gamma | ipinyou | 0.451 | 2.152 | 0.588 | 3.100 | 0.626 | 3.664 | 0.634 | 4.008 | 0.660 | 5.004 | 0.678 | 6.268 |
yoyi | 3.450 | 1.564 | 3.265 | 1.758 | 3.280 | 2.004 | 3.040 | 2.318 | 2.782 | 2.740 | 3.853 | 3.318 | |
GMM | ipinyou | 1.355 | 1.162 | 1.474 | 1.324 | 1.530 | 1.498 | 1.481 | 1.728 | 1.696 | 2.012 | 1.793 | 2.468 |
yoyi | 0.159 | 0.778 | 0.168 | 1.222 | 0.177 | 1.226 | 0.153 | 1.362 | 0.180 | 2.202 | 0.151 | 2.008 | |
DLF | ipinyou | 3.332 | 20.2 | 3.240 | 2.244 | 2.915 | 2.466 | 2.982 | 2.766 | 2.802 | 3.182 | 3.176 | 3.692 |
yoyi | 0.275 | 1.054 | 0.279 | 1.172 | 0.254 | 1.312 | 0.209 | 1.296 | 0.211 | 1.736 | 0.155 | 2.082 | |
DRLB | ipinyou | 6.504 | 3.284 | 6.074 | 6.164 | 11.526 | 16.534 | 8.944 | 24.560 | 8.244 | 33.692 | 5.007 | 42.960 |
yoyi | 27.604 | 3.300 | 25.999 | 5.546 | 40.039 | 13.634 | 38.998 | 23.592 | 52.123 | 42.806 | 152.546 | 168.000 | |
IIBidder | ipinyou | 12.234 | 4.302 | 25.757 | 20.780 | 32.898 | 46.128 | 18.115 | 45.060 | 22.070 | 87.720 | 12.739 | 91.300 |
yoyi | 28.259 | 6.194 | 41.269 | 14.328 | 58.737 | 26.820 | 56.365 | 41.412 | 102.873 | 85.820 | 80.970 | 65.320 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Luo, X.; Chen, Y.; Zhuo, S.; Lu, J.; Chen, Z.; Li, L.; Tian, J.; Ye, X.; Tang, Y. Imagine and Imitate: Cost-Effective Bidding under Partially Observable Price Landscapes. Big Data Cogn. Comput. 2024, 8, 46. https://doi.org/10.3390/bdcc8050046
Luo X, Chen Y, Zhuo S, Lu J, Chen Z, Li L, Tian J, Ye X, Tang Y. Imagine and Imitate: Cost-Effective Bidding under Partially Observable Price Landscapes. Big Data and Cognitive Computing. 2024; 8(5):46. https://doi.org/10.3390/bdcc8050046
Chicago/Turabian StyleLuo, Xiaotong, Yongjian Chen, Shengda Zhuo, Jie Lu, Ziyang Chen, Lichun Li, Jingyan Tian, Xiaotong Ye, and Yin Tang. 2024. "Imagine and Imitate: Cost-Effective Bidding under Partially Observable Price Landscapes" Big Data and Cognitive Computing 8, no. 5: 46. https://doi.org/10.3390/bdcc8050046
APA StyleLuo, X., Chen, Y., Zhuo, S., Lu, J., Chen, Z., Li, L., Tian, J., Ye, X., & Tang, Y. (2024). Imagine and Imitate: Cost-Effective Bidding under Partially Observable Price Landscapes. Big Data and Cognitive Computing, 8(5), 46. https://doi.org/10.3390/bdcc8050046