TWI280488B - Online intelligent information comparison agent of multilingual electronic data sources over inter-connected computer networks - Google Patents
- Publication number
- TWI280488B, TW90124089A
- Authority
- TW
- Taiwan
- Prior art keywords
- information
- price
- vendor
- website
- search
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Description
This application claims priority under 35 U.S.C. §119(e) to Provisional Application No. 60/236,574, filed September 29, 2000, and Provisional Application No. 60/299,360, filed June 19, 2001.

Submitted as an appendix is a computer program listing on a total of two compact discs (one of which is a duplicate copy). The disc contains a single file named "implementation.txt", with a recorded creation date of September 18, 2001 and a size of 14K bytes. The material contained on the disc is hereby incorporated by reference.

The present invention relates generally to automated tasks on the World Wide Web (the Web), and more specifically to automated tasks, such as comparison shopping or conversation, carried out between an online buyer or user and multilingual sellers on the Web through a single interface, in order to improve communication efficiency and provide a personalized purchasing experience.

Since the creation of the World Wide Web in the mid-1990s, the Internet has grown more than a thousandfold. People no longer connect only through face-to-face conversation but through virtual communication channels. This technological revolution has fundamentally changed the way people live.

Developing in parallel with the Web is the "information technology generation", which represents an astonishing variety of online information resources, ranging from product information to academic papers. These elements have driven the exponential growth of e-commerce, which capitalizes on the convenience and low cost of the Internet.

There are millions of online vendors on the Web. Current comparison shopping or price comparison search engines can, at the request of a buyer or user, retrieve from competing websites any number of search results relating to a desired product and its desired price, yet the buyer or user still faces a boundless sea of information. Sometimes the buyer or user receives "page not found" search results because the search engine has missed the websites of existing online multilingual vendors that do sell the requested item in other Internet-connected countries (currently 245 in number). In addition, although information about products and vendors is easily accessible on the Web, the buyer or user remains in the loop at every stage of the purchasing process.

The potential of the Internet to transform the current mode of e-commerce into a truly global marketplace remains largely unrealized today, and electronic shopping is still not automated. Shopping on the Internet is far from simple, efficient, or enjoyable.
Buyer engines and centralized directory services are inefficient at locating the products that online buyers want and the merchants willing to sell those products or services. Moreover, typical online shopping procedures are mostly manually driven and require the buyer to type in every item and keyword to be searched. A prospective buyer therefore faces a daunting task and bears the burden of collecting and interpreting information about merchants and products. The result is that the user or buyer, overloaded with information, lacks sufficient time and expertise.

In order of complexity, the two imperfect strategies currently used to automate the processing of online catalog price comparison requests are:

(1) the non-real-time approach
(2) the real-time hard-coded wrapper approach

The non-real-time approach is the simplest way to implement a price comparison agent. It consists of manually collecting all the necessary information from the Web and then writing a separate HTML file for each item of the search results in order to display the results visually.
The benefits of this approach are obvious: it is easy to implement and search time is short. Despite these benefits, it has three major drawbacks. First, because price comparison is done manually, maintaining a large wrapper repository becomes very costly, especially as the Internet continues to grow. Second, considerable effort must be invested in updating prices and other information. Finally, the database needed to store and coordinate this information can become very large.

The real-time hard-coded wrapper approach is an alternative to the non-real-time approach. Instead of retrieving items directly as the non-real-time approach does, the real-time approach attempts to generalize HTML pages into a specific format. To perform this extraction, a custom wrapper procedure called pcwrapHLRT (a program acronym) is invoked. Figure 1 provides an example of the fixed portion of such a program, which contains a "while" loop. In this example, the principle behind the wrapper is to use pairs of delimiters to bound the target data on the HTML page. The pcwrapHLRT program succeeds because the website follows a uniform formatting convention: product items are presented in bold type and prices in italics.
pcwrapHLRT operates by scanning the HTML document for the specific strings {"<B>", "</B>", "<I>", "</I>"}, which identify the text fragments to be extracted. pcwrapHLRT designates these strings l_i, r_i, l_p and r_p respectively. The notation l_k (k ∈ {i, p}) denotes the delimiter marking the left side of the attribute to be extracted, and r_k denotes the right delimiter. Other attributes that a wrapper can extract include product names, images, and terms and conditions.

Given an HTML page, pcwrapHLRT scans the whole page sequentially, starting from the head. The outer loop checks whether additional model number and/or price pairs remain to be extracted by scanning the not yet scanned portion of the page for the "<B>" delimiter. Whenever a model number is found, the inner loop is invoked to extract the appropriate substrings of the page.

Few websites publish their formatting conventions. The designer of an information-gathering system that uses pcwrapHLRT must therefore build such a wrapper by hand for each source.
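The delimiter-based scan described above can be illustrated with a short sketch. It is not the pcwrapHLRT listing of Figure 1, which is not reproduced in this text; the function name `extract_rows`, the head and tail strings, and the sample page are assumptions made for illustration only. The outer loop looks for the next item delimiter in the unscanned part of the page, and the inner loop cuts out one substring per attribute.

```python
def extract_rows(page, head, tail, delimiters):
    """Extract one tuple per record from `page`.

    `delimiters` is a list of (left, right) string pairs, one pair per
    attribute, e.g. [("<B>", "</B>"), ("<I>", "</I>")].
    `head` and `tail` bound the region that holds the records
    (hypothetical values; a real site would need its own).
    """
    rows = []
    start = page.find(head)
    pos = start + len(head) if start != -1 else 0
    end = page.find(tail, pos)
    if end == -1:
        end = len(page)

    first_left = delimiters[0][0]
    while True:
        # Outer loop: is there another record before the tail?
        nxt = page.find(first_left, pos)
        if nxt == -1 or nxt >= end:
            break
        row = []
        # Inner loop: extract each attribute of the record in order.
        for left, right in delimiters:
            l = page.find(left, pos)
            if l == -1:
                return rows
            l += len(left)
            r = page.find(right, l)
            if r == -1:
                return rows
            row.append(page[l:r])
            pos = r + len(right)
        rows.append(tuple(row))
    return rows


# Hypothetical page: items in bold, prices in italics.
sample = ("<HTML><H1>Shop</H1><P><B>Model A</B> <I>$10</I><BR>"
          "<B>Model B</B> <I>$20</I><HR>footer <B>x</B></HTML>")
rows = extract_rows(sample, "<P>", "<HR>", [("<B>", "</B>"), ("<I>", "</I>")])
print(rows)
```

The matching here is case-sensitive and purely textual, which is exactly why such a wrapper breaks as soon as a site changes its formatting convention.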
Unfortunately, this hard-coding process is tedious and error-prone, because an ordinary HTML page may contain thousands of lines of code. Moreover, most websites change their formatting conventions periodically, which usually breaks the wrapper.

Another drawback of pcwrapHLRT is that search is rather slow, because the agent must contact the vendors' websites at the moment it receives the user's request. And because such wrappers are only partially automatic, additional administrative work is needed to analyze the HTML format manually in order to determine the wrapper.

In view of these commonly encountered problems, an alternative approach based on a new Internet strategy replaces manual and partially automatic operation with a fully automatic online intelligent price comparison agent. The agent eases the price comparison process for online catalog purchasing or shopping (including auctions), and provides a better navigation environment through a friendly Internet interactive agent character graphical user interface (IACGUI). This is particularly important as the so-called fourth-generation global marketplace architecture, namely agent-mediated B-to-C, C-to-C, B-to-B, e-procurement and auction, and G-to-B/C (government to business/consumer) bidding e-commerce and m-commerce (mobile commerce), becomes widely deployed. The system thus provides a better environment for consumer transactions.

Simply put, the online intelligent price comparison agent is an automated purchasing or shopping assistant that searches multilingual stores worldwide for each product and finds the deals. It also delivers value-added (customer rating) enterprise Web services to the online buyer/user. Such an agent is attractive because it relieves the user of the tedium of manually performing every operation in the consumer buying behavior model.
Conventionally, the buyer/user communicates through a front-end interface with the Web server of an online service and submits a search request; the Web server of the online service queries its database for matches and presents the results in the user's Web browser.

In the present invention, a user agent (an online intelligent price comparison agent acting on behalf of the human buyer/user) takes the items and keywords to be searched in the online catalog price comparison process and, in the best interest of the buyer/user, communicates over the interconnected computer networks of the Web with the many multilingual Web servers in any of the more than two hundred Internet-connected countries. The user agent then ranks the online vendor websites it has found and presents a summary of the search results to the online human user through a Web browser.

The benefits of applying the system of the present invention to multiple e-commerce segments are very significant. Communication efficiency and effectiveness are considerably improved, and the time and cost savings for both online vendors and online buyers are maximized. Most importantly, the user/buyer gains access to unprecedented and countless sources of information on a global scale, to tens of thousands of products, and to an immeasurable number of business opportunities. The system and method of the present invention will also help break down time and language barriers and demographic boundaries, and truly bring about the globalization of e-commerce. In addition, the personalized, continuously running, and automatic nature of the user agent makes it well suited to mediating buyer/consumer behavior. It is believed that the present invention will help optimize the entire purchasing experience and revolutionize current e-commerce.

It is therefore an object of the present invention to provide an improved price comparison of the products or services of online vendors.
Another object of the present invention is to construct vendor descriptions of online stores.
Another object of the present invention is to collect data, including the product samples and URLs used for training.
Another object of the present invention is to retrieve the training data before training is performed on a vendor website or online store.
Another object of the present invention is to collect training pages from online vendors using the information given in the training data.
Another object of the present invention is to generate vendor descriptions from the training data and the collected training pages.
Another object of the present invention is to store the generated vendor descriptions in an offline database.
Another object of the present invention is to provide a system administrator with an interface for adding, modifying, and deleting the vendors supported by the system.
Another object of the present invention is to provide an administrator with an interface for viewing vendor information.
Another object of the present invention is to provide a price comparison method by which a customer can initiate a price comparison.
Another object of the present invention is to parse HTML pages into useful data.
Another object of the present invention is to filter and sort the desired products/services.
Another object of the present invention is to provide a single interface for comparing the prices of different online multilingual vendors and different domains on the Internet or the World Wide Web.
In the system of the present invention, a first user agent is implemented in the form of a Semantics Recognition Learner Agent (SRLA). Using inductive learning, it performs real-time automatic wrapper induction on compiled or prepared training examples previously supplied by the system administrator, in order to learn the URL of a vendor website and the description of its domains. (In one embodiment, the SRLA connects to a Microsoft back-end SQL-compliant server or a Microsoft Access database and generates a vendor and product description once for each online store.) Wrapper induction is accomplished by building wrappers in real time for examples drawn from the vendor and product descriptions stored in the offline database. Using these examples, the SRLA automatically connects over the Internet, in real time, to the remote host of the vendor website and accesses the page that presents a particular example according to the supplied URL. It then intelligently fills in the relevant search form with the domain or product information and virtually "presses the Enter key" to submit a search request to the website. The page returned in response to the search criteria is either a success page containing the correct information or a failure page.
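The form-filling step described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the patent's implementation: it assumes the vendor's search form accepts a GET request with a single query field, and the names `build_search_url`, `is_success_page`, the URL `http://shop.example/search`, and the field name `q` are all hypothetical. A page counts as a success here only if every value of the known training example appears in it.

```python
from urllib.parse import urlencode


def build_search_url(search_form_url, query_field, keywords):
    """Fill in a one-field search form and return the request URL.

    Assumes a GET form; a site that expects POST would need the
    encoded fields sent in the request body instead.
    """
    return f"{search_form_url}?{urlencode({query_field: keywords})}"


def is_success_page(page_html, expected_values):
    """Return True if the returned page shows the training example,
    i.e. every expected value occurs in it (case-insensitive)."""
    lowered = page_html.lower()
    return all(value.lower() in lowered for value in expected_values)


# Hypothetical vendor and training example.
url = build_search_url("http://shop.example/search", "q", "dvd player")
good = is_success_page("<B>DVD Player X1</B> <I>$99.00</I>",
                       ["DVD Player X1", "$99.00"])
bad = is_success_page("<P>No results found</P>",
                      ["DVD Player X1", "$99.00"])
print(url, good, bad)
```

No network request is made in this sketch; fetching the page is left out so that the example stays self-contained.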
These result pages, which carry the unique vendor and product descriptions belonging to a particular vendor (whether or not that vendor has registered with the system), are then stored in a vendor description queue in the offline database (such as an SQL-compliant server or a Microsoft Access database), where they are maintained by the system administrator. Vendor URLs, vendor descriptions, and other information are preferably updated automatically on a schedule, once a day.

In the system of the present invention, a second user agent is referred to as the Semantics Recognition Buyer Agent (SRBA). The SRBA uses the vendor descriptions previously learned by the Semantics Recognition Learner Agent to search for matches while simultaneously accessing the various online multilingual vendor websites on the Web. The SRBA intelligently fills in each vendor's search form with the product information supplied by the online buyer or user and virtually "presses the Enter key". Each vendor then returns a search result page to the SRBA over the Web, so that the result pages arrive at roughly the same time as those returned by the other vendors. (The SRBA stores the returned pages in separate memory or cache memory as hits for later use by other SRBAs.) The SRBA analyzes the returned pages according to the corresponding vendor descriptions, extracts the relevant information and data from them, sorts prices and model numbers, and displays them to the online buyer/user as a formatted summary in a Web browser on the client machine.

In accordance with the present invention, a method is provided for a computer-implemented Semantics Recognition Learner Agent to perform inductive learning. The method includes retrieving vendor-specific training data in order to generate the corresponding vendor descriptions from the interconnected computer networks. The method includes collecting training pages using the information given in the training data stored in the vendor queue. Using the training data and the retrieved training pages, the method applies an inductive learning procedure to generate vendor-specific vendor descriptions from the training data and from the information extracted from the retrieved training pages.
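One simple way to picture the inductive step is to learn, from a training page and a few known example values, the strings that always appear immediately before and after those values. The sketch below does this with a longest-common-suffix and longest-common-prefix computation. It is an illustrative simplification under assumptions, not the learning procedure of the patent's figures; the function name `learn_delimiters` and the sample page are hypothetical.

```python
import os


def common_suffix(strings):
    """Longest string that every element of `strings` ends with."""
    reversed_prefix = os.path.commonprefix([s[::-1] for s in strings])
    return reversed_prefix[::-1]


def learn_delimiters(page, examples):
    """Induce a (left, right) delimiter pair for one attribute.

    `examples` are values known to appear on `page` (the training
    data). The left delimiter is the longest text that precedes every
    example; the right delimiter is the longest text that follows
    every example. Only the first occurrence of each value is used,
    which is a limitation of this sketch.
    """
    befores, afters = [], []
    for value in examples:
        i = page.find(value)
        if i == -1:
            raise ValueError(f"training example not on page: {value!r}")
        befores.append(page[:i])
        afters.append(page[i + len(value):])
    return common_suffix(befores), os.path.commonprefix(afters)


# Hypothetical training page and known prices.
training_page = ("<tr><td><b>Model A</b></td><td><i>$10</i></td></tr>"
                 "<tr><td><b>Model B</b></td><td><i>$20</i></td></tr>")
left, right = learn_delimiters(training_page, ["$10", "$20"])
print(repr(left), repr(right))
```

The learned pair can then be stored as part of a vendor description and reused to extract prices from other result pages of the same site, for as long as the site keeps its format.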
A method is provided for storing the retrieved and/or extracted vendor descriptions in an offline database for later use by a Semantics Recognition Buyer Agent (SRBA).

In accordance with the present invention, a method is provided for comparing the prices of the products or services of online vendors. The method includes an online user initiating a request for a particular product or service, after which the SRBA constructs the parameters of a search request using the predefined vendor descriptions. The method includes posting the request to the different online vendors and, preferably using a parser that incorporates the vendor descriptions, extracting data from the result pages returned by the online vendors. The method includes the SRBA constructing or assembling the sorted and filtered data in HTML format for presentation to the online buyer/user.

A method is provided for parsing, through the SRBA, the pages returned by online vendors in order to retrieve useful data. The method includes retrieving the vendor descriptions from an offline database, parsing the pages returned by online vendors in any Internet-connected country on the Web (currently 246), and collecting useful data using the information from the retrieved vendor descriptions.

In one embodiment of the invention, the above functions are available on the member Web page only after an online buyer has logged in as a registered temporary trial member or lifetime member.

In accordance with the present invention, a method is provided for real-time online search processing of selected information types over interconnected computer networks.
The method comprises several steps: assembling website descriptions for a plurality of websites on the interconnected computer networks, including, for each of the websites, (a) the URL of the website and the URL of its search form; (b) generalized rules describing how the selected information types are organized on the website; (c) sample data retrieved from the website corresponding to the selected information types, and a description of the domains found at the website; receiving a request for specific information types from an online user; identifying, from the website descriptions, the websites that have the specific information types; constructing, for each identified website, a search request for the specific information types using the website description; submitting the constructed search requests to the identified websites; receiving search results from the identified websites; and (d) upon locating correct matches in the received search results, extracting the information corresponding to the specific information types in the local language of the website and displaying the extracted information to the user.

More generally, the present invention relates to a method for real-time online search processing over interconnected computer networks. The method comprises the steps of: (a) maintaining information in an offline database for a plurality of vendor websites on the interconnected computer networks, the information including a URL, a search form URL, a description of the domains, and a vendor description, wherein the vendor description includes generalized rules describing how product information is organized at each of the vendor websites; (b) processing the parameters of a price comparison request for a desired product using the information maintained in the offline database, the price comparison request being received from an online user and/or a Semantics Recognition Buyer Agent; (c) extracting real-time price and product information from an identified one of the plurality of vendor websites, wherein the extracted price and product information is in the local language of the website; and (d) displaying the extracted price and product information to the user.

Figure 1 is an example of the permanent portion of a pcwrapHLRT program used in the conventional real-time hard-coded wrapper approach to extract information from a vendor website.
Figure 2 is a generalized diagram of the interaction, in a preferred embodiment of the present invention, among the user agent of the present invention, a user/buyer, and online vendors over the World Wide Web/Internet.
Figure 3 is an overall simplified flowchart 100 of how the Semantics Recognition Learner Agent (SRLA) operates to generate vendor descriptions from training data.
Figure 4 provides a description of the kinds of information that can be included in a vendor description in accordance with the present invention.
Figure 5 is an example of the data that can reside in the fields of the vendor description in accordance with the present invention.
Figure 6 is an overall simplified flowchart of the Semantics Recognition Learner Agent (SRLA) performing inductive learning and generating a generalized vendor description that is valid for an entire page.
Figure 7 provides an example of a formatted portion of a page from a website, as it appears to a person browsing the page on the Internet, together with the corresponding HTML code used to produce or define that formatting.
Figure 8 provides examples of the labels used, in accordance with the present invention, to locate the item description and price information in a training page.
Figure 9 is a generalized description of what the labels used during the training procedure represent, in one example of the present invention.
Figures 10A and 10B provide examples of possible candidate labels in the training process example illustrated in Figures 5 to 9.
Figure 11 is a simplified depiction of a single screen of a Web page exhibiting navigation regularity in accordance with the present invention, providing easy access to a query-specific database through a searchable index and product domain fields.
Figure 12 provides a simplified depiction of a single screen of a Web page exhibiting uniformity regularity, with all items arranged in a simple, consistent format. The page contains a frame holding the search results for the queried information, and these results are uniformly formatted.
Figure 13 is a simplified depiction of the same single screen shown in Figure 12, exhibiting vertical separation regularity, with the product listing of the search results located centrally between the head and the tail.
Figure 14 is a generalized illustration of the operation of the Semantics Recognition Learner Agent of the present invention.
Figure 15A shows the search results for the keyword "electronics" on the vendor website "www.800.com", in which each product has a brief description of its features and functions (at the left and middle of the listing frame) and the associated "list price" and "your price" information appears at the right of the listing frame, together with the information that the intelligent price recognizer of the Semantics Recognition Learner Agent of the present invention can recognize during the learning process for the vendor description.
Figure 15B is a generalized illustration of the Semantics Recognition Buyer Agent of the present invention accessing the vendor website "www.800.com" after the learning process for the vendor description shown in Figure 14.
Figure 15C is a flowchart 300 of how the Semantics Recognition Buyer Agent (SRBA) 20 of Figure 2 interacts with the vendor descriptions in response to an online buyer/user's price comparison request directed at anywhere from one to all of the available online multilingual vendors.
Figure 16 is an example of an interactive agent character learner interface screen used in the present invention to obtain training information.
Figure 17 is an example in which the training information has been filled in for the vendor "1cache.com".
Figure 18 shows the learner interface screen used to display vendor description information that has been learned.
Figure 19 is a single screen of the learner interface with the tab labeled "Vendor Information", through which vendor information can be entered or searched.
Figure 20 provides a single screen of the learner interface used to display the training examples previously entered for a particular vendor.
Figure 21 is a single screen of the learner interface displayed in response to opening the file called "Vendor Description".
Figure 22 shows the selection of a learning option in accordance with the present invention: the "learn one" option is shown as selected, and the vendor name that has been filled in is "1cache.com".
Figure 23 shows the learning result for the vendor "1cache.com".
Figure 24 shows the wrapper induction problem, stated by defining a simple model of information extraction.
Figure 25 provides pseudo-code for the procedure "execHLRT".
Figure 26 is a block diagram of a simple method for learning the head and tail delimiters of the pseudo-code.
Figures 27A and 27B provide a detailed listing and the associated subroutines for the procedure learnHLRT.
Figure 28 shows an embodiment of the present invention in which a user/buyer communicates with a server through an ASP (Active Server Pages) website (NextGen.asp) in order to execute a processing DLL (NextGen.dll) that resides on the server machine.
Figure 29 shows how the semantic-recognition buyer agent facilitates communication between the user and the database server.
Figure 30 provides a detailed flow chart of how to set up the SQL Server server in accordance with an embodiment of the present invention.
Figure 31 shows how the semantic-recognition buyer agent virtually posts a search to an online vendor website and receives the result.
Figure 32 is a simplified illustration of the "Main Menu" screen of the GUI, or interactive agent character shopper/buyer interface (IACS/BI), used in the present invention. Note that in the upper right corner of this "Main Menu" screen, product "channels" (categories) are offered for the user to select. A "quick screen" is also provided on the left side of the screen. Directly below it is a box in which an animated, self-typing display shows online human users how to use the quick search option. The left panel also provides a set of boxes through which members log in as temporary trial members or lifetime members. (Note that most of the portal functions of the present invention are disabled until the user's authentication has been validated.) The lower left corner provides a set of links to the online vendors registered with the portal of the present invention, and on the right is a large message box labeled "Feedback", through which online users can send annotated messages by e-mail to the e-mail server, preferably using Microsoft's Outlook Express e-mail application.
Figure 33 is a simplified illustration of a screen of the GUI or shopper/buyer interface of the present invention, in which companies are displayed in response to the online buyer/user having clicked the "government-to-business" text icon on the previous screen. However, because these companies, the so-called government-to-business e-commerce service or platform providers, currently strictly restrict members' rights to access their web servers by adopting authenticated security interfaces within fully closed computer network environments, the live screen cannot be operated.
Figure 34 is a simplified illustration of a display screen of the GUI or shopper/buyer interface of the present invention, providing details of the company that the user selected from the choices offered after clicking the "Advanced Search" option on the screen of Figure 33. Note that on this screen the headline in the panel box directly beneath the five domain tabs reads "Advanced agent is on". In addition, at the bottom of the screen the user is given a dialog box that can be filled in to run a search using the semantic-recognition buyer agent provided by the present invention. Again, however, because these government-to-business e-commerce service or platform providers currently strictly restrict members' rights to access their web servers by adopting authenticated security interfaces within fully closed computer network environments, the live screen cannot be operated.
Figure 35 is a simplified illustration of a screen of the GUI or shopper/buyer interface of the present invention, in which companies are displayed in response to the online buyer/user having clicked the "business-to-business" text icon on the previous screen.
However, because these companies, the so-called business-to-business e-commerce service or platform providers, currently strictly restrict members' rights to access their web servers by adopting authenticated security interfaces within fully closed computer network environments, the live screen cannot be operated.
Figure 36 is a simplified illustration of a display screen of the GUI or shopper/buyer interface of the present invention, providing details of the company that the user selected from the choices offered after clicking the "Advanced Search" option on the screen of Figure 35.
Figure 37 is a simplified illustration of a display screen of the GUI or shopper/buyer interface of the present invention, in which the selected items and their descriptions are displayed in response to the user selecting "Domain A" on the screen.
Figure 38 is a simplified illustration of a display screen of the GUI or shopper/buyer interface of the present invention, listing the vendors that sell items in Domain A, displayed in response to the user having clicked the "Advanced Search" option on the screen of Figure 37.
Figure 39 is a simplified illustration of a display screen of the GUI or shopper/buyer interface of the present invention, providing the details of the results of a search performed with the semantic-recognition buyer agent feature of the present invention. The search is performed in response to the request that the user submits through the search parameter interface displayed at the bottom of the screen of Figure 38.
Referring to Figure 2, a generalized diagram illustrates the interaction, over the World Wide Web/Internet 16, among the preferred embodiment 10 of the present invention, a user/buyer 12 and online vendors 14.
In the preferred embodiment 10 of the present invention, a learner agent 18 (also called the semantic-recognition learner agent, SRLA) and a shopper agent 20 (also called the semantic-recognition buyer agent, SRBA) are provided. A server 22 provides access to an offline database 24 that stores global multilingual vendor information. A system administrator 26 prepares and compiles training data for the selected vendor websites and stores it, through the server 22, in the "vendor queue" 27 of the offline database 24. The system administrator 26 can then use the training data and the semantic-recognition learner agent 18 to perform "inductive learning", over the World Wide Web 16, on training web pages retrieved from the vendor websites. This inductive learning produces vendor descriptions, held in the form of a vendor description queue 28, which are stored in the offline database 24.
The user/buyer 12 can use the preferred embodiment of the present invention to retrieve specified information on a specified subject by using the semantic-recognition buyer agent (SRBA) 20. The SRBA processes requests from the user/buyer 12 by using the information contained in the previously learned vendor descriptions 28. The information in the vendor descriptions 28 allows the SRBA to prepare and issue searches to many vendor websites over the World Wide Web 16 essentially simultaneously. The vendor descriptions also allow the SRBA 20 to process the received search results immediately and to present the results from all searched vendor websites to the user/buyer 12 with extraneous and irrelevant information filtered out.
Referring now to Figure 3, flow chart 100 shows the operation of the semantic-recognition learner agent (SRLA) 18 of an embodiment of the present invention.
In the preferred embodiment of the invention, the semantic-recognition learner agent 18 is implemented in a computer program executed on a server or a personal computer. In step 110, the learner agent retrieves predefined or previously prepared training data from the "vendor queue" 27 stored in the training database 35. The training database 24 is preferably offline.
The training data comprise a bundle of data about the online vendor whose information is to be learned. As shown in the right-hand column below, these data may include the URL, the domain description, product samples and attributes, and other domain-dependent information.

Vendor name              1cache.com
Vendor URL               http://www.cache.com
Vendor search form URL   http://st4.yahoo.com/cgi-bin/nsearch?catalog=1cache&query=
Learning domain          dvd
Training examples        i) DVD virtual notebook theater with I-Glasses
                         ii) JVC XV523GD Dolby Digital DVD player
                         iii) Pioneer DVL-919 combination LD/DVD/CD player
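The training record in the table above could be held in a small data structure. The following is a minimal sketch, not the implementation on the patent's program listing: the class name, field names and the `search_url` helper are hypothetical, and the search-form URL is our reading of the garbled table entry.

```python
from dataclasses import dataclass, field
from typing import List
from urllib.parse import quote_plus


@dataclass
class TrainingRecord:
    """One entry of the vendor queue (all field names are illustrative)."""
    vendor_name: str                 # used as the primary key of the record
    vendor_url: str
    search_form_url: str             # prefix to which the query text is appended
    domain: str
    examples: List[str] = field(default_factory=list)

    def search_url(self, query: str) -> str:
        """Append the URL-encoded query to the vendor's search-form URL."""
        return self.search_form_url + quote_plus(query)


record = TrainingRecord(
    vendor_name="1cache.com",
    vendor_url="http://www.cache.com",
    # Assumed reading of the table entry; the original text is garbled.
    search_form_url="http://st4.yahoo.com/cgi-bin/nsearch?catalog=1cache&query=",
    domain="dvd",
    examples=["JVC XV523GD Dolby Digital DVD player"],
)

# Build the URL the learner agent would request for one training example.
print(record.search_url("JVC XV523GD"))
```

Keeping the search-form URL as a prefix means a training page can be requested by plain string concatenation, which matches the way the table lists the URL with an empty query parameter at the end.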
Figure 4 shows examples of the name, type and description of the post-training (learned) data fields in accordance with the present invention. Figure 5 shows a sample table of the actual "data elements" that were learned; they were produced during the vendor description learning process of Figure 4, are stored in the vendor descriptions of the offline database, and are maintained by the system administrator 26.
The "post-training" data are preferably stored in a SQL-compliant or Microsoft Access database. This adds scalability to the choice of data containers from different suppliers. Typically, the post-training data are independent of the online vendor's product domain, written characters and presentation style. The exception is the URL path in the post-training data, which is required in order to identify each vendor uniquely.
Returning to Figure 3, in step 120 a check is made to see whether any vendors remain to be learned by the semantic-recognition learner agent 18. If vendors are still waiting to be learned, the learner agent proceeds to step 130; otherwise the learning process terminates. In step 130, the learner agent 18 uses the predefined training data to access the particular vendor to which that training data corresponds. The learner agent 18 searches the vendor website for the specific products identified in the training data. Typically, the learner agent 18 retrieves several web pages for the training data, either through the system of the present invention or through manual input by the system administrator; these are called "training web pages" and are used later for inductive learning. In the preferred embodiment, control data (training data that trigger the vendor website's error page) are also included at this stage.
Next, in step 140, the computer program performs inductive learning on the training web pages obtained by the learner agent 18. The goal of inductive learning is to obtain a description of the website's attributes and of how it organizes product data and logically presents that data to potential online customers. The product of this learning is called a "vendor description"; this stage is further described and explained with reference to Figure 6.
Then, in step 150, the learner agent 18 preferably stores what it has learned in the SQL-compliant or Microsoft Access database 24. (The vendor information, or "vendor description" 28, stored in the offline database 24 is used later by the online semantic-recognition buyer agent 20.)
After step 150 is completed, the semantic-recognition learner agent 18 returns to step 120 to see whether any vendors remain to be learned. If so, steps 130 to 150 are repeated. Otherwise, the learning process terminates.
Turning now to Figure 7, the vendor description learning process is explained in further detail using a simple model of information extraction and a simplified example of a training web page. The left side of Figure 7 shows the model listing and price information as it would appear when a potential customer browses the vendor website. The right side shows the HTML code. Its first three lines give the name of the listing, "Simple Product Catalog", and mark the beginning of the information to be displayed. The fourth line provides the title text of the table, "MD price". The sixth and seventh lines provide the column names, "Model" and "Price". The eighth to eleventh lines provide the model and price content. The remaining lines mark the end of the table, the end of the listing, and the end of the body of the product catalog.
First, a wrapper function produces a set of "labels" for the given training web page. Labels are used to locate the information about the training products within the training web page. Figure 8 shows, for the simple product training web page of Figure 7, the set of labels produced by the semantic-recognition learner agent 18. The label L in Figure 8 indicates that the simple product catalog page of Figure 7 contains four "tuples", each of which contains one "item" value and one "price" value. A pair of integers represents each value.
Consider the first pair, <174, 180>. These integers indicate that the item attribute of the first tuple is the substring between positions 174 and 180, that is, the string "HM381MD". As used in this example, a position is the number of characters counted from a designated starting point, such as the start of the web page or the end of the page's "head". Each space between text characters counts as one character position. Inspection of Figure 7 shows that the letter "H" of the string "HM381MD" appears at character 174, counted from the "<" character in the first line, and that the "D" of the string appears at character position 180.
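The labeling step described above can be illustrated with a short sketch. It assumes positions are counted from the start of the page, zero-based and inclusive at both ends; the text only says that positions are counted from a designated starting point, so that convention, the function name `label_page` and the sample page are all assumptions made for illustration.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # (begin, end), both inclusive, counted from the page start


def label_page(page: str, rows: List[Tuple[str, str]]) -> List[Tuple[Span, Span]]:
    """Locate each (item, price) pair in the page and return its position label.

    Values are searched left to right, so a repeated value is matched at its
    next occurrence. str.index raises ValueError if a value is missing.
    """
    labels = []
    cursor = 0
    for item, price in rows:
        b_i = page.index(item, cursor)
        e_i = b_i + len(item) - 1
        b_p = page.index(price, e_i + 1)
        e_p = b_p + len(price) - 1
        labels.append(((b_i, e_i), (b_p, e_p)))
        cursor = e_p + 1
    return labels


# Hypothetical training page; the real page of Figure 7 is not reproduced here.
PAGE = (
    "<html><body><table>"
    "<tr><td>MD2070</td><td>599.95</td></tr>"
    "<tr><td>MDR3</td><td>399.95</td></tr>"
    "</table></body></html>"
)

labels = label_page(PAGE, [("MD2070", "599.95"), ("MDR3", "399.95")])
for (b_i, e_i), (b_p, e_p) in labels:
    # Slicing with the label recovers the original strings.
    print((b_i, e_i), PAGE[b_i:e_i + 1], (b_p, e_p), PAGE[b_p:e_p + 1])
```

Because the end position is inclusive, a seven-character model name spans seven positions, which agrees with the pair <174, 180> in the example.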
Similarly, the last pair, <356, 361>, indicates that the price attribute of the last tuple appears between character positions 356 and 361, and identifies the string "399.95". It will be understood that although character positions are used in this example to identify "labels", other criteria can be used within the spirit of the present invention and with its methodology. For example, examining Figure 7 again, consider that the semantic-recognition learner agent 18 of Figure 2 autonomously assigns position values to the model and "your price" of the four electronic products, which is expressed as follows:

Model     Price
MD2070    599.95
MD203     249.95
MDR3      399.95

\[
L = \left\{
\begin{array}{ll}
\langle b_{1,i},\, e_{1,i} \rangle, & \langle b_{1,p},\, e_{1,p} \rangle \\
\langle b_{2,i},\, e_{2,i} \rangle, & \langle b_{2,p},\, e_{2,p} \rangle \\
\langle b_{3,i},\, e_{3,i} \rangle, & \langle b_{3,p},\, e_{3,p} \rangle \\
\langle b_{4,i},\, e_{4,i} \rangle, & \langle b_{4,p},\, e_{4,p} \rangle
\end{array}
\right\}
\]

Here "b" stands for begin and "e" for end, so the values that identify the position of the second tuple include b_{2,i}, the position at which the model begins (the letter "M"), and e_{2,i}, the position at which the model ends. By analogy, it will be understood that labeling is automated by a heuristic search in a module driven by a standard relational data model comprising an "item recognizer" and an "intelligent price recognizer", in which each tuple in turn is a vector of two substrings <b_{2,i}, b_{2,p}>. The string b_{-,i} is the value of the item attribute, and the string b_{-,p} is the value of the price attribute. Attributes therefore correspond to columns, and tuples to rows. The subscript in "b_{2,i}" thus denotes a position in the second row. The position values (labels) are therefore computed in real time, automatically and on the fly whenever an efficiently learnable wrapper induction is triggered to validate the vendor description for the whole of P_pc (the product catalog pages, that is, the pages "p" that contain the desired information), regardless of whether the pages formatted on the vendor website (in this example www.800.com) are written in the native character strings of any language, are encoded in HTML, XML, cXML, Java and the like, or are in natural language.
The labeling of training web page content is presented more generally in Figure 9. The first row identifies the information that is labeled; in this example, product and price are the information sought. In the second row, the "labels" are recorded as "tuple" entries of corresponding <product left delimiter, product right delimiter> and <price left delimiter, price right delimiter>.
After the system administrator has run the learning system once, it retrieves the training web pages from the vendor queue in the offline database and compiles a set of possible candidate delimiters, in parallel with compiling the possible candidate delimiter sets of Figures 10A and 10B. As the example above shows, it then goes on to use another set of training web pages in the real-time, on-the-fly computation of position values (labels). Because the intersection of the two candidate sets yields a valid candidate set, the semantic-recognition learner agent 18 selects one valid candidate as the vendor description.
Referring now to Figure 6, flow chart 200 shows an embodiment of the semantic-recognition learner agent 18 of the present invention.
The methodology uses three environmental regularities that govern the layout of the product descriptions offered on shopping websites, which allows information extraction to proceed independently of the domain. These are the navigation regularity (searchable indexes), the uniformity regularity, and the vertical separation regularity.
Regarding the navigation regularity, online stores or vendor websites are designed to serve inquiries from consumers and business buyers. Almost all online vendors therefore provide a searchable index for easy access to the particular database being queried. Using the vendor website's search form enables the semantic-recognition learner agent 18 to generalize over the formats of multilingual home pages and web pages. Figure 11 is a simplified diagram showing a home page with a searchable index and a product domain field.
Regarding the uniformity regularity, although online stores or vendors differ widely from one another in their product description formats, any given online vendor typically lays out all item descriptions in a simple, consistent format. Figure 12 is a simplified view of a web page screen showing a layout of uniformly formatted search results. It can be seen that each search result listing begins with a "model" string, beneath which a "product description" is provided. In addition, "Price 1" and "Price 2" are to the right of the "product description".
Figure 13 provides a simplified view of the same search results screen, showing the use of vertical separation to display the product listing. This vertical format can be divided into the head, the content and the tail of the document.
The original design of the Internet's information infrastructure (the training architecture, the format of online vendors' product descriptions, and the presentation technology) is intended for use by humans.
This is evident in the query mechanisms and output standards in use, which are particularly suited to direct human manipulation. Online vendors conform to these regularities because doing so facilitates online sales to human shoppers or buyers. Although making an online store easy for humans to navigate does not guarantee that it is friendly to an intelligent software agent, the system of the present invention, the online intelligent information comparison of multilingual electronic data sources, is designed to take advantage of these regularities.
In accordance with the present invention, wrapper construction is carried out through inductive learning. The methodology learns a vendor's wrapper by reasoning over sample vendor web pages. In this methodology, instances correspond to vendor web pages, the label of a page corresponds to its relevant content, and hypotheses correspond to the constructed wrappers.
In addition, in accordance with the present invention, an efficiently learnable wrapper class, such as the HLRT wrapper class, is incorporated.
Also, to ensure that the methodology performs well when the training data exhibit a high level of noise, noise-tolerant techniques are used. For example, given the single-screen example from www.800.com in Figure 15A, the intelligent price recognizer can distinguish "list price" from "your price". The recognized instances can then be validated in order to label the whole page. Given one recognizer for items and another for prices, validation yields a labeling methodology that labels pages containing pairs of these attributes.
In practice, vendors create a recognizable identity by using a uniform look for all types of products. For example, a vendor presents MD product information in the same format as DVD products.
By taking advantage of this regularity, every product can be assumed to be described in the same format.
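The practical consequence of this assumption is that a wrapper learned on one product domain can be executed on pages of another. The sketch below follows the general head-left-right-tail (HLRT) idea named above; it is not the execHLRT pseudo-code of Figure 25, which is not reproduced in the text. The delimiter strings, both sample pages and the DVD price are hypothetical.

```python
from typing import List, Sequence, Tuple


def exec_hlrt(page: str, head: str, tail: str,
              delims: Sequence[Tuple[str, str]]) -> List[Tuple[str, ...]]:
    """Extract rows from a page using a head-left-right-tail wrapper.

    head/tail bracket the listing; delims holds one (left, right) delimiter
    pair per attribute. str.index raises ValueError on a malformed page.
    """
    rows = []
    pos = page.index(head) + len(head)           # skip everything up to the head
    first_left = delims[0][0]
    while True:
        next_left = page.find(first_left, pos)
        next_tail = page.find(tail, pos)
        # Stop when no further row begins before the tail delimiter.
        if next_left == -1 or (next_tail != -1 and next_tail < next_left):
            break
        row = []
        for left, right in delims:
            start = page.index(left, pos) + len(left)
            end = page.index(right, start)
            row.append(page[start:end])
            pos = end
        rows.append(tuple(row))
    return rows


# Wrapper assumed to have been learned from the MD domain (hypothetical values).
HEAD, TAIL = "<p>", "<hr>"
DELIMS = [("<b>", "</b>"), ("<i>", "</i>")]

MD_PAGE = ("<html><title>Catalog</title><body><b>MD</b> products<p>"
           "<b>MD2070</b> <i>599.95</i><br>"
           "<b>MD203</b> <i>249.95</i><br>"
           "<hr><b>End</b></body></html>")

# A page from another domain in the same layout; the price is made up.
DVD_PAGE = ("<html><title>Catalog</title><body><b>DVD</b> products<p>"
            "<b>XV523GD</b> <i>100.00</i><br>"
            "<hr><b>End</b></body></html>")

print(exec_hlrt(MD_PAGE, HEAD, TAIL, DELIMS))
print(exec_hlrt(DVD_PAGE, HEAD, TAIL, DELIMS))
```

The head and tail delimiters are what keep the bold text in the page header and footer from being mistaken for product rows, which is why a plain left/right delimiter pair would not be enough on these pages.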
The semantic-recognition learner agent 18 of Figure 2 learns a wrapper only from the particular domain of the examples and attempts to apply it to all other domains (all other page categories under entirely different ontology items) that are organized in a consistent format and classified online on the remaining websites of the 245 Internet-connected countries of the World Wide Web. With the learner agent 18 of Figure 2, maintaining a fully updated nomenclature for a global product database is therefore feasible, without writing program code for a SQL-compliant database and without manually keying every product of every domain into a Microsoft Access database.
Continuing with Figure 6, in step 210 the semantic-recognition learner agent 18 generates a set of labels to represent the content of the training web page. Put another way, labeling position values identifies the location information of the training products in the training web page. Again, the learner agent 18 of Figure 2 generates, in real time, on the fly and automatically, a label containing the following position values:

\[
L = \left\{
\begin{array}{ll}
\langle b_{1,i},\, e_{1,i} \rangle, & \langle b_{1,p},\, e_{1,p} \rangle \\
\langle b_{2,i},\, e_{2,i} \rangle, & \langle b_{2,p},\, e_{2,p} \rangle \\
\langle b_{3,i},\, e_{3,i} \rangle, & \langle b_{3,p},\, e_{3,p} \rangle \\
\langle b_{4,i},\, e_{4,i} \rangle, & \langle b_{4,p},\, e_{4,p} \rangle
\end{array}
\right\}
\]

For details, see the right-hand column of Figure 8.
In step 220, the semantic-recognition learner agent 18 performs inductive learning on the retrieved training web pages, using the associated labels, to output a set of possible candidate "vendor descriptions". Because the candidates are generated from particular training web pages with particular training data, they cannot be invalid for those pages. However, for a candidate to be valid across the whole vendor website, cross-page validation must be carried out to derive a generalized vendor description that is valid for the entire vendor website.
In step 240, a vendor description validator (VDV) validates the possible candidate vendor descriptions against another set of training web pages (retrieved in step 130 of Figure 3). If a vendor description is satisfactory, the learning process stops (see Figure 3); otherwise validation continues, refining the choice of vendor description by using the remaining candidates and passing through the loop of steps 230, 240 and 250.
One criterion for a vendor description being "satisfactory" is that the candidates from each subsequently analyzed training web page are identical, in number and in characters, to those from the previous training web page. If the candidates of a subsequently analyzed training web page differ in number from those of the earlier page, another training web page must be analyzed.
Figures 10A and 10B provide examples of the left and right delimiters of candidate vendor descriptions for item descriptions, price information, and the head and tail of the training web pages. An example of a "vendor description" is given in Figure 5; it includes the identifying delimiters for the item description, the price information, and the head and tail of the training web page.
The training data for a particular vendor are preferably used by the system administrator 26. As described in further detail herein, to add a particular vendor to the system, the vendor name, vendor URL, search form URL and domain data of the corresponding training examples are provided and stored in the offline database 24, which may be, for example, a Microsoft Access database. The vendor name is the primary key of a record. Manual wrapper input can be offered as an alternative. To provide an accurate set of training examples (which in turn greatly improves the accuracy and efficiency of the vendor information that the semantic-recognition learner agent 18 learns when preparing to generate vendor descriptions in real time, automatically and on the fly), it is important that the system administrator or other person who prepares the training examples can understand the web page URLs and domain names, has some understanding of the local language used on whichever multilingual vendor website is being processed, and can identify the type of information that is the learning target. This person does not need to understand program code.
Once a vendor's information has been supplied, the administrator 26 can run the learner agent process for each vendor. After running the learner agent 18 once for a vendor, the administrator can navigate the learning process by stepping through the screens of the Interactive Agent Character Learner Interface (IACLI) while carrying out any of the options shown in Figures 16 to 23. Finally, the set of "vendor description" results extracted from the returned training pages is stored in the offline database 24, such as a Microsoft Access database. To delete a vendor, the administrator can delete its record directly from the "vendor queue" or the "vendor description queue". To modify a vendor, the administrator can edit its record in the "vendor queue" or "vendor description queue" of the database.

In a nutshell, the learner agent 18 of the present invention generates a unique vendor description for a particular vendor. A vendor description is a set of generalized rules describing how a vendor organizes its product information in a particular format. The input to the wrapper construction system of the present invention is therefore essentially a sample of the behavior of the wrapper to be learned. Under this formulation, wrapper construction becomes the process of reconstructing a wrapper from a sample of its behavior.
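To make the stored record concrete, the following is a minimal sketch of a vendor description and of the add, edit and delete operations on the queue. The class and field names are assumptions chosen for illustration, and an in-memory dictionary stands in for the offline database; only the idea of a record keyed by vendor name and holding head, tail and left/right markers comes from the description.

```python
from dataclasses import dataclass


@dataclass
class VendorDescription:
    """One record of the vendor description queue (field names are illustrative)."""
    vendor_name: str            # primary key of the record
    vendor_url: str
    search_form_url: str
    domain: str
    head: int = 0               # position where the useful part of the page starts
    tail: str = ""              # marker string that ends the useful part
    item_left: str = ""
    item_right: str = ""
    price_left: str = ""
    price_right: str = ""

    def is_learned(self) -> bool:
        """True once all four attribute markers have been filled in."""
        return all((self.item_left, self.item_right,
                    self.price_left, self.price_right))


class VendorDescriptionQueue:
    """In-memory stand-in for the offline database table, keyed by vendor name."""

    def __init__(self):
        self._records = {}

    def save(self, record: VendorDescription) -> None:
        self._records[record.vendor_name] = record   # insert, or overwrite on edit

    def delete(self, vendor_name: str) -> None:
        self._records.pop(vendor_name, None)

    def get(self, vendor_name: str):
        return self._records.get(vendor_name)
```

Before learning, the marker fields are empty; after the learner agent has run, they hold the strings it induced.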
The methodology of the Semantics Recognition Learner Agent (SRLA) 18 is summarized in Figure 14 by way of a simplified example. In step 1, two pieces of information are fed into the system for wrapper induction: (1) the URL of the vendor website (e.g. http://www.800.com) and (2) a domain description, which includes some training examples for the particular domain. For example, a domain description may be "electronics", and a record for this domain may be "Sony HM381MD", a model number used to fill in the vendor's search form. In steps 2 and 3, the learner agent 18 automatically uses the URL and the domain/model number from the training examples to reach the vendor website over the Internet. In this example, the learner agent goes to the www.800.com page at the URL supplied in step 1. It then fills in the relevant search form with the necessary product information (i.e. the domain description "electronics" and "HM381MD"). Finally, it submits the search form to request a search and waits for a response.

In step 4, a result page is returned according to the search criteria. The result may be a successful result page with relevant product descriptions, or a failure page. Note that the content of interest in the returned page is the HTML code, the item descriptions, the item prices, and the positions of this information relative to the HTML code.

In steps 5 and 6, the search result page is returned over the Internet to the learner agent 18 for analysis. The analysis performed in step 7 is called "wrapper induction": the page is generalized into a set of layout and formatting rules that the vendor logically follows to present its vendor description. With these rules, the Semantics Recognition Buyer Agent (SRBA) 20 of the present invention can extract product information from the same vendor when a buyer/user searches for product information in the same domain of that vendor's website.

It will be understood that the learner agent process according to the present invention is carried out for each vendor for which a vendor description is to be obtained. Because of the information marker approach used by the invention, a vendor description can be obtained from any vendor website in any language. In short, although the language presented to the user may be a particular local character string, the underlying code by which the desired information is identified as markers stays the same whatever the local character string is. In other words, the information in the vendor description is obtained from the web page in the local language used by the vendor website, with no need to translate the local language into a standard language. Moreover, because the candidate markers identified for each vendor website are not written on the basis of the programming language used for the web page, later searches can be completed regardless of the programming language used in the pages being searched. This allows the buyer agent 20 to search across multiple languages and multiple domains (product categories), independently of any programming language.

Referring to Figure 15C, flowchart 300 shows an embodiment of the buyer agent 20 of the present invention. In step 310, the buyer agent 20 receives a request from the buyer/user of Figure 2 for a price comparison of a product. Also in step 310, the buyer agent 20 preferably establishes a connection to the user/buyer 12 using an ActiveX component for communication. The user 12 must supply at least one parameter, which may include, for example, the desired product name, the desired price range, the target online vendors, or the storage criteria. Step 312 checks whether there is a "hit" in the memory or cache that holds the desired information from the identified vendor websites. If so, the buyer agent proceeds to step 370 to store the extracted target information. The buyer agent then proceeds to step 380 to generate a result page in HTML from the target information, and in step 390 it displays the result page to the online human user/buyer.

If no "hit" is found in step 312, step 320 calls on the buyer agent 20 to use the input parameters to retrieve the vendor descriptions from the offline database 24. These "vendor descriptions" were defined earlier by the learner agent 18 during the vendor description learning process. In step 330, the buyer agent 20 composes new requests from the user's request in order to access the different online vendors identified in the "vendor description queue". The composed requests are based on the parameters given by the user and the data in the vendor descriptions. Preferably, if N online vendors are requested (e.g. for a product model request), the buyer agent 20 composes N new requests.

The buyer agent 20 uses the vendor descriptions to obtain real-time price information from vendor websites. It navigates to a vendor website using the vendor's URL and the vendor name included in the information making up the vendor description; the vendor description also includes the URL of the vendor's search form. In step 340, after navigating to the vendor website, the buyer agent 20 "virtually" fills in the vendor's search form according to the user's new request and "virtually" presses the Enter key to submit it. This is done for each identified online vendor.

As noted above, the vendor description in the offline database 24 includes a field giving the vendor's search form URL, such as "http://www.onlineshop.com/search.asp?item=".
The buyer agent 20 uses the user's input parameters and the search form URL to compose a new HTTP request for each identified online vendor. For example, if the user wants to buy a "hard disk", the new request composed by the buyer agent 20 will be:

"http://www.onlineshop.com/search.asp?item=harddisk"

and the buyer agent 20 sends this HTTP request to the online vendor as if the user had submitted it personally. If there are N identified vendors, the buyer agent 20 starts N threads to fill in the N search forms. The buyer agent 20 preferably proceeds in parallel against each online vendor's searchable index, filling in and submitting a search request to each.

The buyer agent 20 then waits for responses from the online vendors within a specified default or user-defined timeout. If the timeout occurs, the buyer agent proceeds to step 370; otherwise it proceeds to steps 358 and 360 to further process the search result data received.

During the timeout period, the buyer agent 20 collects the responses to the search requests from the different online vendors. In step 358, it receives the search result responses from the online vendors and stores them in cache or memory in the server 22. In step 360, the data of interest is extracted from the received responses. The buyer agent 20 extracts the desired data using the information stored in the vendor description 28 or the vendor description queue in the offline database 24. For example, a vendor description includes fields identifying the codes used as left and right wrappers. First, the Semantics Recognition Buyer Agent (SRBA) 20 uses the left wrapper information to locate the start of the valid data in the response page. The exact position of the target data (as defined by the information in the vendor description queue) is then stored in memory (recall that the information in the vendor description was learned by the learner agent 18 during the learning process of Figures 3 and 6). Extraction of the target data is repeated until the end of the page. During extraction, for example, the product description and product price are extracted.

It will be understood that the information contained in the vendor description defined by the learner agent is domain-independent but language-dependent. For example, suppose the online buyer's platform uses the Windows 98 operating system in language version "B" (or, preferably, runs the English version of Windows 2000 Professional and/or has Personal Web Server version "B" installed); Microsoft Internet Explorer will prompt the buyer to download the "B" language display software after logging in to the portal of the present invention. When online buyer/user A types a product model, in the local characters of language "B", as a keyword in the text box provided by the portal of the present invention, the buyer agent 20 of Figure 2 performs data extraction using the previously described example data (captured in the vendor description earlier, after real-time wrapper induction learning), which contains the local character string for the product model in language "B". These vendor descriptions are stored in a predefined data structure in the vendor description queue in the offline database 24 (preferably a Microsoft Access database). Data extraction involves a temporary search for a "hit" (a page containing the price, description and related product information from a previous search result) in the memory or cache held in the server 22, using the exact local character string for the product model in language "B" typed by the user, as shown at step 312 of Figure 15C. Because the character string of language "B" is in a particular local language, any "hit" found will be for identified vendor websites that use local language "B" and contain the character string in language "B".

Recall from step 7 of Figure 14 that, during the learning process, the vendor description is stored in the offline database 24 (preferably a Microsoft Access database) or the database server 22 (preferably a Microsoft SQL-compliant database server) after the learner agent 18 of Figure 2 has learned the wrapper from the online vendor. Because it would be inefficient for the buyer agent 20 to retrieve the vendor description data on every request from an online buyer, the vendor descriptions are preferably retrieved from the offline database 24 or the server 22 only the first time the buyer agent 20 requests them for a desired set of search-match-extract operations. The vendor descriptions are then kept in memory or cache for faster retrieval and for later requests from the buyer agent 20 on behalf of the same or new online users.

The vendor descriptions in memory or cache are preferably updated automatically every day.

In other words, the Semantics Recognition Buyer Agent (SRBA) 20 can use the data in a vendor description to locate target data in different domains and in different languages. This is because, for a particular vendor, although the language may change, the underlying code corresponding to the target information will not.
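The request composition and the parallel, time-limited search can be sketched as follows. This is a minimal sketch under stated assumptions: the function names and the `fetch` callable are hypothetical, keywords are URL-encoded with the Python standard library, and a thread pool stands in for the N threads. It is not the implementation described in the filed program listing.

```python
from concurrent.futures import ThreadPoolExecutor, wait
from urllib.parse import quote_plus


def compose_request(search_form_url, keyword):
    """Append the URL-encoded keyword to the vendor's search form URL."""
    return search_form_url + quote_plus(keyword)


def search_all(vendors, keyword, fetch, timeout_s=5.0):
    """Query every vendor in parallel and keep the responses that arrive in time.

    vendors: dict mapping vendor name to search form URL.
    fetch:   callable taking a URL and returning the page text (assumed, e.g. an
             HTTP GET helper supplied by the caller).
    """
    pool = ThreadPoolExecutor(max_workers=max(1, len(vendors)))
    futures = {pool.submit(fetch, compose_request(url, keyword)): name
               for name, url in vendors.items()}
    done, _late = wait(futures, timeout=timeout_s)
    pool.shutdown(wait=False)   # do not block on vendors that missed the timeout
    return {futures[f]: f.result() for f in done if f.exception() is None}
```

Passing `fetch` in as a parameter keeps the sketch free of network code; in use it would be whatever routine actually issues the HTTP request.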
Because "format regularity" prevails across most vendor websites, such as B-to-C, C-to-B and C-to-C online stores, the different domains on a vendor website will consistently use the same format and underlying code to present target information such as item descriptions and prices.
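Given such regularity, extraction with stored markers can be sketched as below. This is a minimal sketch, assuming the head is a character offset and the tail is a marker string; the function name and parameter names are illustrative, and the loop simply alternates between the item markers and the price markers until no further record is found.

```python
def extract_records(page, head, tail, item_left, item_right,
                    price_left, price_right):
    """Return a list of (item, price) string pairs found between head and tail."""
    body = page[head:]                 # skip the head of the page
    end = body.find(tail)
    if end != -1:
        body = body[:end]              # ignore everything from the tail marker on
    records, pos = [], 0
    while True:
        fields = []
        for left, right in ((item_left, item_right), (price_left, price_right)):
            start = body.find(left, pos)
            if start == -1:
                return records         # no further record on this page
            start += len(left)
            stop = body.find(right, start)
            if stop == -1:
                return records
            fields.append(body[start:stop])
            pos = stop + len(right)
        records.append(tuple(fields))
```

The extracted text is returned exactly as it appears on the page, so no translation step is involved: the markers are matched against the underlying code, not against the displayed language.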
So for each search response returned, the buyer agent 20 performs data extraction using the vendor description. If the timeout is reached, the buyer agent 20 proceeds to step 370 of Figure 15C. In step 370, the buyer agent 20 sorts the data extracted from the different online vendors according to the user-defined sorting criteria. If the user has not defined sorting criteria, the default is the price of the product. Alternatively, the sorting criterion may be to identify the best price found and present to the user/buyer only the information from the vendor with the best price (other sorting criteria may, incidentally, be used).

After sorting, the buyer agent 20 proceeds to step 380, where it composes an HTML page from the data filtered and sorted in step 370. In step 390, the buyer agent 20 responds to the user's request by presenting the composed HTML page as a "results" page, using the previously established ActiveX component.

If the timeout has not been reached at step 350, the buyer agent 20 proceeds to step 358. In step 358, the buyer agent 20 stores the data of the search result pages in memory or cache so that it can respond instantly to further new requests from the same user/buyer or to requests from new users/buyers. After step 358, the buyer agent 20 proceeds to step 360, where it extracts the target information from the search result pages; in step 370 it sorts the results, i.e. the data extracted from the different online vendors, according to the user-defined sorting criteria; in step 380 it composes the HTML page from the data filtered and sorted in step 370; and finally, in step 390, it responds to the user/buyer using the previously established ActiveX component.
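Steps 370 and 380 can be sketched as follows. This is a minimal sketch, assuming each extracted result is a dictionary with hypothetical keys `vendor`, `item` and `price`; sorting defaults to price when the user supplies no criterion, and the page layout is illustrative only.

```python
from html import escape


def sort_results(results, key=None):
    """Sort extracted results; with no user-defined key, sort by ascending price."""
    return sorted(results, key=key or (lambda r: r["price"]))


def build_result_page(results):
    """Compose a simple HTML results table from already sorted results."""
    rows = "".join(
        "<tr><td>{}</td><td>{}</td><td>{:.2f}</td></tr>".format(
            escape(r["vendor"]), escape(r["item"]), r["price"])
        for r in results)
    return "<html><body><table>" + rows + "</table></body></html>"
```

The "best price only" alternative mentioned above amounts to keeping just the first element of the price-sorted list before building the page.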
The default language of the buyer agent 20 is English. By default, the buyer agent 20 goes to all vendors when it receives a user's request. When the responses come back, the buyer agent 20 uses the vendor descriptions learned by the learner agent 18 to filter out invalid results.

In another embodiment of the invention, vendors can be classified by the user's locale, so that the user 12 can choose "advanced search" to search a group into which vendors have been classified.

The methodology used by the present invention is inherently multilingual. When the learner agent 18 learns a vendor website, the learning can be carried out in the website's local language. The results that are extracted and used to populate the vendor description for that site will be in the site's local language. Thus, when an online user/buyer submits a request in a particular local language at step 310 of Figure 15C, the buyer agent 20 at step 312 looks for a "hit" in memory or cache using the exact character string typed by the user.
Because the character string will be in a particular local language, any "hit" found will be for identified vendor websites that use the same local language and contain the same character string. With this methodology, it will be understood that the present invention needs no "translation" step to convert a search request submitted in one language into a "standard" language. By using the local language of the search request, the errors and ambiguities that a translation step would introduce are avoided.

The computer program modules built with the development tools of the database server used in the preferred embodiment of the system (preferably a Microsoft SQL-compliant database) are standard, and the present invention can use relational databases such as ODBC-capable SQL database servers from Oracle Corporation of Redwood Shores, California, Sybase, Inc. of Emeryville, California, and others. As noted above, multi-threaded processing for simultaneous searching is important to the preferred embodiment of the invention. In this regard, the Windows NT 4.0 platform (a product of Microsoft Corporation) provides this multi-threading capability.

Referring now to Figures 16 to 23, the compilation and preparation of the training data in the "vendor queue" 27 and of the data in the "vendor description queue" 28 will be discussed in more detail. Figure 16 is an actual screen of the Interactive Agent Character Learner Interface (IACLI), which can be used to obtain training information for use in the present invention. The screen corresponding to the "Add Vendor" label shown in Figure 18 provides data entry points for entering the information that the system administrator obtains after viewing the vendor website "1cache.com". An example is given in Figure 17, in which this information has been entered for the vendor "1cache.com".
This information includes the following:

Vendor name: 1cache.com
Vendor URL: http://www.cache.com
Vendor's search form URL: http://st4.yahoo.com/cgi-bin/nsearch?catalog=lcache&query=
Learning domain: DVD
Training examples:
i) DVD virtual notebook theater with I-Glasses
ii) JVC XV523GD Dolby Digital DVD player
iii) Pioneer DVL-919 combination LD/DVD/CD player

The information above is then placed in the vendor queue 27 as training data in the offline database 24. Note that the training examples entered are a list of specific products, which will be searched for in real time on the designated vendor website during training in order to obtain the training pages. The "vendor description" is then "learned" from these returned training pages.

The information can then be displayed on the learner interface screen corresponding to the "Vendor Information" label, as shown in Figure 18. The "Vendor Information" screen (see Figure 19) provides a "Search" function by vendor name. When a vendor name is typed in and the "Search" button is pressed, the vendor information for that vendor is retrieved from the offline database 24 and displayed. Note that on this "Vendor Information" screen the "wrapper" fields ("Head", "Tail", "Item left marker", "Item right marker", "Price left marker" and "Price right marker") are empty.

Figure 20 shows a single screen of the learner interface used to display the training examples previously entered for a particular vendor. The screen is displayed in response to opening the file called "Training Data". As with the "Vendor Information" screen of the learner interface, a search function is provided for "Training Data". To use it, the system administrator types the vendor name and presses the "Start" button.
This causes the "training data" list previously entered for that vendor to be displayed as shown in the figure. Note that the "Training Data" interface provides other functions: "Add" (add further examples), "Delete" (delete a training example), "Edit" (edit a training example), "Save" (save the list of examples in its current state) and "Cancel" (discard the changes entered).

Referring now to Figure 21, a single screen of the learner interface is displayed in response to opening the file called "Vendor Description". This interface starts the process of "learning" a vendor description, and gives the system administrator the choice of learning "all" vendors (for which training data has been entered) or "a certain" vendor (whose name the administrator types in the box provided).

In Figure 22, "Learn one" is selected and the vendor name entered is "1cache.com". With the system connected to the World Wide Web, the "Learn Now" button is pressed to launch the Semantics Recognition Learner Agent (SRLA) 18, which "learns" the vendor information for the vendor "1cache.com" by running the training examples specified for this vendor against the website in real time.

When the learning/training process has finished, the results of the training examples returned by the website are displayed on a learner interface screen, as shown in Figure 23. Again, to display this information, the system administrator can use the search function on the Vendor Information screen of Figure 19, typing the vendor name (here "1cache.com") and pressing the Search button. Figure 23 shows a snapshot of the learning results for the vendor "1cache.com". Note that the "wrapper" fields have now been filled in. The "Head" of the page is shown as occurring at the value "5230". This value "5230" specifies a line, character position or other location information.

"Tail" indicates that the position of the items is given by the following marker string:

"D></TD><TD></TD></TR></"

For the item description, the left marker is given as the string:

"G SRC=/Lmg/trans+ lxl.gif BORDER_OWID..."

The right marker of the item description is given as the string "</b>".

The left marker of the price is given as the string:

"</b></A></TD><TD ALIGN=right><FON..."

Finally, the right marker of the price is given as the string "</T>".
五、發明說明(38) 雖然第23囷中項目之左標記與價格之右標記的字元 子串因學習者介面之靜止顯示狀態呈現被裁斷,其將被了 解被語意分辨學習者代理器18定出之左標記字串的所有 字元將被儲存於賣主描述28 ,較佳地是在微軟Access資 料庫内’且稍後將被語意分辨買者代理器20使用。 現在,做為語意分辨學習者代理器18之基礎的方法論 將以「觀念證明」的程度更詳細地被描述。 該包裹歸納問題如第24圖顯示地被框為資訊抽取之 簡單模型的形式。 如第24圖顯示者,網頁p為包含所欲資訊之Web網 頁。p被採用為某些字母之字串。典型上,該字母為ASCII 字元集,且網頁為HTML文件。舉例而言,如稍早被經營 之第7圖係由一賣主網站被獲得之非常簡單的網頁。在「標 示術語」而言,此網頁自此將被稱為Ppc(產品目錄之網 頁)。注意本發明方法論係以使用HTML被啟動或激發但 不依賴於此。例如,該等網頁可為自然語言文字或可符合 XML標準。 一標準的關係資料模型被採用。與每筆產品記錄有關 的是二分明的屬性··項目與價格。此處「項目」代表產品 名稱或型號,及「價格」代表產品之價格。 一「元組」(tuple)為二字串之向量<Ai,Ap>。字串Ai 為「項目」屬性之值,及字串Ap為「價格」屬性之值, 而屬性代表關係模型之行,及「元組」代表列。因而如第 8圖顯示者,第7囷之產品目錄網頁的例子包含四個「元V. INSTRUCTIONS (38) Although the left subtag of the item in item 23 and the character substring of the right mark of the price are cut off due to the static display state of the learner interface, it will be understood to be semantically distinguished by the learner agent 18 All of the characters of the left-tagged string will be stored in the vendor description 28, preferably in the Microsoft Access repository' and will be used semantically to distinguish the buyer agent 20 later. Now, the methodology that is the basis for semantically distinguishing learner agents 18 will be described in more detail with the degree of "concept proof." The package induction problem is shown in Figure 24 as a simple model of information extraction. As shown in Figure 24, web page p is the web page containing the desired information. p is used as a string of certain letters. Typically, the letter is an ASCII character set and the web page is an HTML file. For example, Figure 7, which was operated earlier, is a very simple web page obtained from a seller's website. In the case of "marking terms", this page will be referred to as PPC (the web page of the product catalog). Note that the methodology of the present invention is initiated or activated using HTML but is not dependent on this. For example, such web pages may be natural language text or may conform to XML standards. 
A standard relational data model is adopted. Associated with each product record are two distinct attributes: item and price. Here "item" denotes the product name or model number, and "price" denotes the price of the product. A "tuple" is a vector <Ai, Ap> of two strings. The string Ai is the value of the "item" attribute and the string Ap is the value of the "price" attribute; attributes correspond to the columns of the relational model and tuples to its rows. Thus, as shown in Figure 8, the example product catalog page of Figure 7 contains four tuples.
The first of these four tuples is <'HM381MD', '399.95'>.

The content of a page is the set of tuples it contains. This literal string notation is adequate, but because pages have unbounded length, a more concise representation of a page's content is used instead. A label represents the content of a page precisely by a set of indices into the page, rather than by listing the attribute values explicitly.

For example, the label Lpc for the simple product catalog page (Ppc) is shown in the right-hand column of Figure 8.

The label indicates that the simple product catalog contains four tuples, each consisting of an item value and a price value. A pair of integers represents each value. Consider the first pair, <174,180>: these integers indicate that the item attribute of the first tuple is the substring between positions 174 and 180, namely the string 'HM381MD'.
Inspection of the character string on the right-hand side of Figure 7 reveals that these integers correspond to character positions, counted from the "<" of "<HTML>" in the first line. Similarly, the last pair of integers in the fourth tuple, <356,361>, indicates that the last attribute, the price, occurs between positions 356 and 361, namely the string '399.95'.

More generally, the content of a page P can be represented by a label

L = { <<b_{1,i}, e_{1,i}>, <b_{1,p}, e_{1,p}>>, ..., <<b_{m,i}, e_{m,i}>, <b_{m,p}, e_{m,p}>>, ... }

For a page with only a single tuple, the result is the label

L = { <<b_{1,i}, e_{1,i}>, <b_{1,p}, e_{1,p}>> }

The label L encodes the content of page P. The page contains |L| > 0 tuples, each of which has two attributes, item and price. The integer 1 ≤ m ≤ |L| indexes the tuples within the page. Each pair <b_{m,i}, e_{m,i}> encodes an item value, and each pair <b_{m,p}, e_{m,p}> encodes a price value. The value b_{m,i} is the index in P at which the item value of the m-th tuple begins, and e_{m,i} is the index at which that item value ends. Similarly, b_{m,p} is the index in P at which the price value of the m-th tuple begins, and e_{m,p} is the index at which that price value ends. Thus the item attribute of the m-th tuple occurs between b_{m,i} and e_{m,i}, and the price attribute of the m-th tuple occurs between b_{m,p} and e_{m,p}. For example, the pair <b_{2,i}, e_{2,i}> = <229,234> in Figure 8 encodes the item attribute of the second tuple in the simple product catalog page of Figure 7.

As shown above, a wrapper W is a function from a page to a label; the notation W(P) = L indicates that the result of invoking wrapper W on page P is the label L. At this level of abstraction, a wrapper is simply an arbitrary procedure.

A wrapper class is a set of wrappers. As will be seen below, the wrappers used by the present invention belong to the class called the HLRT wrapper class.
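The label representation described above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the page below is a hypothetical two-row catalog (not the page of Figure 7, so its indices differ from those of Figure 8), the second price is invented, and the helper names `span_of` and `decode` are not taken from the patent. Indices are inclusive at both ends, matching the <174,180> convention used for 'HM381MD'.

```python
from typing import List, Tuple

Span = Tuple[int, int]              # (begin, end) character indices, end inclusive
Label = List[Tuple[Span, Span]]     # one (item span, price span) pair per tuple


def span_of(page: str, value: str, start: int = 0) -> Span:
    """Locate `value` in `page` at or after `start`; return inclusive indices."""
    begin = page.index(value, start)
    return (begin, begin + len(value) - 1)


def decode(page: str, label: Label) -> List[Tuple[str, str]]:
    """Turn a label back into the (item, price) strings it points at."""
    return [(page[ib:ie + 1], page[pb:pe + 1]) for (ib, ie), (pb, pe) in label]


# Hypothetical two-row catalog page; it is not the page of Figure 7.
PAGE = (
    "<HTML><TABLE>"
    "<TR><TD><B>HM381MD</B></TD><TD><I>399.95</I></TD></TR>"
    "<TR><TD><B>MD2070</B></TD><TD><I>249.95</I></TD></TR>"
    "</TABLE></HTML>"
)

item1 = span_of(PAGE, "HM381MD")
price1 = span_of(PAGE, "399.95", item1[1])
item2 = span_of(PAGE, "MD2070", price1[1])
price2 = span_of(PAGE, "249.95", item2[1])
LABEL: Label = [(item1, price1), (item2, price2)]

print(decode(PAGE, LABEL))
```

The label stores only integer pairs, so its size depends on the number of tuples and not on the length of the attribute values.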
With the terms and formulas used to describe the methodology of the present invention explained above, how the learner learns a vendor's product catalog pages will now be described further.

Intuitively, the input to the learning system of the present invention is a sample of product catalog pages together with their associated labels. At this point it is assumed that the labels have already been determined and are given; the method for generating labels for the sample pages is described later herein. The output is a wrapper W. Ideally, W outputs the appropriate label for all sample pages. In general no such guarantee can be made, so (in the spirit of inductive learning) W is required only to produce the correct label for a particular set of training examples.

Stated as a problem, wrapper induction (for a particular wrapper class) is as follows:

Input: a set ε = {..., <Pn, Ln>, ...} of training examples, where each Pn is a page and each Ln is its label;

Output: a wrapper W belonging to the wrapper class such that W(Pn) = Ln for every <Pn, Ln> in ε.

As explained earlier, the pcwrapHLRT procedure illustrates a programming idiom: using the head, left, right and tail delimiters from the vendor's product catalog to extract the relevant product information and prices. The head-left-right-tail (HLRT) wrapper class is one way of formalizing this idiom. The "execHLRT" procedure set out in Figure 25 is a generalization of pcwrapHLRT that allows the delimiters to be arbitrary strings, in place of the specific values such as "<B>" and "</B>" used in the earlier implementation of pcwrapHLRT.

Note that although the delimiters in this example are whole HTML tags, the methodology of the present invention is not limited to operating on HTML tags. Indeed, the text need not be HTML at all. For example, the dollar sign "$" could be a valid left delimiter for a price such as "$399.95".

The execHLRT subroutine defines how an HLRT wrapper operates.
Earlier, W(P) was defined as the label that results from invoking wrapper W on page P. The execHLRT subroutine is the procedure for computing W(P) from W and P in the case where W is an HLRT wrapper.

Here l_i and r_i denote the left and right delimiters of the item attribute, l_p and r_p denote the left and right delimiters of the price attribute, and h and t denote the head and tail of the page. (Note that h is a line number rather than a string. For example, if h = 100, the first 100 lines of the page are the head, and the semantics recognition buyer agent 20 skips these lines immediately when it searches for a product.) For example, if execHLRT is invoked with h = 7, l_i = "<B>", r_i = "</B>", l_p = "<I>", r_p = "</I>" and t = "</TABLE>", then execHLRT behaves like pcwrapHLRT.

More generally, any HLRT wrapper for a vendor website is equivalent to a vector (h, l_i, r_i, l_p, r_p, t), and any such vector can be interpreted as an HLRT wrapper. Given this equivalence, the notation (h, l_i, r_i, l_p, r_p, t) is used as shorthand for the HLRT wrapper obtained by partially evaluating execHLRT with those delimiters.

Since an HLRT wrapper is simply a vector (h, l_i, r_i, l_p, r_p, t), inducing an HLRT wrapper for the pages of Figures 7 and 8 from a set ε = {..., <Pn, Ln>, ...} of example pages and their labels amounts to identifying these six delimiters. More precisely, the following constraint satisfaction problem (CSP) is to be solved:

Variables:
h: the head delimiter of page P
t: the tail delimiter of page P
l_i: the left delimiter of the item attribute
r_i: the right delimiter of the item attribute
l_p: the left delimiter of the price attribute
r_p: the right delimiter of the price attribute

Domain: each delimiter, other than the head delimiter, may be any string.

Constraint: W(Pn) = Ln for every <Pn, Ln> in ε, where the HLRT wrapper W = (h, l_i, r_i, l_p, r_p, t).

The learnHLRT procedure, which addresses the constraint satisfaction problem above, will now be described.
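The behavior attributed to execHLRT above can be sketched in Python as follows. This is a minimal sketch and not the procedure of Figure 25, which is not reproduced in this text. It assumes the head h is given as a string rather than a line number, that end indices are inclusive, and that every delimiter occurs where expected; the page, the second price, and the names `exec_hlrt` and `extract` are hypothetical.

```python
from typing import List, Tuple

Span = Tuple[int, int]                       # inclusive character indices
Wrapper = Tuple[str, str, str, str, str, str]  # (h, l_i, r_i, l_p, r_p, t)


def exec_hlrt(wrapper: Wrapper, page: str) -> List[Tuple[Span, Span]]:
    """Apply an HLRT wrapper to a page and return its label."""
    h, l_i, r_i, l_p, r_p, t = wrapper
    pos = page.index(h) + len(h)             # skip past the head
    label = []
    while True:
        next_item = page.find(l_i, pos)
        next_tail = page.find(t, pos)
        # Stop when no further item delimiter occurs before the tail.
        if next_item == -1 or (next_tail != -1 and next_tail < next_item):
            break
        row = []
        for left, right in ((l_i, r_i), (l_p, r_p)):
            begin = page.index(left, pos) + len(left)   # skip the left delimiter
            end = page.index(right, begin)              # value runs up to the right one
            row.append((begin, end - 1))
            pos = end
        label.append((row[0], row[1]))
    return label


def extract(wrapper: Wrapper, page: str) -> List[Tuple[str, str]]:
    """Convenience helper: return the extracted strings instead of indices."""
    return [(page[a:b + 1], page[c:d + 1]) for (a, b), (c, d) in exec_hlrt(wrapper, page)]


# Hypothetical catalog page with bold text before and after the table.
PAGE = (
    "<HTML><TITLE>Catalog</TITLE>"
    "<B>A Simple Product Catalog</B>"
    "<TABLE><TR><TH>Item</TH><TH>Price</TH></TR>"
    "<TR><TD><B>HM381MD</B></TD><TD><I>399.95</I></TD></TR>"
    "<TR><TD><B>MD2070</B></TD><TD><I>249.95</I></TD></TR>"
    "</TABLE><B>End of catalog</B></HTML>"
)
WRAPPER: Wrapper = ("</TH></TR>", "<B>", "</B>", "<I>", "</I>", "</TABLE>")

print(extract(WRAPPER, PAGE))
```

In this sketch the head keeps the bold page title from being mistaken for an item, and the tail stops the loop before the bold text that follows the table.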
To begin, note that the domains of the delimiter variables are tightly constrained by the examples ε. At a minimum, the delimiters must be substrings of the example pages. Of course, one can do better. On the basis of the single example (Ppc, Lpc) alone, it can be seen that r_p (the right delimiter of the price attribute) must be a prefix of "</I></TD></TR>↵", where "↵" denotes a newline character. A "prefix" is a run of characters of a string that starts at the string's leftmost character; for example "<", "</", "</I", and so on.

Note that if r_p is not a prefix of this string, every wrapper with that delimiter fails at least to extract "399.95" as the price attribute of the fourth tuple of Ppc. The candidates for r_p are therefore all the prefixes of "</I></TD></TR>↵". These candidates are shown in Figure 10A.

In detail, the candidate delimiters for the simple product catalog page are generated as follows.

Consider the left delimiter l_p of the price attribute. Recall the fragments that precede the prices in Figure 7: "HM381MD</B></TD><TD><I>", "MD2070</B></TD><TD><I>", and so on. From these fragments it can be inferred that l_p must be a suffix of "</B></TD><TD><I>". The candidates for l_p are therefore the 16 non-empty suffixes of this string; they can be seen in Figure 10A. A "suffix" is a run of characters of a string that ends at the string's rightmost character; for example ">", "I>", "<I>", and so on.
The delimiter l_i is the most complicated, because the strings that precede the first attribute occur both between the last attribute of one tuple and the first attribute of the next tuple, and between the head of the page and the first tuple. In this example, the strings to consider are "<TR><TD><B>" and "</I></TD></TR>↵<TR><TD><B>". Clearly l_i must be a suffix of these strings, so the candidates for l_i can be generated by enumerating the suffixes of the shortest such fragment.

To generalize: given the example set ε, the candidates for l_i and l_p, written cands_l(i, ε) and cands_l(p, ε), are generated by enumerating the suffixes of the shortest string occurring to the left of each instance of the item attribute or the price attribute in each example. (As mentioned in the preceding paragraph, the item attribute is something of a special case: the suffixes of the shortest string occurring between adjacent tuples or before the first tuple must be enumerated.) For example, if ε = {(Ppc, Lpc)}, then:

cands_l(i, ε) = { "</I></TD></TR>↵<TR><TD><B>", ... }
cands_l(p, ε) = { "</B></TD><TD><I>", ... }

Candidates for the right delimiters are generated similarly, with two differences. First, the strings considered occur to the right of the attribute in question (rather than to the left). Second, r_i and r_p must be prefixes (rather than suffixes) of these strings. For example, in the simple product catalog example, r_i must be a prefix of the string "</B></TD><TD><I>", while r_p must be a prefix of both "</I></TD></TR>↵" and "</I></TD></TR>↵<TR><TD><B>".

In particular, given the example set ε, the candidates for the right delimiter of attribute k, written cands_r(k, ε), are generated by enumerating the prefixes of the shortest string occurring to the right of each instance of attribute k in each example. (Just as l_i is a special case above, r_p is a special case here: the shortest string occurring between adjacent tuples or after the last tuple is enumerated.) For example,

cands_r(i, ε) = { "</B></TD><TD><I>", ... }
cands_r(p, ε) = { "</I></TD></TR>↵<TR><TD><B>", ... }

A similar analysis applies to the head and tail delimiters. The head is a prefix of the page that precedes the first occurrence of the item attribute. Note that the head is represented here as a string. When a wrapper is actually implemented, to improve the performance of the present invention, the head is preferably represented as an integer, so that the head of the page can be skipped quickly, without examining its contents, when the wrapper is used to find product information for the human shopper or buyer. Converting the head to an integer requires only finding the number of lines that the head string spans.

Identifying the tail delimiter is very similar to identifying the right delimiters r_i and r_p.
The tail is a suffix of the string that follows the last price attribute on the page.

cands(head, ε) = { "<HTML>↵<TITLE>...PRICE($US)</TH></TR>", ... }
cands(tail, ε) = { "</I></TD></TR>↵</TABLE>...</HTML>", ... }

Given these candidates for each delimiter, pseudocode for a simple method of learning the delimiters is provided in Figure 26.

Because this procedure runs in time proportional to the product of the numbers of candidates for the individual delimiters, and because each delimiter has many candidates, it is very slow.

A more efficient procedure can be obtained by observing that the delimiters r_i, l_p and r_p are mutually independent: whether a candidate is valid for a particular delimiter does not depend on the choice made for any other delimiter. For example, whether "</B>" is satisfactory for r_i can be evaluated without any commitment to the other delimiters.

To see that this independence holds, recall the execHLRT procedure. At each point in its execution, execHLRT searches its input page P for exactly one of the delimiters r_i, l_p or r_p. If any of these searches fails to find the correct position in P, the label output by execHLRT will be incorrect. But whether these searches return the correct answer depends only on the delimiter under consideration and the example pages, not on the other delimiters.

Put another way, once a particular candidate has been selected for one of the delimiters (r_i, l_p, r_p), it cannot be made invalid by whichever candidates are selected for the other delimiters. The converse also makes intuitive sense: if a candidate is invalid, it cannot be repaired however carefully candidates are selected for the other delimiters. Note that this independence is guaranteed; it is not merely a heuristic for speeding up learning.

The significance of this observation is that the three delimiters r_i, l_p and r_p can be learned in isolation. In pseudocode, they can be learned as follows:

1. Generate the candidate sets.
2. For each delimiter, select a valid candidate.

This method is much faster than the procedure of Figure 26: it runs in time proportional to the sum (rather than the product) of the numbers of candidates for each delimiter.

However, not all delimiters are mutually independent. For the delimiters h, t and l_i, by contrast, whether a particular character string is valid for one of these three delimiters depends on the choice made for the other two. For example, is "<B>" valid for l_i? The answer depends on the choice of h and t. If h = "<HTML>", then "<B>" is not a valid l_i, because execHLRT does not skip the irrelevant boldface text "<B>A Simple Product Catalog</B>". Conversely, if h = "</TH></TR>", then l_i = "<B>" poses no problem. Similarly, l_i interacts with t: if t = "</HTML>", then l_i = "<B>" is unacceptable, but if t = "</TABLE>", it is acceptable. As a result, the candidates for the three delimiters h, t and l_i must be considered jointly: all combinations of candidates for h, t and l_i are enumerated, and a valid combination is selected.

The second step of this improved method involves characterizing precisely the conditions under which a candidate delimiter is valid.

First consider the delimiters r_i and r_p. Once execHLRT has found the beginning of an instance of an attribute, it attempts to find the end of that instance. A candidate "u" for delimiter r_i or r_p must therefore satisfy two constraints:

Constraint C1: "u" must not be a substring of any instance of the attribute in any example page.

Constraint C2: "u" must be a prefix of the text that occurs immediately after each instance of the attribute in each example page.

If a candidate "u" for r_i or r_p violates these constraints, every wrapper using it fails on at least one of the examples in ε. If C1 is violated, the extracted attribute will be too short; if C2 is violated, it will be too long.
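The validity test for a right delimiter can be sketched in Python. The sketch reads C1 as requiring that the candidate not occur inside any instance of the attribute (otherwise the extracted value would be cut short) and C2 as requiring that the candidate begin the text immediately after every instance. The page, the second price and the names `spans_of` and `valid_right` are hypothetical, and end indices are assumed inclusive.

```python
from typing import List, Tuple

Span = Tuple[int, int]   # inclusive character indices


def spans_of(page: str, values: List[str]) -> List[Span]:
    """Locate each value in order and return its inclusive span."""
    spans, pos = [], 0
    for value in values:
        begin = page.index(value, pos)
        end = begin + len(value) - 1
        spans.append((begin, end))
        pos = end + 1
    return spans


def valid_right(u: str, page: str, spans: List[Span]) -> bool:
    """Check a candidate right delimiter u against C1 and C2 on one page."""
    # C1: u must not occur inside any instance of the attribute.
    c1 = all(u not in page[b:e + 1] for b, e in spans)
    # C2: u must start the text immediately after every instance.
    c2 = all(page.startswith(u, e + 1) for _b, e in spans)
    return c1 and c2


# Hypothetical page; the second price is invented for illustration.
PAGE = (
    "<HTML><TABLE>"
    "<TR><TD><B>HM381MD</B></TD><TD><I>399.95</I></TD></TR>"
    "<TR><TD><B>MD2070</B></TD><TD><I>249.95</I></TD></TR>"
    "</TABLE></HTML>"
)
ITEM_SPANS = spans_of(PAGE, ["HM381MD", "MD2070"])
PRICE_SPANS = spans_of(PAGE, ["399.95", "249.95"])

print(valid_right("</I>", PAGE, PRICE_SPANS))
print(valid_right("</I></TD></TR><TR>", PAGE, PRICE_SPANS))
```

The second candidate fails C2 because the last price is followed by the end of the table rather than by another row, which is the same reason the longer r_p candidate is rejected in the catalog example.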
To summarize, these conditions must hold for a candidate "u" to be a valid value for delimiter r_i or r_p with respect to a set ε of examples. The conditions are written valid_r(u, k, ε): valid_r(u, k, ε) holds if and only if candidate "u" for the right delimiter of attribute k satisfies constraints C1 and C2 with respect to the example set ε. Returning to the example, applying the valid_r test to the candidates generated by cands_r gives:

For the right delimiter of the item attribute:
valid_r("</B></TD><TD><I>", i, ε) = true

For the right delimiter of the price attribute:
valid_r("</I></TD></TR>↵<TR><TD><B>", p, ε) = false

The execHLRT procedure also searches for the delimiter l_p. A candidate "u" for l_p must satisfy the following constraint:

Constraint C3: "u" must be a proper suffix of the text that occurs immediately before each instance of the price attribute in each example page.

If this constraint is violated, every wrapper using "u" disagrees with the examples ε. At a minimum, the begin index b_{m,p} (computed by execHLRT) will be incorrect: smaller or larger than the correct value, or undefined, depending on how "u" violates the constraint.

To summarize, this condition must hold for a candidate "u" to be a valid value for delimiter l_p with respect to the example set ε. The condition is written valid_l(u, k, ε): valid_l(u, p, ε) holds if and only if candidate "u" for delimiter l_p satisfies constraint C3 with respect to ε. Returning to the simple product catalog example Ppc, it can be seen that:

valid_l("</B></TD><TD><I>", p, ε) = true.
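A check of constraint C3 for the price's left delimiter can be sketched in Python as follows. One interpretation is assumed here and is not confirmed by the text: a "proper suffix" is taken to mean that the candidate ends the text between an item and its price and does not occur earlier in that text, so that a left-to-right search lands in the right place. The page, the second price and the function names are hypothetical.

```python
from typing import List, Tuple

Span = Tuple[int, int]
Label = List[Tuple[Span, Span]]


def label_for(page: str, rows: List[Tuple[str, str]]) -> Label:
    """Build a label (inclusive indices) from known (item, price) strings."""
    label, pos = [], 0
    for item, price in rows:
        ib = page.index(item, pos)
        ie = ib + len(item) - 1
        pb = page.index(price, ie + 1)
        pe = pb + len(price) - 1
        label.append(((ib, ie), (pb, pe)))
        pos = pe + 1
    return label


def valid_left_price(u: str, page: str, label: Label) -> bool:
    """Check a candidate l_p against C3 on one labeled page."""
    if not u:
        return False
    for (_ib, ie), (pb, _pe) in label:
        gap = page[ie + 1:pb]                 # text between the item and its price
        # u must end the gap, and its first occurrence must be that final one.
        if not gap.endswith(u) or gap.find(u) != len(gap) - len(u):
            return False
    return True


# Hypothetical page; the second price is invented for illustration.
PAGE = (
    "<HTML><TABLE>"
    "<TR><TD><B>HM381MD</B></TD><TD><I>399.95</I></TD></TR>"
    "<TR><TD><B>MD2070</B></TD><TD><I>249.95</I></TD></TR>"
    "</TABLE></HTML>"
)
LABEL = label_for(PAGE, [("HM381MD", "399.95"), ("MD2070", "249.95")])

print(valid_left_price("</B></TD><TD><I>", PAGE, LABEL))
print(valid_left_price(">", PAGE, LABEL))
```

Under this reading a candidate such as ">" is rejected even though it ends the gap, because a search for it would stop at the first ">" and the extracted price would begin too early.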
(1) As described in Figure 15C, a modular heuristic search methodology is used to compose the label oracle (LabelOracle). Its components are called recognizers: one is the item recognizer and the other is the intelligent price recognizer.

(2) Because it would be inefficient for the semantics recognition buyer agent 20 to retrieve the vendor description data every time it receives a request from an online human buyer or user, these descriptions are retrieved from the database (preferably a Microsoft Access database) or from the SQL-compliant database server 22 only the first time the semantics recognition buyer agent 20 requests them for its search-match-extract operation.
The vendor descriptions are then held in memory or cache so that they can be retrieved instantly on later requests by the semantics recognition buyer agent 20.

(3) The vendor descriptions held in memory or cache are refreshed automatically once a day.

(4) The system of the present invention creates multiple threads and concurrently dispatches several semantics recognition learner agents to contact the various designated online vendors over the World Wide Web. This multithreading methodology is preferably built on DCOM, provided by Microsoft Corporation. Each semantics recognition buyer agent 20 intelligently fills in the vendor's search form with the product information supplied by the human buyer or user and virtually presses "Enter".

(5) In addition, the semantics recognition buyer agent 20 of the present invention addresses the heavy network traffic on the World Wide Web that currently burdens the whole process of a shopper's or buyer's online purchase, by speeding up vendor response time and, as enabled by multithreading, allocating the returned search result pages to separate memory locations.

Obtaining a training page involves issuing an example query to a vendor website. For example, Figure 12 shows the appearance of an example page returned by a query to a website such as http://www.800.com.

The algorithm by which the recognizers, built on the modular heuristic search methodology, make up the LabelOracle will now be described in more detail. A recognizer finds the instances of a particular attribute on a page. For example, given the example page of Figure 12, an item recognizer finds all of the items contained in the page, such as the products "HM381MD" and "MD203". A recognizer must be smart enough to discard noise.

For example, again given the example of Figure 12, the intelligent price recognizer can distinguish "price 1" from "price 2", which might be, for example, the "list price" and "your price" respectively.
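Points (2) to (4) above, loading each vendor description once, refreshing it daily, and querying several vendors concurrently, can be sketched in Python. This is a sketch under assumptions and not the described implementation: standard-library threads stand in for the DCOM-based multithreading, entries are refreshed lazily on the first access after a day rather than by a scheduled job, and the class, function and parameter names (`VendorDescriptionCache`, `query_vendors`, `load`, `search`) are hypothetical.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Tuple

ONE_DAY = 24 * 60 * 60  # refresh interval in seconds


class VendorDescriptionCache:
    """Keeps vendor descriptions in memory and reloads entries older than a day."""

    def __init__(self, load: Callable[[str], dict],
                 clock: Callable[[], float] = time.time,
                 max_age: float = ONE_DAY) -> None:
        self._load = load            # reads one description from the database
        self._clock = clock
        self._max_age = max_age
        self._entries: Dict[str, Tuple[float, dict]] = {}
        self._lock = threading.Lock()

    def get(self, vendor: str) -> dict:
        with self._lock:
            now = self._clock()
            entry = self._entries.get(vendor)
            if entry is None or now - entry[0] >= self._max_age:
                entry = (now, self._load(vendor))   # first request or stale entry
                self._entries[vendor] = entry
            return entry[1]


def query_vendors(vendors: List[str], product: str,
                  cache: VendorDescriptionCache,
                  search: Callable[[str, dict, str], str]) -> Dict[str, str]:
    """Query every vendor concurrently, one worker thread per vendor."""
    def one(vendor: str) -> Tuple[str, str]:
        return vendor, search(vendor, cache.get(vendor), product)

    with ThreadPoolExecutor(max_workers=max(1, len(vendors))) as pool:
        return dict(pool.map(one, vendors))
```

Here `load` and `search` are supplied by the caller, so the sketch makes no claim about how the database is read or how a vendor's search form is submitted.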
The recognized instances are then corroborated to label the whole page. For example, given a recognizer for "item" and another recognizer for "price", corroboration yields a LabelOracle that labels pages with these attribute pairs.

If all items were known in advance, recognizing items would be a simple pattern-matching problem. However, this would require a very large list of item names and model numbers, which is not feasible. Moreover, maintaining such a large database is costly. Guaranteeing that such a list of item names and model numbers is complete and up to date is therefore impractical.

Fortunately, vendors try to create a sense of identity by using a uniform look for all of their products. For example, a vendor presents MiniDisc (MD) product information in the same format as DVD products. Taking advantage of this regularity, it is assumed that every product is described in the same format.

The present invention learns a wrapper from examples in one specific domain only, and attempts to apply it to all other domains, including those in foreign languages, that are organized in a globally consistent format on the Internet. In the preferred embodiment, the training examples originate exclusively from one domain, such as the MD domain of a vendor website. As a result, the item recognizer only has to recognize products of that specific domain, such as MD. In this way it is feasible to maintain a complete and up-to-date list of item names for the specific domain.
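The way two recognizers could be combined to label a page can be sketched in Python. Everything here is an illustrative assumption rather than the LabelOracle of Figure 15C: the item recognizer matches a small, hypothetical list of MD model names, the price recognizer uses one heuristic (a dollar sign followed by a decimal number), and when several prices follow an item the one preceded by the keyword "your price" is preferred. The page and its prices are invented.

```python
import re
from typing import List, Tuple

Span = Tuple[int, int]   # inclusive character indices

KNOWN_MD_MODELS = ["HM381MD", "MD2070"]   # hypothetical domain-specific name list
PRICE_RE = re.compile(r"\$\s*(\d+(?:,\d{3})*(?:\.\d{2})?)")


def recognize_items(page: str, names: List[str]) -> List[Span]:
    """Item recognizer: find every occurrence of a known model name."""
    spans = []
    for name in names:
        for m in re.finditer(re.escape(name), page):
            spans.append((m.start(), m.end() - 1))
    return sorted(spans)


def recognize_prices(page: str) -> List[Span]:
    """Price recognizer: a dollar sign followed by a decimal number."""
    return [(m.start(1), m.end(1) - 1) for m in PRICE_RE.finditer(page)]


def label_page(page: str, names: List[str],
               keyword: str = "your price") -> List[Tuple[Span, Span]]:
    """Pair each item with one of the prices found before the next item."""
    items = recognize_items(page, names)
    prices = recognize_prices(page)
    label = []
    for n, (ib, ie) in enumerate(items):
        limit = items[n + 1][0] if n + 1 < len(items) else len(page)
        nearby = [p for p in prices if ie < p[0] < limit]
        if not nearby:
            continue                          # no price recognized for this item
        chosen, prev_end = nearby[0], ie
        for pb, pe in nearby:
            # Prefer the price whose preceding text carries the keyword.
            if keyword in page[prev_end + 1:pb].lower():
                chosen = (pb, pe)
                break
            prev_end = pe
        label.append(((ib, ie), chosen))
    return label


# Hypothetical result page with two prices for the first item.
PAGE = ("<B>HM381MD</B> List price: $449.95 Your price: $399.95 "
        "<B>MD2070</B> Your price: $249.95")

for (ib, ie), (pb, pe) in label_page(PAGE, KNOWN_MD_MODELS):
    print(PAGE[ib:ie + 1], PAGE[pb:pe + 1])
```

The resulting index pairs have the same shape as the labels used for wrapper induction, so a page labeled this way could serve as a training example.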
The present invention recognizes "price" by invoking a modular heuristic search. For example, a price always follows a dollar sign ($), and a price is usually a floating-point number. If more than one price is found for an item, keywords such as "your price", "our price", "list price" and "original price" are extracted as well so that the prices can be told apart.

As briefly described earlier, the mechanism by which the semantics recognition buyer agent 20 operates is shown in Figures 14 and 15A-15C. The control flow comprises the eight steps marked in the figures:

Step 1:

When a user determines that a particular product or service is needed, instead of manually browsing different multilingual vendor websites on the World Wide Web one by one in search of product information and prices, the present invention provides a portal through which the request for product information is entered once, via an interactive agent character graphical user interface (IACGUI), commonly known as the interactive agent character shopper/buyer interface, to achieve the same purpose with better, faster and more reliable results.

The resulting product description is stored in the member variable m_ProdDesc of the SRBA 20. The search also lets the user customize how the agent operates through an "Advanced Search" function, which offers optional parameters such as selected vendors, time limit, price range, manufacturer and keywords.

Step 2:

Suppose, for example, that the online buyer's or user's platform runs the Windows 98 operating system in language version "B" (or, preferably, runs Windows 2000 in the English version and/or has the personal Web language version "B" installed). Microsoft Internet Explorer will prompt the user to download the language "B" display software after the user logs on to the platform of the present invention. When online buyer or user "A" types a product model, in characters of local language "B", as a keyword into the text box provided by the portal of the present invention, the semantics recognition buyer agent 20 of Figure 2 performs data extraction using the previously described example data (retrieved earlier into the vendor descriptions after real-time wrapper induction learning), which contain local character strings of language "B". These vendor descriptions are stored in a predefined data structure in the vendor description queue in the database 24 (preferably a Microsoft Access database). The data extraction involves a temporary search for "hits", which contain the price, description and related product information from previous search results, so that the correct local character strings of language "B" typed by the user, as shown at step 312 of Figure 15C, reside in the memory or cache of the server 22. Because the character strings of language "B" are specific to that local language, any "hit" found will come from an identified vendor website that uses local language "B" and character strings of language "B".
Recall from step 7 of Fig. 14 that, during the learning process, after the semantic recognition learner agent 18 of Fig. 2 has learned the wrappers from the online vendors, the vendor descriptions are stored in the vendor description queue 28 in the offline database 24 (preferably a Microsoft Access database) or in the database server 22 (preferably an SQL-compliant Microsoft database server). Because it would be inefficient for the SRBA 20 to retrieve the vendor description data every time it receives a request from an online human buyer or user, these descriptions are retrieved from the database (preferably a Microsoft Access database) or from the SQL-compliant database server 22 only the first time the SRBA 20 requests them for its desired set of search-match-extract operations. The vendor descriptions are then stored in memory or cache memory so that they can be retrieved more quickly when the SRBA 20 later requests them for the same or a new user.

The vendor descriptions in memory or cache memory are updated automatically once a day.

Step 3:
Using the retrieved vendor descriptions, the system of the present invention creates multiple threads and simultaneously dispatches several semantic recognition learner agents to contact the designated online vendors over the World Wide Web.

Step 4:
This multithreading methodology is preferably built on DCOM, provided by Microsoft Corporation. Each SRBA 20 intelligently fills in a vendor's search form with the product information supplied by the human buyer or user and virtually presses "Enter".

Step 5:
Each vendor then returns a search result web page containing information on the requested product or an error message.

Steps 6 and 7:
The search result web pages are returned to the SRBA 20 over the World Wide Web. Note that several result pages may arrive at the SRBA 20 at the same time. The SRBA 20 of the present invention addresses the heavy network traffic on the World Wide Web that currently overwhelms the entire online purchasing process for shoppers/buyers. It does so by speeding up vendor response time and, as enabled by multithreading, by assigning the returned search result pages to separate memory locations.

Step 8:
The SRBA 20 analyzes the returned pages according to the corresponding vendor descriptions. The relevant information and data are extracted from the returned pages and, once all search result pages have arrived or the search has timed out, displayed in the output format shown in Fig. 15B.

Referring to Fig. 28, the user/buyer communicates with server 22 to execute, as shown, the in-process DLL file (NextGen.dll) on the server machine 22 through an Active Server Page (ASP) file (NextGen.asp).

Preferably, developing the SRBA 20 as an ActiveX component yields several benefits. First, overall performance can be improved. Writing the SRBA 20 in Visual C++ makes the agent robust and makes the powerful features of ActiveX components available. No workaround has to be supplied in HTML and scripting code to meet the needs of the application. With an ActiveX component, the agent can be invoked by adding a few lines of code to the HTML file on the client side, while all complex processing is left to the server side.

Second, an ActiveX component provides reusability across applications, instead of duplicating similar functionality in every application module.
An ActiveX component can be created so that it is accessible to all Active Server Pages. In other words, not all of the logic has to be coded in the ASP modules, which eliminates redundancy in the application. Although the SRBA is created within a single application, this does not hinder its ability to integrate with other applications as well. In addition, this feature can help reduce development time significantly.
Third, linking the ASP component to DLL (dynamic link library) files is beneficial because these files are compiled and linked independently. Updating the ASP component requires no additional recompiling and relinking. Speed improvements or new features in a later version can therefore benefit the ActiveX component that uses the DLL. In addition, a DLL can reduce memory and disk space requirements by sharing a single copy of common code and resources among several modules.

If several components use the same statically linked library, several identical copies of that library must be stored and executed. If they run at the same time, there will be several identical copies in memory. Using statically linked libraries therefore clearly results in redundancy and wasted space.

If a DLL is used instead of a statically linked library, only one copy of the code and resources is needed.
Because many simultaneous connections arrive from the Internet, this keeps the load on the server to a minimum.

The SRBA 20 is preferably an ActiveX component developed as an in-process DLL. It allows a user to create an SRBA object over the World Wide Web. For communication between the user and the server, ASP acts as the gateway between them.

ASP is an open application environment in which HTML pages, scripts, and ActiveX components are combined to create Web-based applications. It is built as an Internet Server Application Programming Interface (ISAPI) application that runs on Microsoft's Internet Information Server (IIS) product or on an IIS-related web server.

To implement ASP, Microsoft ActiveX Scripting is used, such as the Visual Basic (VB) scripting used in managing ActiveX components. It makes the language dynamic by adding the ability to invoke ActiveX components that execute on the server as DLLs.

The program logic for creating an SRBA object is as follows: when the user starts searching for the price of a desired product, an object of the SRBA 20 is created.

In the Active Server Page, the module is coded in pseudocode as follows:

    Start sub session
        Set agent = Server.CreateObject("NextGen.")
    End sub

When the page is loaded, the module above creates an object of the SRBA component 20. NextGen is the name of the ActiveX component, and the semantic recognition buyer agent is the name of the agent within the NextGen component.

After the SRBA 20 has been created, a connection is established between the user and the server, as shown in Fig. 29.
The SRBA uses a connectable object to maintain a "one-to-one" connection (channel) through which the user communicates with the server, for example to send a user request to the server to compare prices. An outgoing interface is used by the SRBA 20 as the connection (channel) through which the server communicates with the user, for example to send a response that returns the requested search results to the user.
The user can access the properties and invoke the methods of the SRBA through IConnectionPoint.

The SRBA 20 uses the methods set out below:

1. OnStartPage(Unknown Agent)
This method initializes the SRBA object. It is called automatically when the ASP page is loaded.

2. OnEndPage()
This method terminates the SRBA object. It is called automatically when the ASP page is unloaded.

3. GetSearch(BSTR input, BSTR *output)
This method searches the Internet for the price of the desired product after the user has supplied a product description such as a model number. input is the user's product description, and output is the output of the search result page. The calling syntax for this method is:

    OutputName = AdObjectName.GetSearch("Product Name")

In the code above, AdObjectName is the instance name of the object, "Product Name" is the name of the product whose prices the buyer/user agent is to compare, and OutputName is the variable that receives the returned value. The following pseudocode is an example:

    result = Agent.GetSearch("Radar detector")

Program logic on the server side

Registering the component

Before the user can start an object of the SRBA, its component must be registered on the server with the following command:

    Register path\Nextgen.dll

where path is the absolute path where Nextgen.dll is stored.

When the SRBA object calls the GetSearch method through IConnectionPoint, the instance of the SRBA on the server machine executes the dynamic link library (DLL). See Fig. 30.

To connect to the SQL server through ODBC, a data source name (DSN), a user ID, and a password must be supplied. RETCODE is the type of the variable that stores the value returned by the SQL server.
SQL_SUCCESS indicates a successful retrieval.

    CString sDSN = "NextGen";
    CString sID = "NextGen";
    CString sPassword = "NextGen";
    RETCODE rc;
    rc = SQLConnect(hdbc, sDSN, SQL_NTS, sID, SQL_NTS, sPassword, SQL_NTS);
    if (rc == SQL_SUCCESS)
    {
    }
    else
    {
        ... error handling ...
    }

Executing the SQL query:

Before the desired learner interface can be retrieved from the SQL server, a specific query is required.

    // allocate a statement handle
    SQLAllocStmt(hdbc, &hstmt);
    // query statement; model is the model number supplied by the user
    CString sQuery = "select * from tbl_electronic where model = '" + model + "'";
    rc = SQLExecDirect(hstmt, sQuery, SQL_NTS);
    if (rc == SQL_SUCCESS)
    {
        ... fetch fields ...
    }
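The query above splices the user-supplied model number directly into the SQL text. The following is a minimal sketch, in standard C++ rather than MFC, of one way to quote that value before it is concatenated. The helper names quote_sql_literal and build_model_query are hypothetical and do not come from the source, and the sketch assumes that the garbled query selects all columns of tbl_electronic. Binding the value as a statement parameter (for example with ODBC's SQLBindParameter) is generally the more robust option.

```cpp
#include <string>

// Hypothetical helper (not from the source): wraps a value in single quotes
// and doubles every embedded single quote, as standard SQL string literals require.
std::string quote_sql_literal(const std::string& value) {
    std::string quoted = "'";
    for (char c : value) {
        if (c == '\'') {
            quoted += "''";
        } else {
            quoted += c;
        }
    }
    quoted += "'";
    return quoted;
}

// Builds the lookup query for one model number. The table and column names
// are the ones that appear in the query text of the description.
std::string build_model_query(const std::string& model) {
    return "select * from tbl_electronic where model = " + quote_sql_literal(model);
}
```

With this helper, a model number that happens to contain a quote character cannot terminate the string literal early and change the meaning of the query.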
Fetching fields:

After the query has been executed, the vendor descriptions are stored in an array called vendor_description. Each element of this array has two member variables: a wrapper and a vendor URL.

    rc = SQLGetData(hstmt, column_number, data_type, sWrapper, sizeof(sWrapper));
    if (rc == SQL_SUCCESS)
        vendor_description[i].wrapper = sWrapper;
    rc = SQLGetData(hstmt, column_number, data_type, sURL, sizeof(sURL));
    if (rc == SQL_SUCCESS)
        vendor_description[i].url = sURL;

Filling in the forms:

If there are N vendor descriptions, the buyer agent starts N threads to fill in the form of each vendor specified in the vendor descriptions.

The syntax for launching each thread is:

    // nNoVendor is the number of vendor descriptions
    int nNoVendor = no_of_vendor_description;
    for (int nCount = 0; nCount < nNoVendor; nCount++)
    {
        // ThreadFillForm is the class that performs the form filling
        CWinThread* pThread;
        pThread = AfxBeginThread(RUNTIME_CLASS(ThreadFillForm), ...);
    }

For each thread, the time limit is preferably about 5 seconds.
If a vendor does not return a result within 5 seconds, that vendor is dropped for this search; otherwise its result is stored in memory for the next stage of processing.

When the user enters the keywords of a purchase request in the box provided on the portal of the present invention, it is determined whether there are any relevant vendor descriptions, that is, vendor descriptions containing the keywords. All relevant vendor descriptions, comprising wrappers and URLs, are then retrieved from the offline database. Thereafter, the SRBA proceeds in parallel to the searchable index of each online vendor, fills it in,
and submits it to the vendor's website. To do this at the vendor website, the buyer agent calls a member function, httpPost. The httpPost member function posts the URL and form data to a vendor according to the vendor description and delivers the HTML response in a string variable. It returns a Boolean value, where true indicates successful retrieval of the HTML document and false indicates that an error occurred. If the return value is true, the resulting item name and price are extracted from the HTML document. The flow for posting the form is shown in Fig. 31.

In step 1002, a CInternetSession object is created for the session.
The CInternetSession class represents an Internet session connected to a server. Typically this class is used early in a session to establish the connection to the web server.

In step 1004, a CHttpConnection object is created by calling the GetHttpConnection member function of the CInternetSession object. The CHttpConnection class establishes an HTTP connection with a server.

In step 1006, a CHttpFile object is created by calling the OpenRequest member function of the CHttpConnection object. The CHttpFile class allows files transferred over the Internet to be handled as if they were local disk files. It works with the CHttpConnection object to read and write Internet data.

Step 1008 calls the SendRequest member function of the CHttpFile object to send the POST request and form data to the remote HTTP server.

Steps 1010, 1012, and 1014 repeatedly call the Read member function of the CHttpFile object, which returns the response data to the program in blocks. When Read returns 0, no data remains to be retrieved.

Extracting the price:

After the result pages have been obtained, the SRBA 20 matches each result page against a generalized failure template. If a page does not match the template, the search is assumed to have succeeded. The buyer agent 20 then uses the wrapper for the corresponding vendor to strip the header and trailer from the successful page. For example, suppose that a user searches for an MD product with model number MD203, the given wrapper is {7, <B>, </B>, <I>, </TABLE>}, and the result page is as shown below:

<HTML>
<TITLE>A Simple Product Catalogs</TITLE>
<BODY>
<H2>MD Price</H2>
<TABLE BORDER=1>
<TR BGCOLOR=ORANGE>
<TH>ModelNumber</TH>
<TH>PRICE(US$)</TH>
</TR>
<TR><TD><B>HM381MD</B></TD><TD><I>399.95</I></TD></TR>
<TR><TD><B>MD2070</B></TD><TD><I>599.95</I></TD></TR>
<TR><TD><B>MD203</B></TD><TD><I>249.95</I></TD></TR>
<TR><TD><B>MDR3</B></TD><TD><I>399.95</I>
</TD></TR>
</TABLE>
<HR WIDTH=200 ALIGN=LEFT>
</BODY>
</TABLE>

According to the wrapper, the useful information starts at line 7 and ends at </TABLE>, so the SRBA 20 cuts away the useless information before extracting the model number and price. After the header and trailer information has been stripped, the HTML file reads:

Number</TH><TH>PRICE(US$)</TH></TR>
<TR><TD><B>HM381MD</B></TD><TD><I>399.95</I></TD></TR>
<TR><TD><B>MD2070</B></TD><TD><I>599.95</I></TD></TR>
<TR><TD><B>MD203</B></TD><TD><I>249.95</I></TD></TR>
<TR><TD><B>MDR3</B></TD><TD><I>399.95</I></TD></TR>

The SRBA 20 then uses pattern matching to extract the model number and price of the product. In the wrapper, the pattern for the model number is <B>*</B> and the pattern for the price is <I>#</I>, where * stands for the model number and # stands for the price. The agent first extracts the model number HM381MD and compares it with the model number requested by the user, "MD203". Since they do not match, the SRBA 20 looks for the next model number, and so on until it finds model number MD203.
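The model matching described above, followed by taking the first price pattern after the matched model, can be sketched in standard C++ as follows. This is a minimal illustration, not the code from the program appendix: WrapperPatterns, next_between, and find_price_for_model are hypothetical names, and the sketch assumes the header and trailer have already been stripped from the page.

```cpp
#include <string>

// Hypothetical holder for the tag pairs named in a vendor's wrapper.
struct WrapperPatterns {
    std::string model_open, model_close;  // for example "<B>" and "</B>"
    std::string price_open, price_close;  // for example "<I>" and "</I>"
};

// Finds the next "open ... close" span at or after `from`. On success it
// stores the enclosed text in `out`, moves `from` past the closing tag,
// and returns true.
bool next_between(const std::string& page, const std::string& open,
                  const std::string& close, std::size_t& from, std::string& out) {
    std::size_t begin = page.find(open, from);
    if (begin == std::string::npos) return false;
    begin += open.size();
    const std::size_t end = page.find(close, begin);
    if (end == std::string::npos) return false;
    out = page.substr(begin, end - begin);
    from = end + close.size();
    return true;
}

// Walks the model patterns in page order until one equals `wanted_model`,
// then returns the first price pattern that follows it.
bool find_price_for_model(const std::string& page, const WrapperPatterns& w,
                          const std::string& wanted_model, std::string& price) {
    std::size_t pos = 0;
    std::string model;
    while (next_between(page, w.model_open, w.model_close, pos, model)) {
        if (model == wanted_model) {
            return next_between(page, w.price_open, w.price_close, pos, price);
        }
    }
    return false;  // the requested model does not appear on this page
}
```

Applied to the stripped page above with the patterns <B>*</B> and <I>#</I>, a search for MD203 skips HM381MD and MD2070 and stops at the MD203 row.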
Once the model number has been found, the SRBA 20 uses the price pattern to extract the first price that follows that model number. When the model number and price have been extracted, the SRBA 20 stops extracting information from the page and places the information in an array called array_Item[].

array_Item[] is data shared by the N threads, and all threads can access this member variable. There is a risk that more than one thread accesses array_Item[] at the same time, which causes an access violation. To keep this shared data in a consistent state, a critical section is used to prevent threads from modifying the data at the same time. It is declared as:

    CCriticalSection m_csDoor;

Before an element is inserted into array_Item, the following line is added:

    m_csDoor.Lock();

This begins the critical section. All variables inside the critical section are locked to prevent other threads from accessing them. After the insertion is complete, the following line is added:

    m_csDoor.Unlock();

This marks the end of the critical section. All locked variables are unlocked so that other threads may access the member variable. In this way, the member variable array_Item can be shared safely by all threads.
Within a given time period, the array sort_item, which stores the product prices, is sorted using the quicksort method.

The quicksort method can be implemented as follows:

    IF left < right THEN
    BEGIN
        pivot := partition(list, left, right);
        Quicksort(list, left, pivot-1);
        Quicksort(list, pivot+1, right);
    END

A "key" value of the structure is selected to be positioned in each iteration of the code. The function then scans the entire structure in both directions. Values smaller than the key are moved to the left of the structure, and larger values are moved to the right. These "left-to-right" and "right-to-left" scans and swaps continue until a flag condition tells them to stop.
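The pseudocode and the two-direction scan described above can be written out in standard C++ as the following sketch. The choice of the leftmost element as the key is an assumption made for illustration; the description does not say how the key is selected.

```cpp
#include <utility>
#include <vector>

// Places the key (assumed here to be the leftmost element) at its final
// position and returns that position. Smaller or equal values end up on
// its left, larger values on its right.
int partition(std::vector<double>& list, int left, int right) {
    const double key = list[left];
    int i = left + 1;  // left-to-right scan position
    int j = right;     // right-to-left scan position
    while (true) {
        while (i <= j && list[i] <= key) ++i;  // skip values already on the correct side
        while (i <= j && list[j] > key) --j;
        if (i > j) break;                      // the scans have crossed: stop
        std::swap(list[i], list[j]);
        ++i;
        --j;
    }
    std::swap(list[left], list[j]);            // drop the key between the two halves
    return j;
}

// Sorts list[left..right] in ascending order, mirroring the pseudocode.
void quicksort(std::vector<double>& list, int left, int right) {
    if (left < right) {
        const int pivot = partition(list, left, right);
        quicksort(list, left, pivot - 1);
        quicksort(list, pivot + 1, right);
    }
}
```

A call such as quicksort(prices, 0, static_cast<int>(prices.size()) - 1) sorts the whole array. With the leftmost element as the key, input that is already sorted takes quadratic time, so std::sort from the standard library is usually the more practical choice.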
An HTML file is returned to the user. It is stored in the member variable m_output and displays the sorted results of the search performed by the SRBA 20.

    // string used to send content to the browser
    #define HTTP_HEADER "Content-type:text/html\n\n"
    // code that sends the output to the browser

Submitted with this application on the aforementioned compact disc is a computer program appendix, which provides source code sections implementing selected features of the present invention. Specifically, in the part labeled "3.1 Learning Phase", the source code is provided as "3.1.1 Main COOSA Application Class", the main class file for the COOSA application; "3.1.2 Add Vendor Class", which adds a vendor category to the database; "3.1.3 COOSADoc Class", which invokes the display of documents and views for the semantic recognition learner agent; "3.1.4 COOSA View Class", the view of the learner interface and its functions; "3.1.5 Training Data Class", which invokes the semantic recognition learner agent; and "3.1.6 Vendor Class", which declares the tagging rules for processing all vendor web pages. In the part labeled "Buying Phase", the source code is provided as "3.1.1 Agent Class", which declares the semantic recognition buyer agent, and "3.2.2 Thread Processing", the processing part of the semantic recognition buyer agent.

Referring now to Figs. 32 to 39, the GUI, or interactive agent character shopper/buyer interface, used with the present invention will be described in detail. Fig. 32 shows the "Main Menu" screen of the GUI, or interactive agent character shopper/buyer interface (IACS/BI), for the present invention. Note that the product selection "Channels" (categories) are provided in the upper right corner of this "Main Menu" screen. "Quick Search" is also provided on the left side of the screen.
Below it is a box in which a self-typing animation shows the online human user how to use the Quick Search option. The left panel of the screen also provides a set of boxes for members to log in as temporary trial members or lifetime members. (Most of the portal functions of the present invention are disabled until the user has been authenticated.) In the lower left corner is a set of links to online vendors that have registered with the portal of the present invention, and on the right is a large message box labeled "Feedback", provided so that the user can enter comments to be sent by e-mail to an e-mail server, preferably one running Microsoft's Outlook Express.

Fig. 33 is a simplified view of a screen of the GUI or shopper/buyer interface used with the present invention, in which companies are displayed in response to the "Government to Business" text icon having been clicked by the online buyer/user on a previous screen (not shown). Note, however, that this screen is not functional, because these companies, the so-called government-to-business e-commerce service or platform providers, strictly restrict members' access to the databases of their web servers by incorporating authentication security interfaces throughout their closed computer network environments.
Fig. 34 is a simplified view of a screen of the GUI or shopper/buyer interface used with the present invention, in which details are provided for a company selected from the choices offered after the user has clicked the "Advanced Search" option on the screen of Fig. 33. Note that on this screen, the heading in the panel frame below the tabs for the five domains reads "Advanced agent is on!". In addition, at the bottom of the screen the user is given a dialog box that can be filled in to perform a search using the semantic recognition buyer agent function provided by the present invention. Again, however, note that this screen is not functional, because these companies, the so-called government-to-business e-commerce service or platform providers, strictly restrict members' access to the databases of their web servers by incorporating authentication security interfaces throughout their closed computer network environments.

Fig. 35 is a simplified view of a screen of the GUI or shopper/buyer interface used with the present invention, in which companies are displayed in response to the "Business to Business" text icon having been clicked by the online buyer/user on a previous screen (not shown).
Note, however, that this screen is not functional, because these companies, the so-called business-to-business e-commerce service or platform providers, strictly restrict members' access to the databases of their web servers by incorporating authentication security interfaces throughout their closed computer network environments.

Fig. 36 is a simplified view of a screen of the GUI or shopper/buyer interface used with the present invention, in which details are provided for a company selected from the choices offered after the user has clicked the "Advanced Search" option on the screen of Fig. 35.
Figure 37 is a simplified diagram of a screen display of the GUI, or shopper/buyer interface, used by the present invention, in which the selected items and their descriptions are displayed in response to the user selecting the "Domain A" tab.
Figure 38 is a simplified diagram of a screen display of the GUI, or shopper/buyer interface, used by the present invention, in which the items for sale in Domain A are listed in response to the user having clicked "Advanced Search" on the screen of Figure 37.
Figure 39 is a simplified diagram of a screen display of the GUI, or shopper/buyer interface, used by the present invention, in which details are provided of the results of a search performed using the semantics-discriminating buyer agent feature of the invention. The shopper/buyer agent responds to a search request that the user submits through the search-parameter panel displayed at the bottom of the screen of Figure 38.
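The search flow described for Figures 38 and 39 can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the `Offer` record, the `search` function, the catalog contents, and the assumption that prices have already been converted to a single currency are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Offer:
    vendor: str   # hypothetical vendor identifier
    domain: str   # e.g. "A" for the "Domain A" tab
    item: str     # item description as listed by the vendor
    price: float  # assumed to be already converted to one common currency


def search(offers, domain, keyword, max_price=None):
    """Return offers in `domain` whose item text contains `keyword`, cheapest first."""
    keyword = keyword.casefold()
    hits = [
        o for o in offers
        if o.domain == domain
        and keyword in o.item.casefold()
        and (max_price is None or o.price <= max_price)
    ]
    return sorted(hits, key=lambda o: o.price)


# Hypothetical catalog standing in for data gathered from online vendors.
CATALOG = [
    Offer("vendor-1", "A", "Digital Camera X", 310.0),
    Offer("vendor-2", "A", "digital camera X", 295.5),
    Offer("vendor-3", "B", "Digital Camera X", 280.0),
    Offer("vendor-4", "A", "Tripod", 40.0),
]

# Search parameters as a user might enter them in the dialog box.
results = search(CATALOG, domain="A", keyword="camera", max_price=300)
for offer in results:
    print(offer.vendor, offer.item, offer.price)
```

Sorting by price after filtering keeps the comparison step separate from the matching step, so a different matching rule could be substituted without touching the ranking.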
It will be further understood that, although the present invention has been described with reference to the Internet and the World Wide Web, it is equally applicable to recently introduced systems and to next-generation systems. For example, the wireless application development tool J2ME (Java 2 Micro Edition) can be used to bring the online intelligent, multilingual, domain-independent price comparison capability to mobile/wireless platforms, including 3G or Web phones, interactive and Ultimate TV, Pocket PCs, palm-size electronic organizers, Web-enabled all-in-one handheld devices, wireless notebooks, and the like, so that a myriad of products and multilingual value-added enterprise Web services can be delivered seamlessly on the home page to mobile workers and netizens around the clock, 24 hours a day, 7 days a week, 365 days a year (24/7/365).
Furthermore, the present invention can be used to deliver, over wired and mobile/wireless platforms, a variety of products and multilingual value-added enterprise Web services with such features and functions as price comparison, e-wallet integration, and negotiation-capable inter-agent communication (agent-to-agent (A-to-A) contract negotiation, a real-world simulation capability) to multiple e-commerce segments, including consumer-to-business, consumer-to-consumer, and business-to-business auctions, government-to-business transactions, and so on. These A-to-A, or A-commerce, activities will be constructed and launched in a timely manner on the Global Ensemble Marketplace Framework through a dynamic combination of keyboard, mouse, and pointing device.
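The text names agent-to-agent (A-to-A) contract negotiation without specifying a protocol. As one hedged illustration only, the sketch below uses a simple alternating-concession scheme, which is an assumption made here and not a method taken from the patent; the function name `negotiate`, its parameters, and the fixed concession step are all hypothetical.

```python
def negotiate(buyer_start, buyer_limit, seller_start, seller_limit,
              step=5.0, max_rounds=50):
    """Hypothetical alternating-concession negotiation between two agents.

    The buyer agent raises its bid and the seller agent lowers its ask by
    `step` each round, never past their private limits. Returns the agreed
    price, or None if the limits do not overlap.
    """
    bid, ask = buyer_start, seller_start
    for _ in range(max_rounds):
        if bid >= ask:
            # Offers have met or crossed: settle at the midpoint.
            return round((bid + ask) / 2, 2)
        new_bid = min(bid + step, buyer_limit)
        new_ask = max(ask - step, seller_limit)
        if new_bid == bid and new_ask == ask:
            # Both agents are at their limits and still apart: no deal.
            return None
        bid, ask = new_bid, new_ask
    return None


deal = negotiate(buyer_start=80, buyer_limit=100,
                 seller_start=120, seller_limit=90)
no_deal = negotiate(buyer_start=80, buyer_limit=90,
                    seller_start=120, seller_limit=100)
print(deal, no_deal)
```

A fixed step keeps the example short; a real agent would more plausibly adapt its concessions to the other side's behavior and to the price comparison data it has gathered.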
The terms and expressions employed herein are used as terms of description and not of limitation. Their use is not intended to exclude the features shown and described, or portions thereof, and it is recognized that various modifications are possible within the scope of the claims of the present invention.
Reference numeral table
Reference numeral | Name |
---|---|
10 | Agent |
12 | User/buyer |
14 | Online vendor |
16 | World Wide Web/Internet |
18 | Learner agent |
20 | Shopper agent |
22 | Server |
24 | Offline database |
26 | System administrator |
27 | Vendor queue |
28 | Vendor description queue |
100 | Flowchart |
110, 120, 130, 140, 150 | Step |
210, 220, 230, 240, 250 | Step |
300, 310, 312, 320, 330, 340, 350, 358, 360, 370, 380, 390 | Step |
1002, 1004, 1006, 1008, 1010, 1012, 1014 | Step |
Claims (1)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US23657400P | 2000-09-29 | 2000-09-29 |
Publications (1)
Publication Number | Publication Date |
---|---|
TWI280488B (en) | 2007-05-01 |
Family
ID=38742514
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW90124089A TWI280488B (en) | 2000-09-29 | 2001-09-28 | Online intelligent information comparison agent of multilingual electronic data sources over inter-connected computer networks |
Country Status (1)
Country | Link |
---|---|
TW (1) | TWI280488B (en) |
- 2001-09-28 TW TW90124089A TWI280488B: not active (IP right cessation)
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9043409B2 (en) | 2009-06-11 | 2015-05-26 | Qualcomm Incorporated | Methods and apparatus for a plug-in model for publishing structured meta-data based discovery |
US10275603B2 (en) | 2009-11-16 | 2019-04-30 | Microsoft Technology Licensing, Llc | Containerless data for trustworthy computing and data services |
US10348700B2 (en) | 2009-12-15 | 2019-07-09 | Microsoft Technology Licensing, Llc | Verifiable trust for data through wrapper composition |
US10348693B2 (en) | 2009-12-15 | 2019-07-09 | Microsoft Technology Licensing, Llc | Trustworthy extensible markup language for trustworthy computing and data services |
CN113011081A (en) * | 2021-02-02 | 2021-06-22 | 电子科技大学 | Unmanned aerial vehicle navigation method based on meta-learning |
CN113011081B (en) * | 2021-02-02 | 2022-03-22 | 电子科技大学 | Unmanned aerial vehicle navigation method based on meta-learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MM4A | Annulment or lapse of patent due to non-payment of fees |