Nothing Special   »   [go: up one dir, main page]

GB2412770A - Method of communicating data over a network - Google Patents

Method of communicating data over a network Download PDF

Info

Publication number
GB2412770A
GB2412770A GB0512440A GB0512440A GB2412770A GB 2412770 A GB2412770 A GB 2412770A GB 0512440 A GB0512440 A GB 0512440A GB 0512440 A GB0512440 A GB 0512440A GB 2412770 A GB2412770 A GB 2412770A
Authority
GB
United Kingdom
Prior art keywords
data
communication network
cache
information system
remote information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB0512440A
Other versions
GB2412770B (en
GB0512440D0 (en
Inventor
Paul Lionel Grabinar
Simon David Wood
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Flyingspark Ltd
Original Assignee
Flyingspark Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Flyingspark Ltd filed Critical Flyingspark Ltd
Priority to GB0512440A priority Critical patent/GB2412770B/en
Priority claimed from GB0212384A external-priority patent/GB2389201B/en
Publication of GB0512440D0 publication Critical patent/GB0512440D0/en
Publication of GB2412770A publication Critical patent/GB2412770A/en
Application granted granted Critical
Publication of GB2412770B publication Critical patent/GB2412770B/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/16Implementation or adaptation of Internet protocol [IP], of transmission control protocol [TCP] or of user datagram protocol [UDP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/2866Architectures; Arrangements
    • H04L67/289Intermediate processing functionally located close to the data consumer application, e.g. in same machine, in same home or in same sub-network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services
    • H04L67/568Storing data temporarily at an intermediate stage, e.g. caching
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/16Implementation or adaptation of Internet protocol [IP], of transmission control protocol [TCP] or of user datagram protocol [UDP]
    • H04L69/166IP fragmentation; TCP segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0806Multiuser, multiprocessor or multiprocessing cache systems
    • G06F12/0813Multiuser, multiprocessor or multiprocessing cache systems with a network or matrix configuration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • G06F16/972Access to data in other repository systems, e.g. legacy data or dynamic Web page generation
    • G06F17/30893
    • G06F17/30899

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Security & Cryptography (AREA)
  • Communication Control (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

A communication network (200) comprises a local information processing device (235) and a remote information system (240) that operate a transport protocol and both include a processor to perform one or more of the following data processing functions to enable data to be transmitted using the transport protocol: wrap a data block in one packet or, if the data block is larger than a largest data packet the transport protocol supports, in multiple packets. If a data block was passed in more than one packet, the data block is re-assembled from its constituent packets; and duplicated data blocks are detected and deleted. Notably, both the local information processing device (235) and the remote information system (240) estimate a likely transmission time for each data packet based on a recent communication network bit rate, and forwards the estimate from one to the other.

Description

24 1 2770 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 1 -
METHODS AND SYSTEM FOR USING CACHES
Field of the Invention
This invention relates to a mechanism for operating caches that store subsets of data and that are connected to a remote information store by a communication system whose performance (i.e. data rate, latency and error rate) varies with time. The invention is applicable to, but not limited to, a cache for use in a portable computer or similar device that can be connected to a corporate information system via a packet data wireless network.
Background of the Invention
Present day communication systems, both wireless and wire-line, have a requirement to transfer data between communication units. Data, in this context, includes many forms of communication such as speech, multimedia, signa]linq communication, etc. Such data communication needs to be effectively and efficiently provided for, in order to optimise use of limited communication resources.
For data to be transferred across data communication networks, a communication unit addressing protocol is required. In this regard, the communication units are generally allocated addresses that can be read by a communication bridge, gateway and/or router, in order to determine how to transfer the data to the addressed unit.
19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 2 - The interconnection between networks is generally known as internetworking (or interned).
Networks are often divided into sub-networks, with protocols being set up to define a set of rules that allow the orderly exchange of information. Two common protocols used to transfer data in communication systems, are: Transmission Control Protocol (TCP) and Internet Protocol (IP). IP corresponds to data transfer in the network layer of the well-known OSI model whereas TOP corresponds to data transfer in the transport layer of the OSI model. Their operation is transparent to the physical and data link layers and can thus be used on any of the standard cabling networks such as Ethernet, FDDI or token ring.
In the field of this invention it is known that an
excessive amount of data traffic routed over a core portion of a data network may lead to a data overload in the network. This may lead to an undesirable, excessive consumption of the communication resource, for example bandwidth in a wireless network. To avoid such overload problems, many caching techniques have been introduced to manage the data traffic on a time basis.
An example of a cache, which may be considered as a local storage element in a distributed communication or computing system, includes network file systems, where data retrieved from a file storage system (e.g. a disk) can be stored in a cache on the computer that is requesting the data.
19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 3 A further example is a database system, where data records retrieved from the database server are stored in a client's cache. Furthermore, web servers are known to cache identified web pages in network servers closer to a typical requesting party. Web clients (browsers) are also known to cache previously retrieved web pages in a store local to the browser. As the information age has continued apace, the benefits and wide-use of caches has substantially increased.
Referring now to FIG. 1, a known data communication system 100 is illustrated that employs the use of a cache to store data locally. A local information processing device 135, such as a personal digital assistant or wireless access protocol (WAP) enabled cellular phone, includes a communication portion 115, operably coupled to a cache 110. The device 135 also includes application software 105 that cooperates with the cache 110 to enable the device 135 to run application software using data stored in, or accessible via, the cache 110. A primary use of the cache 110 is effectively as a localized data store for the local information processing device 135.
The communication portion 115 is used to connect the cache to remote information system 140, accessible over a communication network 155. In this regard, as well as for many other applications, caches are often used to reduce the amount of data that is transferred over the communication network 155. The amount of data transfer 19 0 01- 5 ( FS -1 D4) v f inal. doc 20/06/2005 - 4 is reduced if the data can be stored in the cache 110 on a local information-processing device 135. This arrangement avoids the need for data to be transferred/uploaded to the local information-processing device 135, from a data store 130 in a remote information system 140, over the communication network 155 each time a software application is run.
Furthermore, in general, caches provide a consequent benefit to system performance, as if the data needed by the local information-processing device 135 is already in the cache 110 then the cached data can be processed immediately. This provides a significant time saving when compared to transferring large amounts of data over the communication network 155. In addition, caches improve the communication network's reliability, because if the communication network fails then: (i) The data in the cache 110 is still available, allowing processing in the local information-processing device 135 to continue to the extent possible given the extent of the data in the cache 110; and (ii) The application in the local information- processing device 105 can create new items or modify existing items in the cache, which can then be used to update the remote information system 140.
In the current state of the art, caches store low-level data elements and leave it to the application 105 to re assemble the stored data into a meaningful entity. For example, customer records in a database are stored as 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 5 - rows in the customer table, but addresses are often stored as rows in the address table. In this example, the customer table row has a field that indicates which row in the associated address table is the address for that particular customer. The cache 110 would likely be configured to have the same structure as the database, replicating the table rows that relate to the objects that it holds. The inventors of the present invention have recognized inefficiencies and limitations in organising objects within caches in this manner, as will be detailed later.
Furthermore, the application 105 generally contains considerable business logic (matching that in the data store) to be able to interpret the data elements in the cache 110 and to operate on them correctly.
In addition, the cache 110 must make sure that updates of objects maintain "transactional integrity". This means that if an object comprises rows from three tables, and an operation by the application 105 changes elements in all three rows, then the corresponding three rows in the data server must all be updated before any other application is allowed to access that object. If this transactional integrity is not maintained then objects will contain incorrect data, because some fields will have been updated and others will have not.
Clearly, as the application 105 must therefore contain all of the business logic needed to interpret and maintain consistency of the low level data in the cache, 19 0 01- 5 ( FS -1 D4) v f inal. doc 20/06/2005 - 6 it is complex to build. Furthermore, there is complexity and data integrity implications associated with updating the business logic on the data store. This consumes memory and processing power on the local information processing terminal 135. For portable (battery-operated) computing terminals, this last point is particularly disadvantageous as minimising power and resource consumption is of paramount importance.
Wireless communication systems, where a communication link is dependent upon the surrounding (free space) propagation conditions, are known to be occasionally unreliable. Hence, the need to maintain transactional integrity over unreliable communication networks means that specially designed, complicated protocols are needed. Such protocols need to hold the state of any transaction that is in progress should the local information processing device become disconnected from the communication network for any length of time (for example if a wireless device moves into an area with no radio coverage). Once re-connected the transactions that were in progress must then be completed.
Thus, there exists a need to provide an improved organization of data objects within a cache, wherein the aforementioned problems are substantially alleviated.
In the context of cache usage, it is important to be able to retrieve lists of items as quickly and efficiently as possible. For example, a user may perform a search, for example, to find all customers whose name begins with 19001-5 (FS-1 Do) vfinal.doc 20/06/2005 - 7 - "T". It is important to the user that the retrieval of this data list is performed quickly. Current cache-based applications 105 retrieve these lists by sending a request to a server 125 on a remote information system 140 for the search to be carried out. The server 125 then returns the entire list. Clearly, such lists are also returned for other purposes. These lists often require large amounts of data, the processing of which consumes a lot of power.
Frequently the cache 110 already contains some of the objects that will be returned with the entire retrieved list, following a request. For example, where a list includes all the sales leads for a customer, and this list has previously been downloaded. When asking for all the leads again, the request must be made on the data store 130 as there may have been new leads added since the last find. However, the inventors of the present invention have recognized that even if one or two new leads have been added, most will still exist in the cache and will still be valid. Nevertheless, by requesting all leads from the data store 130, the current list retrieval techniques ignore any data items from the list that already exist in the cache. This inefficiency means that there are unnecessary data transfers over the communication network 155, which further reduce performance and increase costs.
Thus, there also exists a need to provide an improved mechanism for retrieving data objects from within a 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 8 - cache, wherein the aforementioned problems are substantially alleviated.
One benefit of some cache designs is that data items can be created and updated within the cache 110, and only later are new or modified items 'flushed' to the remote information store 140. Examples include network file systems and database systems. Notably, the caches used in web browsers do not have this capability. In order to maintain transactional integrity, once the cache begins to update the remote information system with the changed items, the system does not allow any of those items to be updated in the cache 110 by the using application 105 until all remote updates have been completed.
Locking the cache 110, while updates to the data store are in progress, is acceptable if the update is quick and reliable, for example over a high speed LAN or direct serial connection to a PC. However, if the update is slow and unreliable, as is typically the case over a wireless communication network, then this method can block use of the application 105 for a considerable time.
This restricts the utility of the application 105 to the device user.
Thus, there also exists a need to provide an improved mechanism for updating data objects to a remote information store, wherein the aforementioned problems associated with locking the cache are substantially alleviated.
19 0 01- 5 ( FS -1 D4) v f inal. doc 20/06/2005 _ 9 As indicated, a communications protocol must be run over the communication network to define the information to be retrieved as well as to recover from any network problems. Current cache management communications protocols 145 are designed for wireline networks.
Examples of such protocols include: (i) Server Message Block (SMB), which is the Windows file management protocol, runs over TCP/IP; (ii) Network File System (NFS), which is the UNIX file management protocol, runs over UDP/IP; (iii) Hyper Text Transfer Protocol (HTTP), which is the web page retrieval protocol, runs over TCP/IP; and (iv) Distributed Component Object Model (DCOM), which is a remote method invocation protocol, runs over TCP/IP.
However, if the communication network suffers degradation in service or a total failure, which is a common occurrence in the types of wireless networks that this invention serves, the request for data can often not be satisfied. Current cache management communications protocols (SMB, NFS, HTTP, DCOM eta) do not store the request nor do they re-transmit the request when the network is re-connected. Instead, the application must carry out an extensive recovery procedure, which often results in a further attempt to obtain the data after a suitable pre-defined interval. Unfortunately, this means that the application writer needs to be aware of how the underlying communications system operates and accordingly 19 0 01-5 ( FS -1 D! ) vf inal. doc 20/06/2005 - 10 write the program code needed to effect and manage the re-tries.
If different applications use the same cache or cache structure, then each one must implement the re-try mechanisms. This means that the applications themselves have additional complexity and hence required extra development and test time.
A need therefore also exists for an improved cache management communications protocol wherein the abovementioned disadvantages associated with prior art arrangements may be alleviated.
Statement of Invention
In accordance with a first aspect of the present invention, there is provided a communication network, as claimed in claim 1.
In accordance with a second aspect of the present invention, there is provided a request server, as claimed in Claim 7.
In accordance with a third aspect of the present invention, there is provided a local information processing device, as claimed in claim 8.
In accordance with an fourth aspect of the present invention, there is provided a remote information system, as claimed in Claim 9.
19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 11 Further aspects of the present invention are as claimed in the dependent claims.
The preferred embodiments of the present invention address the following aspects of cache operation and data communication networks. In particular, the inventive concepts described herein find particular applicability in wireless communication systems for connecting portable computing devices having a cache to a remote data source.
The inventive concepts address problems, identified by the inventors, in at least the following areas: (i) Organisation of objects within the cache; (ii) Retrieving lists of data items; (iii) Updating the cache when previous updates are being flushed; and (iv) Cache management communications protocol to update the cache.
Brief Description of the Drawings
FIG. 1 illustrates a known data communication system, whereby data is passed between a local information processing device and a remote information system.
Exemplary embodiments of the present invention will now be described, with reference to the accompanying drawings, in which: FIG. 2 illustrates a functional block diagram of a data communication system, whereby data is passed between a 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 local information processing device and a remote information system, in accordance with a preferred embodiment of the present invention; FIG. 3 illustrates a preferred message sequence chart for retrieving a data list from a cache, in accordance with the preferred embodiment of the present invention; FIG. 4 illustrates a functional block diagram of a cache management communication protocol, in accordance with the preferred embodiment of the present invention; FIG. 5 illustrates the meanings of the terms "message", "block" and "packet" as used within this invention; FIG. 6 shows a flowchart illustrating a method of determining an acceptable re-transmit time, in accordance with the preferred embodiment of the present invention; and ElG. 7 shows a flowchart illustrating a method of determining an acceptable re-transmit time, in accordance with an alternative embodiment of the present invention.
Description of Preferred Embodiments
Referring next to FIG. 2, a functional block diagram 200 of a data communication system is illustrated, in accordance with a preferred embodiment of the present invention. Data is passed between a local information processing device 235 and a remote information system 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 13 240, via a communication network 155. The preferred embodiment of the present invention is described with reference to a wireless communication network, for example one where personal digital assistants (PDAs) communicate over a GPRS wireless network to an information database. However, it is within the contemplation of the invention that the inventive concepts described herein can be applied to any data communication network - wireless or wireline.
Notably, in the preferred embodiment of the present invention, a single data object is used to represent a complete business object rather than part of a business object with external references to the other components of the object. In the context of the present invention, the term 'business object' is used to encompass data objects from say, a complete list of Space Shuttle components to a list of customer details. An example of a business object could be an XML fragment defining a simple customer business object as follows <customer> <name> "Company name" </name> <mailing address liner> "mailing address line 1" </mailing address liner> <mailing address line2> "mailing address line 2" </mailing address line2> 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 14 - <delivery address liner> "delivery address line 1" </delivery address liner> <delivery address line2> "delivery address line 2" </delivery address line2> </customer> Where the tagged items are referred to as "properties".
In accordance with the preferred embodiment of the present invention, the request server 225 has been adapted to contain a logic function 228 that creates each business object from the various tables of data stored within the associated data store 130 in the remote information system 240. This logic function 228 is specific to the data store 130 and/or the structure of the data it contains.
Business objects are then passed between the cache 210 and the request server 225.
If a new object is created, or the properties of an existing object are changed, the cache 210 passes the changed properties back to the request server 225.
Advantageously, in accordance with the preferred embodiment of the present invention, the logic function 228 performs the required updates on the appropriate table rows in the database within the data store 130.
Thus, the application 105 and cache 210 are shielded from 19 0 01- 5 ( FS -1 D4) v f inal. doc 20/06/2005 needing to know anything about how the data is stored on the data store 130. Advantageously, this makes the task of the application writer much easier. Furthermore, by enabling the cache 210 to pass the changed properties back to the logic function 228 in the request server 225, it is easier to connect the local information processing device 235 to a different type of data store 130, simply by re-writing the logic function 228 in the request server 225.
It is also within the contemplation of the invention that an extra property can be added to an object for the application to use. A corresponding extra property of the object needs to be added to the logic function 228 in the request server 225. Advantageously, the provision of the logic function 228 ensures that no changes are needed in the cache 210, because the cache 210 is just a general purpose store that saves lists of objects, objects and object properties, without knowing how the three types of entity interrelate other than by data contained within the entities themselves. For example, an object list entity contains a list of the unique identity numbers of the business objects in the list; an object contains a list of the unique identity numbers of the properties in the object.
When carrying out updates the cache 210 preferably sends all the changed properties to the remote request server 225 in one update message. The update message is either received successfully or it is not received at all.
Hence, there is no possibility that only some of the 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 16 updates will be received. In this manner, transactional integrity of the data is guaranteed.
Notably, in accordance with the preferred embodiment, S updates made by the application 105 to existing objects in the cache 210 do not update the cached object, but are attached to the object as an update request. When the local information-processing device 235 is operably coupled to the remote information system 240, for example, when the wireless device 235 is within coverage range of the wireless information system 240, update requests are sent to the request server 225. The request server 225 then updates the data store 130.
Once the request server 225 receives a confirmation from the data store 130 that the update request has been successful, the request server 225 signals to the cache 210 that the update request was successful. Only then does the cache 210 update its copy of the object. Hence, advantageously, the cache 210 can be synchronized to the data store 130 on the remote information system 240. In this manner, the application 105 is able to modify objects in the cache 210 that have already been changed, during the time that change is being implemented in the data store 130.
Until this success confirmation is received, the update request is preferably marked as "in progress".
If a further update is made by the application 105 to a property that has an "in progress" update request, it is 19001-S (FS-1 D4) vfinal.doc 20/06/2005 - 17 - envisaged that the second update is attached to the first update request as a 'child' update request. In accordance with the preferred embodiment of the present invention, the cache 210 has been adapted to include logic that ensures that this child update request commences only after the 'parent' update request has completed successfully. If a further update is made by the application 105, whilst the current child update request has not yet been effected, the further update is preferably merged with the current child update request.
When the application 105 requests a data object from the cache 210, the cache 210 carries out the following steps: (i) Reads the properties from the cached object; (ii) Applies any updates from an attached update request to the properties; (iii) Applies any further updates from an attached child update request to the properties; and (iv) Returns the updated object to the application.
More generally, it is envisaged that the aforementioned processing or memory elements may be implemented in the respective communication units in any suitable manner.
For example, new apparatus may be added to a conventional communication unit, or alternatively existing parts of a conventional communication unit may be adapted, for example by reprogramming one or more processors therein.
As such, the required implementation (or adaptation of existing unit(s)) may be implemented in the form of processor-implementable instructions stored on a storage 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 medium, such as a floppy disk, hard disk, PROM, RAM or any combination of these or other storage multimedia.
In the case of other network infrastructures, wireless or wircline, implementation of the processing operations may be performed at any appropriate node such as any other appropriate type of server, database, gateway, etc. Alternatively, it is envisaged that the aforementioned operations may be carried out by various components distributed at different locations or entities within any suitable network or system.
It is further envisaged that the applications that use caches in the context hereinbefore described, will often be ones in which a human user requests information from the data store (or serving application) 130. The application 105 will then preferably display the results of the data retrieval process on a screen of the local information processing device 235, to be viewed by the user.
Referring now to FIG. 3, a message sequence chart 300 for retrieving a data list from a remote information system 240 via a cache 210 is illustrated, in accordance with the preferred embodiment of the present invention. The message sequence chart 300 illustrates messages between the software application 105, the cache 210 and the remote information system 240.
The application 105 makes a request 305 for a data object list from the cache 210. If the communication network is 19001-5 (FS-1 D] ) vfinal.doc 20/06/2005 - 19 operational, the cache 210 makes a corresponding request 310 to the remote system 240 for the IDs of all the objects that are contained within the list. Once the cache 210 receives the ID list 315 it forwards the ID list 320 to the application 105.
For example, if the list contains three IDs then the application 105 then makes three individual requests 325, 330 and 335 to the cache 210 for each object whose ID was returned in the list. In this example, let us assume that valid copies of the first and second objects, relating to request 325 and 330 first and second objects are already in the cache 210.
In accordance with the preferred embodiment of the present invention, the cache is configured to recognize that the first and second requested data objects are stored within the cache 210. Advantageously, the first and second requested data objects are then returned directly 340 and 345 to the application 105 from the cache 210. However, the cache 210 recognizes that no valid copy of the third object is contained in the cache 210. Hence, the cache 210 requests a copy 350 of the third object from the remote information system 240.
Once the cache receives the copy 355 of the third object, the cache 210 passes the third object 360 to the application 105.
In this manner, retrieval of a desired list of objects is performed efficiently and effectively, by utilizing existing data object stored in the cache 210.
19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 20 Furthermore, utilization of the communication network is kept to a minimum, where it is limited to the initial list request 310, 315, and retrieval of a data object 350, 355 that was not already stored in the cache 210.
Although FIG. 3 illustrates the first and second objects being sent to the application 105 from the cache 210 after the request 350 has been sent to the information system 240, a skilled artisan would appreciate that such transmission of data objects may be sent immediately, whilst a resource is being accessed on the communication network to request the third data object.
Referring now to FIG. 4, a functional block diagram of a cache management communication protocol 400 is illustrated, in accordance with the preferred embodiment of the present invention. The cache management communications protocol 400 preferably includes a variable block size and a variable re-transmit time. The cache management communications protocol 400 is also preferably symmetric between the two communicating entities.
In the following explanation, communications from the cache 210 to the request server 225 are described, for clarity purposes only. Communications from the request server 225 to the cache 210 are, substantially identical in form, except that all data flows in the opposite direction to that described here.
19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 21 The cache management communications protocol 400 passes blocks of data that include one or moremessages between the cache 210 and the request server 225. The cache management communications protocol 400 operates on a transport protocol 150 that runs within the communication network 155. The transport protocol 150 carries the data blocks 420 in one or more packets 430, depending on the relative sizes of the block and the packets, as shown in greater detail with respect to FIG. 5.
To use the cache management communications protocol 400 described in this invention, the transport protocol 150 and communication network components 155 preferably has one or more of the following capabilities: (i) Ihe ability to wrap the data block in one packet or, if the data block is larger than the largest packet the transport protocol 150 allows, in multiple packets; (ii) Route the packets 430 from the source to the destination; (iii) If the data block 420 was passed in more than one packet 532, 534, re-assemble the data block from its constituent packets; and (iv) Detect and delete data blocks duplicated in the communication network 155.
In the preferred embodiment, the transport protocol 150 has the following further characteristics, singly or preferably in combination, in order to optimise use of the cache management communications protocol 400: 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 22 (i) Packets lost from multi-packet data blocks are detected and re- transmitted without involvement of the cache 210 or request server 225; (ii) The communication network components in the local information- processing device 235 and the remote information system 240 estimate the likely transmission time for each packet and the current communication network bit rate. The local information-processing device 235 and the remote information system 240 then pass this information to their respective users, the cache 210 or request server 225; (iii) The communication network components in the local informationprocessing device 235 and the remote information system 240 are configured to inform their respective users, the cache 210 or request server 225, when transmission of a message commences.
The only protocol known to possess all these features is the Reliable Wireless Protocol developed by flyingSPARKT.
However, it is envisaged that the inventive concepts related to the cache management communication protocol, as described herein, may be applied to any transport protocol, such as the Wireless Transport Protocol (WTP), which is part of the Wireless Access Protocol (MAP) protocol suite.
For improved efficiency on the communication network 155, it is preferred that the transport protocol 150 does not run in an 'acknowledged' mode. In this regard, the acknowledgment of a request message from the cache 210 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 23 equates to the response message received from the request server 225. The approach to using a response message as an acknowledgement removes the need for any additional acknowledgements to be sent by the transport protocol 150.
In this regard, as the cache 210 receives no explicit acknowledgement that the data block that was sent has been received at the request server 225, the cache 210 needs to track what blocks have been sent. If no response message is received within a defined time for any of the request messages within the block, then that block is identified as lost. The block is then preferably re-transmitted by the cache 210. In order for the cache 210 not to re-transmit blocks unnecessarily, but to re-transmit them as soon as it is clear that the response has not been received by the request server 225, the cache 210 needs to estimate the time within which a response would be typically expected. In a typical data communication environment, such as a packet data wireless network, this time will depend on a number of the following: (i) The available bandwidth of the network, (ii) The loading on the channel, (iii) The size of the block of data transmitted, and (iv) The amount of processing in the remote information system 240 to retrieve the data requested.
Two preferred examples, for determining an acceptable re- transmit time within the cache management communications 19001-5 (FS-1 Do) vfinal.doc 20/06/2005 - 24 protocol 400 are described with reference to FIG. 6 and FIG. 7. The descriptions detail information flow from the cache 210 to the request server 225. However, the same descriptions apply equally well to information flow from the request server 235 to the cache 230, albeit that data flows in the reverse direction and the actions of the cache 230 and the request server 235 are swapped.
Referring now to FIG. 6, a flowchart 600 indicating one example for determining an acceptable re-transmit time is illustrated. First, a minimum re-transmit time ('l'min), a maximum re-transmit time (TmaX), a time-out reduction factor a and a time-out increase factor A, are set in step 605, where a and are both less than unity. When the system starts, the time-out (Tout) is set to the mid- point between TmaX and Tmin, as shown in step 610.
When notified by the local communication unit 115 that transmission of a data block has started or, in the absence of this capability, when the data block is passed to the local communications unit 115 in step 615, a timer for substantially each message (or a subset of messages) that is included in the block is commenced in the Cache 230, as in step 620. If a response for a message is received before the timer expires in step 625, the actual time, Tact' that the request- response message pair took is calculated. In addition, Tout is reduced to: 19 0 01- 5 ( FS -1 D4) v f inal. doc 20/06/2005 - 25 (l-a).Tout + a.Tact [1] down to a minimum of Tmin, as shown in step 630.
If the timer expires in step 635, the message is re-sent in step 640. Tout is then increased to: (lJ-)-Tout [2] up to a maximum of TmaX, as shown in step 645.
In this manner, the re-transmit timer is adaptively adjusted, using a and based on the prevailing communication network conditions.
Although not indicated in the above example, it is envisaged that a retransmit timer margin may be incorporated, whereby an increase or decrease in Tout would not be performed. In this manner, the method has an improved chance of reaching a steady state condition.
It is envisaged that Tmin, TmaX' a and may be selected based on theoretical studies of the cache management communications protocol 400. Alternatively, or in addition, they may be selected based on trial and error when running each particular implementation.
19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 26 - Referring now to FIG. 7, a flowchart 700 indicating a second example for determining an acceptable re-transmit time, is illustrated. This example assumes that the local communication unit 235 and remote information system 240 can provide continually-updated estimates of the transmission time in both directions (Tup and Tdown) for maximum-sized packets. Furthermore, it is assumed that the application 105 is able to provide an estimate, Tproc, of the processing time of each request type at the data store (or serving application) 130.
First, a lower bound (LB) and an upper bound (UB) are set to the acceptable levels of the proportion of packets that are re-transmitted, where LB and UB are greater than zero and less than unity. In addition, an averaging message count M is initialized, where M is an integer greater than zero, as shown in step 705. When the system starts, a safety margin p is set to a suitable value, say 0.5, as in step 710. A successful message counter (SMC) and a failed message counter (FMC) are then set to zero, as shown in step 712.
When notified by the local communication unit 235 that transmission of a block has started or, in the absence of this capability, when the block is passed to the local communication unit 235 in step 715, a timer for substantially each message (or a subset of messages) included in the data block are commenced as shown in step 720. The timers are set separately for each message, to: 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 27 (l+p)(Tup + Tdown + Tproc) [3] Where the Tproc is specific to that message type, as shown in step 722.
If a response is received in step 725 before the timer expires, the SMC value is incremented, as shown in step 730. If the timer expires in step 735, the message is re-sent in step 740 and FMC incremented, as shown in step 745.
The sum of FMC + SMC is then calculated, and if the sum is determined to be greater than 'M' in step 750, then the success ratio (a) is set, in step 755, to: = SMC/(FMC -l SMC) In this regard, either FMC or SMC is incremented each time a message is sent, so FMC + SMC is the total number of messages sent (including retries) since they were zeroed. Thus, is the proportion of messages that are sent successfully.
If > UB in step 760, then p is decreased to p.UB/q, as shown in step 765. However, if < LB in step 770, then p is increased to p.LB/q, as shown in step 775. The process then returns to step 712 whereby FMC and SMC are reset.
19001-5 (FS-1 D] ) vfinal.doc 20/06/2005 - 28 It is envisaged that the values for LB, UB, and M may also be selected based on theoretical studies of the cache management communications protocol 400.
Alternatively, or in addition, they may be selected based on trial and error when running each particular implementation.
As shown in FIG. 4, the fundamental unit of data passed between the application 105 and the request server 225 is a message. These messages may contain requests for data (an object or a list of objects), replies to requests (responses containing one or more or a list of objects), updates of data that already exist, etc. It is envisaged that each message may be a different size. Frequently a group of messages will be sent out together, concatenated into a single block of data, as shown in FIG. 5. In this regard, the cache 210 groups messages together into the optimum size of data block.
When the communications reliability is high, large blocks should be sent in order to: (i) Minimise the overhead needed, (ii) To provide a more rapid transmission, and (iii) Provide a more efficient use of the communication network 155.
When reliability is low, and blocks need to be re- transmitted, blocks should be small to reduce the probability that an individual block is corrupted. The block size should also be kept small to reduce the amount 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 29 of data that needs to be resent in the event of a corrupted block.
Current cache management communications protocols do not have these features. The preferred embodiment of the present invention provides a mechanism to address these deficiencies. A preferred algorithm for achieving this adaptive block size is described below.
First, let us set an upper bound (UB) and a lower bound (LB) to the number of messages that may be contained in a block. When the system starts, the Block Size (BS) is set to the mid point between UB and LB.
1S If a block is sent successfully, then the BS is increased by a Success Increment (SI) up to a maximum of UB. In this context, 'sent successfully' means one of the following: (i) A response was received for at least one of the messages in the block (this is relevant when using an unacknowledged transport protocol 150); or (ii) There was no notification from the communication network 155 that the block was not received successfully (this is relevant when using an acknowledged transport protocol 150).
If a block is re-transmitted, then the BS is reduced by a Failure Decrement (ED) value, down to a minimum of LB.
Although not indicated in the above example, it is envisaged that a data block size margin may be 19 0 01- 5 ( FS -1 D4) v f inal. doc 20/06/2005 30 incorporated, whereby an increase or decrease in BS would not be performed. In this manner, the method has an improved chance of reaching a steady state condition.
When presented with a set of messages from the application 105, the cache 230 groups a BS number of messages into each block. It is envisaged that UB, LB, SI and/or ED may be selected based on theoretical studies of the cache management communications protocol and/or by trial and error in each particular implementation.
An optional enhancement to the above block size selection algorithm is to set UB as being dependent upon the available communication network bit rate, as notified by the local communication unit 115. When bit rates are high, UB may be seL at a higher level to take advantage of the higher available bandwidth. When bit rates are low, UB should be reduced to a value that ensures that the round trip time for a request/response is sufficiently short so that the user will still experience an acceptable response time from the system.
If a large number of data requests and response messages are sent in a block, the remote information system 240 may appear to the user to be relatively unresponsive. In order to improve the responsiveness of the remote information system 240, for large collections of data request messages, the preferred embodiment of the present invention limits the first transmitted block to a small number of messages. This number may be a fixed value, defined for each implementation, or it may be specified 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 by the application. As such, the number may be adjusted depending on, inter-alia: (i) The type of the request, (ii) Any preferences set by the user, and (ill) The task that the user is currently performing.
Advantageously, this technique ensures that the first few requested objects are retrieved quickly. Thus, a small part of the list appears quickly on the screen, providing the user with good feedback and a speedy indication that the system is working and is responsive.
It will be understood that the data communication system described above provides at least the following advantages: With regard to organization of data objects within a cache: (i) All business logic is removed from the application and cache, thereby making them easier to implement and increase the portability between the cache 210 and different data stores 130; (ii) The cache and application are isolated from any changes to the structure of the data in the data store 130, thereby making it easier to upgrade the data store; and (iii) Transactional integrity is improved.
With regard to the retrieval of lists of items: 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 32 (i) The amount of data sent over the communication network is kept to a minimum; and (ii) A rapid response to the user is provided, by displaying items available from the cache immediately on a communication unit's screen.
With regard to updating the cache when previous updates are being flushed: (i) The application 105 is allowed to keep on using and modifying data in the cache 210, even during extended data store update periods, by use of an attached update request; and (ii) Transactional integrity is improved.
With regard to the provision of an improved cache management communications protocol: (i) The need for the application to recover from network problems is removed, thereby making the application easier to write; (ii) The communication demand is varied to match the communication network's capabilities; thereby maximising data transfer performance; (iii) Ensures that the response from the request server occurs in a reasonable time to allow the application to provide good user feedback; and (iv) Ensures a quick turn round of the initial data items in a list, to allow the application to provide good user feedback.
Whilst the specific and preferred implementations of the embodiments of the present invention are described above, 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 33 it is clear that one skilled in the art could readily apply variations and modifications of such inventive concepts.
Thus, an improved mechanism for organizing data objects within a cache has been described wherein the abovementioned disadvantages associated with prior art arrangements have been substantially alleviated.
Furthermore, an improved mechanism for retrieving data objects from within a cache has been described, wherein the abovementioned disadvantages associated with prior art arrangements have been substantially alleviated.
Moreover, an improved mechanism for updating data objects to a remote information store has been described, wherein the abovementioned disadvantages associated with prior art arrangements have been substantially alleviated.
In addition, an improved cache management communications protocol has been described, wherein the abovementioned disadvantages associated with prior art arrangements have been substantially alleviated.

Claims (9)

19001-5 (FS-1 D! ) vfinal.doc ZO/06/2005 - 34 Claims 1. A communication
network (200) comprising a local information processing device (235) and a remote information system (240) that operate a transport protocol such that said local information processing device (235) and said remote information system (240) includes a processor to perform one or more of the following data processing functions to enable data to be transmitted using said transport protocol: wrap a data block in one packet or, if said data block is larger than a largest data packet the transport protocol supports, in multiple packets; if a data block was passed in more than one packet, re-assemble said data block from its constituent packets; and detect and delete data blocks duplicated in a communication network; the communication network characterized by: both of said local information processing device (235) and said remote information system (240) estimate a likely transmission time for each data packet based on a recent communication network bit rate, and forwards said estimate from one to the other.
2. The communication network (200) according to Claim 1, further characterized in that the local communication unit 235 and remote information system 240 provide continually-updated estimates of the transmission time in both directions (Tup, Tdown) 19001-5 (FS-1 D4) vfinal.doc 20J06/2005 - 35
3. The communication network (200) according to Claim 2, further characterized in that the local communication unit 235 and remote information system 240 provide continually-updated estimates of the transmission time in both directions (Tup, Tdown) for maximum-sized packets.
4. 'I'he communication network (200) according to any preceding Claim, wherein said local communication unit (235) comprises a cache (210) operably coupled to an application (105) further characterized in that the application 105 is arranged to provide an estimate (Tproc) of the processing time of each request type at the remote information system (240).
5. The communication network (200) according to Claim 1, further characterized by said local information processing device (235) and/or said remote information system (240) being configured to inform said respective user (210, 235), when transmission of a data packet commences.
6. The communication network (200) according to Claim 1, further characterized by said processor of said local information processing device (235) and/or said remote information system (240) being configured to: detect at least one data packet lost from at least one multi-packet data block; and 19001-5 (FS-1 D4) vfinal.doc 20/06/2005 - 36 re-transmit said at least one lost data packet without involvement of said user (210, 235).
7. A request server (225) located in the remote information system (240) and adapted to operate in the data communication network according to any of the preceding Claims.
a. A local information processing device (235) adapted to operate in the data communication network according to any of preceding Claims 1 to 6.
9. A remote information system (240) adapted to operate in the data communication network according to any of preceding Claims 1 to 6.
GB0512440A 2002-05-29 2002-05-29 Method and system for using caches Expired - Fee Related GB2412770B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
GB0512440A GB2412770B (en) 2002-05-29 2002-05-29 Method and system for using caches

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB0212384A GB2389201B (en) 2002-05-29 2002-05-29 Methods and system for using caches
GB0512440A GB2412770B (en) 2002-05-29 2002-05-29 Method and system for using caches

Publications (3)

Publication Number Publication Date
GB0512440D0 GB0512440D0 (en) 2005-07-27
GB2412770A true GB2412770A (en) 2005-10-05
GB2412770B GB2412770B (en) 2006-11-15

Family

ID=34855770

Family Applications (1)

Application Number Title Priority Date Filing Date
GB0512440A Expired - Fee Related GB2412770B (en) 2002-05-29 2002-05-29 Method and system for using caches

Country Status (1)

Country Link
GB (1) GB2412770B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5987493A (en) * 1997-12-05 1999-11-16 Insoft Inc. Method and apparatus determining the load on a server in a network
WO1999059278A1 (en) * 1998-05-14 1999-11-18 Telefonaktiebolaget Lm Ericsson (Publ) Data transmission over a communications link with variable transmission rates
WO2000052831A2 (en) * 1999-03-05 2000-09-08 Tantivy Communications, Inc. Forward error correction on multiplexed cdma channels enabling high performance coding
WO2001003382A1 (en) * 1999-07-02 2001-01-11 Telefonaktiebolaget Lm Ericsson (Publ) Packet loss tolerant reshaping method
EP1372304A2 (en) * 2002-06-14 2003-12-17 Matsushita Electric Industrial Co., Ltd. Method, transmitter and receiver for adapting the encoding rate to a varying transport rate

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5987493A (en) * 1997-12-05 1999-11-16 Insoft Inc. Method and apparatus determining the load on a server in a network
WO1999059278A1 (en) * 1998-05-14 1999-11-18 Telefonaktiebolaget Lm Ericsson (Publ) Data transmission over a communications link with variable transmission rates
WO2000052831A2 (en) * 1999-03-05 2000-09-08 Tantivy Communications, Inc. Forward error correction on multiplexed cdma channels enabling high performance coding
WO2001003382A1 (en) * 1999-07-02 2001-01-11 Telefonaktiebolaget Lm Ericsson (Publ) Packet loss tolerant reshaping method
EP1372304A2 (en) * 2002-06-14 2003-12-17 Matsushita Electric Industrial Co., Ltd. Method, transmitter and receiver for adapting the encoding rate to a varying transport rate

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
H. Gilbert, "Introduction to TCP/IP", 2 February 1995, retrieved from www.yale.edu/pclt/COMM/TCPIP.HTM on 25/1/05 *

Also Published As

Publication number Publication date
GB2412770B (en) 2006-11-15
GB0512440D0 (en) 2005-07-27

Similar Documents

Publication Publication Date Title
US6775298B1 (en) Data transfer mechanism for handheld devices over a wireless communication link
AU2007313956B2 (en) Offline execution of Web based applications
US7363291B1 (en) Methods and apparatus for increasing efficiency of electronic document delivery to users
KR100289520B1 (en) Client/server communication system
EP1530859B1 (en) Heuristics-based routing of a query message in peer to peer networks
US6182111B1 (en) Method and system for managing distributed data
EP2266043B1 (en) Cache optimzation
KR100295003B1 (en) Time coherent caching system
US6795434B1 (en) Replicated server discovery in client-proxy servers
US7581028B2 (en) Method for efficient image distribution and management
US20210124719A1 (en) Local networked storage linked to remote networked storage system
US6983318B2 (en) Cache management method and system for storing dynamic contents
JP2004535631A (en) System and method for reducing the time to send information from a communication network to a user
US20070226229A1 (en) Method and system for class-based management of dynamic content in a networked environment
US20120192080A1 (en) Tailoring content based on available bandwidth
US20060167969A1 (en) Data caching based on data contents
US6868453B1 (en) Internet home page data acquisition method
US20120117253A1 (en) Methods for reducing latency in network connections and systems thereof
US20060155819A1 (en) Methods and system for using caches
US6615275B2 (en) System for increasing data access in network having compression device for determining and controlling data/object compression based on predetermined maximum percentage of CPU processing capacity
Wang et al. Grid-oriented storage: A single-image, cross-domain, high-bandwidth architecture
WO2002089000A1 (en) A system for caching data during peer-to-peer data transfer
WO2010031297A1 (en) Method of wireless application protocol (wap) gateway pull service and system thereof
CN101902449A (en) Emulation for persistent HTTP connection between network devices
GB2412770A (en) Method of communicating data over a network

Legal Events

Date Code Title Description
PCNP Patent ceased through non-payment of renewal fee

Effective date: 20080529