Nothing Special   »   [go: up one dir, main page]

GB2412769A - System for managing cache updates - Google Patents

System for managing cache updates Download PDF

Info

Publication number
GB2412769A
GB2412769A GB0512432A GB0512432A GB2412769A GB 2412769 A GB2412769 A GB 2412769A GB 0512432 A GB0512432 A GB 0512432A GB 0512432 A GB0512432 A GB 0512432A GB 2412769 A GB2412769 A GB 2412769A
Authority
GB
United Kingdom
Prior art keywords
cache
data
request
objects
application
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB0512432A
Other versions
GB0512432D0 (en
GB2412769B (en
Inventor
Paul Lionel Grabinar
Simon David Wood
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Flyingspark Ltd
Original Assignee
Flyingspark Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Flyingspark Ltd filed Critical Flyingspark Ltd
Priority to GB0512432A priority Critical patent/GB2412769B/en
Priority claimed from GB0212384A external-priority patent/GB2389201B/en
Publication of GB0512432D0 publication Critical patent/GB0512432D0/en
Publication of GB2412769A publication Critical patent/GB2412769A/en
Application granted granted Critical
Publication of GB2412769B publication Critical patent/GB2412769B/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24552Database cache management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/172Caching, prefetching or hoarding of files
    • G06F17/30893

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

A cache (210) for use in a local information processing device (235), said cache characterised in that said cache stores at least one business object comprising a plurality of data objects as one retrievable entity) and for operably coupling to an application (105) and arranged to store at least one new data object or at least one modification to an existing data object from said application (105), wherein said cache (210) is characterised by storing a request from said application (105) to update one or more data objects, as an update request attached to said one or more data objects. In this manner, transactional integrity of data that is to be modified between an application and a cache is improved.

Description

19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 1 -
METHODS AND SYSTEM FOR USING CACHES
Field of the Invention
This invention relates to a mechanism for operating caches that store subsets of data and that are connected to a remote information store by a communication system whose performance (i.e. data rate, latency and error rate) varies with time. The invention is applicable to, but not limited to, a cache for use in a portable computer or similar device that can be connected to a corporate information system via a packet data wireless network.
Background of the Invention
Present day communication systems, both wireless and wire-line, have a requirement to transfer data between communication units. Data, in this context, includes many forms of communication such as speech, multimedia, signalling communication, etc. Such data communication needs to be effectively and efficiently provided for, in order to optimise use of limited communication resources.
For data to be transferred across data communication networks, a communication unit addressing protocol is required. In this regard, the communication units are generally allocated addresses that can be read by a communication bridge, gateway and/or router, in order to determine how to transfer the data to the addressed unit.
19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 2 - The interconnection between networks is generally known as internetworking (or interned).
Networks are often divided into sub-networks, with protocols being set up to define a set of rules that allow the orderly exchange of information. Two common protocols used to transfer data in communication systems, are: Transmission Control Protocol (TCP) and Internet Protocol (IP). IP corresponds to data transfer in the network layer of the well-known OSI model whereas TCP corresponds to data transfer in the transport layer of the OSI model. Their operation is transparent to the physical and data link layers and can thus be used on any of the standard cabling networks such as Ethernet, FDDI or token ring.
In the field of this invention it is known that an
excessive amount of data traffic routed over a core portion of a data network may lead to a data overload in the network. This may lead to an undesirable, excessive consumption of the communication resource, for example bandwidth in a wireless network. To avoid such overload problems, many caching techniques have been introduced to manage the data traffic on a time basis.
An example of a cache, which may be considered as a local storage element in a distributed communication or computing system, includes network file systems, where data retrieved from a file storage system (e.g. a disk) can be stored in a cache on the computer that is requesting the data.
19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 3 - A further example is a database system, where data records retrieved from the database server are stored in a client's cache. Furthermore, web servers are known to S cache identified web pages in network servers closer to a typical requesting party. Web clients (browsers) are also known to cache previously retrieved web pages in a store local to the browser. As the information age has continued apace, the benefits and wide-use of caches has substantially increased.
Referring now to FIG. 1, a known data communication system 100 is illustrated that employs the use of a cache to store data locally. A local information processing device 135, such as a personal digital assistant or wireless access protocol (WAP) enabled cellular phone, includes a communication portion 115, operably coupled to a cache 110. The device 135 also includes application software 105 that cooperates with the cache 110 to enable the device 135 to run application software using data stored in, or accessible via, the cache 110. A primary use of the cache 110 is effectively as a localized data store for the local information processing device 135.
The communication portion 115 is used to connect the cache to remote information system 140, accessible over a communication network 155. In this regard, as well as for many other applications, caches are often used to reduce the amount of data that is transferred over the communication network 155. The amount of data transfer 19001-4 (FS-1 D3) vfinal.doc 20J06/2005 - 4 - is reduced if the data can be stored in the cache 110 on a local information-processing device 135. This arrangement avoids the need for data to be transferred/uploaded to the local information-processing device 135, from a data store 130 in a remote information system 140, over the communication network 155 each time a software application is run.
Furthermore, in general, caches provide a consequent benefit to system performance, as if the data needed by the local information-processing device 135 is already in the cache 110 then the cached data can be processed immediately. This provides a significant time saving when compared to transferring large amounts of data over the communication network 155. In addition, caches improve the communication network's reliability, because if the communication network fails then: (i) The data in the cache 110 is still available, allowing processing in the local information-processing device 135 to continue to the extent possible given the extent of the data in the cache 110; and (ii) The application in the local information- processing device 105 can create new items or modify existing items in the cache, which can then be used to update the remote information system 140.
In the current state of the art, caches store low-level data elements and leave it to the application 105 to re assemble the stored data into a meaningful entity. For example, customer records in a database are stored as 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 5 - rows in the customer table, but addresses are often stored as rows in the address table. In this example, the customer table row has a field that indicates which row in the associated address table is the address for that particular customer. The cache 110 would likely be configured to have the same structure as the database, replicating the table rows that relate to the objects that it holds. The inventors of the present invention have recognised inefficiencies and limitations in organizing objects within caches in this manner, as will be detailed later.
Furthermore, the application 105 generally contains considerable business logic (matching that in the data store) to be able to interpret the data elements in the cache 110 and to operate on them correctly.
In addition, the cache 110 must make sure that updates of objects maintain "transactional integrity". This means that if an object comprises rows from three tables, and an operation by the application 105 changes elements in all three rows, then the corresponding three rows in the data server must all be updated before any other application is allowed to access that object. If this transactional integrity is not maintained then objects will contain incorrect data, because some fields will have been updated and others will have not.
Clearly, as the application 105 must therefore contain all of the business logic needed to interpret and maintain consistency of the low level data in the cache, 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 6 it is complex to build. Furthermore, there is complexity and data integrity implications associated with updating the business logic on the data store. This consumes memory and processing power on the local information processing terminal 135. For portable (battery-operated) computing terminals, this last point is particularly disadvantageous as minimising power and resource consumption is of paramount importance.
Wireless communication systems, where a communication link is dependent upon the surrounding (free space) propagation conditions, are known to be occasionally unreliable. Hence, the need to maintain transactional integrity over unreliable communication networks means that specially designed, complicated protocols are needed. Such protocols need to hold the state of any transaction that is in progress should the local information processing device become disconnected from the communication network for any length of time (for example if a wireless device moves into an area with no radio coverage). Once re-connected the transactions that were in progress must then be completed.
Thus, there exists a need to provide an improved organization of data objects within a cache, wherein the aforementioned problems are substantially alleviated.
In the context of cache usage, it is important to be able to retrieve lists of items as quickly and efficiently as possible. For example, a user may perform a search, for example, to find all customers whose name begins with 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 7 - "T". It is important to the user that the retrieval of this data list is performed quickly. Current cache-based applications 105 retrieve these lists by sending a request to a server 125 on a remote information system 140 for the search to be carried out. The server 125 then returns the entire list. Clearly, such lists are also returned for other purposes. These lists often require large amounts of data, the processing of which consumes a lot of power.
Frequently the cache 110 already contains some of the objects that will be returned with the entire retrieved list, following a request. For example, where a list includes all the sales leads for a customer, and this list has previously been downloaded. When asking for all the leads again, the request must be made on the data store 130 as there may have been new leads added since the last find. However, the inventors of the present invention have recognised that even if one or two new leads have been added, most will still exist in the cache and will still be valid. Nevertheless, by requesting all leads from the data store 130, the current list retrieval techniques ignore any data items from the list that already exist in the cache. This inefficiency means that there are unnecessary data transfers over the communication network 155, which further reduce performance and increase costs.
Thus, there also exists a need to provide an improved mechanism for retrieving data objects from within a 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 8 - cache, wherein the aforementioned problems are substantially alleviated.
One benefit of some cache designs is that data items can be created and updated within the cache 110, and only later are new or modified items 'flushed' to the remote information store 140. Examples include network file systems and database systems. Notably, the caches used in web browsers do not have this capability. In order to malotain transactional integrity, once the cache begins to update the remote information system with the changed items, the system does not allow any of those items to be updated in the cache 110 by the using application 105 until all remote updates have been completed.
Locking the cache 110, while updates to the data store are in progress, is acceptable if the update is quick and reliable, for example over a high speed LAN or direct serial connection to a PC. However, if the update is slow and unreliable, as is typically the case over a wireless communication network, then this method can block use of the application 105 for a considerable time.
This restricts the utility of the application 105 to the device user.
Thus, there also exists a need to provide an improved mechanism for updating data objects to a remote information store, wherein the aforementioned problems associated with locking the cache are substantially alleviated.
19001-4 (FS-1 D3) vfinal.doc 20/06/2005 9 _ As indicated, a communications protocol must be run over the communication network to define the information to be retrieved as well as to recover from any network problems. Current cache management communications protocols 145 are designed for wireline networks.
Examples of such protocols include: (i) Server Message Block (SMB), which is the Windows file management protocol, runs over TCP/IP; (ii) Network File System (NFS), which is the UNIX file management protocol, runs over UDP/IP; (iii) Hyper Text Transfer Protocol (HTTP), which is the web page retrieval protocol, runs over TCP/IP; and (iv) Distributed Component Object Model (DCOM), which is a remote method invocation protocol, runs over TCP/IP.
However, if the communication network suffers degradation in service or a total failure, which is a common occurrence in the types of wireless networks that this invention serves, the request for data can often not be satisfied. Current cache management communications protocols (SMB, NFS, HTTP, DOOM eta) do not store the request nor do they re-transmit the request when the network is re-connected. Instead, the application must carry out an extensive recovery procedure, which often results in a further attempt to obtain the data after a suitable pre-defined interval. Unfortunately, this means that the application writer needs to be aware of how the underlying communications system operates and accordingly 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 10 write the program code needed to effect and manage the re-tries.
If different applications use the same cache or cache structure, then each one must implement the re-try mechanisms. This means that the applications themselves have additional complexity and hence required extra development and test time.
A need therefore also exists for an improved cache management communications protocol wherein the abovementioned disadvantages associated with prior art arrangements may be alleviated.
Statement of Invention
In accordance with a first aspect of the present invention, there is provided a cache, as claimed in Claim 1.
Further aspects of the present invention are as claimed in the dependent claims.
The preferred embodiments of the present invention address the following aspects of cache operation and data communication networks. In particular, the inventive concepts described herein find particular applicability in wireless communication systems for connecting portable computing devices having a cache to a remote data source.
The inventive concepts address problems, identified by 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 11 the inventors, in at least updating the cache when previous updates are being flushed.
Brief Description of the Drawings
FIG. l illustrates a known data communication system, whereby data is passed between a local information processing device and a remote information system.
Exemplary embodiments of the present invention will now be described, with reference to the accompanying drawings, in which: FIG. 2 illustrates a functional block diagram of a data communication system, whereby data is passed between a local information processing device and a remote information system, in accordance with a preferred embodiment of the present invention; FIG. 3 illustrates a preferred message sequence chart for retrieving a data list from a cache, in accordance with the preferred embodiment of the present invention; FIG. 4 illustrates a functional block diagram of a cache management communication protocol, in accordance with the preferred embodiment of the present invention; FIG. 5 illustrates the meanings of the terms "message", "block" and "packet" as used within this invention; 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 12 FIG. 6 shows a flowchart illustrating a method of determining an acceptable re-transmit time, in accordance with the preferred embodiment of the present invention; and FIG. 7 shows a flowchart illustrating a method of determining an acceptable re-transmit time, in accordance with an alternative embodiment of the present invention.
Description of Preferred Embodiments
Referring next to FIG. 2, a functional block diagram 200 of a data communication system is illustrated, in accordance with a preferred embodiment of the present invention. Data is passed between a local information processing device 235 and a remote information system 240, via a communication network 155. The preferred embodiment of the present invention is described with reference to a wireless communication network, for example one where personal digital assistants (PDAs) communicate over a GPRS wireless network to an information database. However, it is within the contemplation of the invention that the inventive concepts described herein can be applied to any data communication network - wireless or wireline.
Notably, in the preferred embodiment of the present invention, a single data object is used to represent a complete business object rather than part of a business object with external references to the other components of the object. In the context of the present invention, 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 13 the term 'business object' is used to encompass data objects from say, a complete list of Space Shuttle components to a list of customer details. An example of a business object could be an XML fragment defining a simple customer business object as follows <customer> <name> "Company name" </name> <mailing address liner> "mailing address line l" </mailing address liner> <mailing address line2> "mailing address line 2" </mailing address line2> <delivery address liner> "delivery address line l" </delivery address liner> <delivery address lined> "delivery address line 2" </delivery address line2> </customer> Where the tagged items are referred to as "properties".
In accordance with the preferred embodiment of the present invention, the request server 225 has been adapted to contain a logic function 228 that creates each business object from the various tables of data stored 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 14 within the associated data store 130 in the remote information system 240. This logic function 228 is specific to the data store 130 and/or the structure of the data it contains.
Business objects are then passed between the cache 210 and the request server 225.
If a new object is created, or the properties of an existing object are changed, the cache 210 passes the changed properties back to the request server 225.
Advantageously, in accordance with the preferred embodiment of the present invention, the logic function 228 performs the required updates on the appropriate table rows in the database within the data store 130.
Thus, the application 105 and cache 210 are shielded from needing to know anything about how the data is stored on the data store 130. Advantageously, this makes the task of the application writer much easier. Furthermore, by enabling the cache 210 to pass the changed properties back to the logic function 228 in the request server 225, it is easier to connect the local information processing device 235 to a different type of data store 130, simply by re-writing the logic function 228 in the request server 225.
It is also within the contemplation of the invention that an extra property can be added to an object for the application to use. A corresponding extra property of the object needs to be added to the logic function 228 in the request server 225. Advantageously, the provision of 19 0 01- 4 ( FS -1 D3) v f inal. doc 20/06/2005 - 15 the logic function 228 ensures that no changes are needed in the cache 210, because the cache 210 is just a general purpose store that saves lists of objects, objects and object properties, without knowing how the three types of entity interrelate other than by data contained within the entities themselves. For example, an object list entity contains a list of the unique identity numbers of the business objects in the list; an object contains a list of the unique identity numbers of the properties in the object.
When carrying out updates the cache 210 preferably sends all the changed properties to the remote request server 225 in one update message. The update message is either received successfully or it is not received at all.
Hence, there is no possibility that only some of the updates will be received. In this manner, transactional integrity of the data is guaranteed.
Notably, in accordance with the preferred embodiment, updates made by the application 105 to existing objects in the cache 210 do not update the cached object, but are attached to the object as an update request. When the local information-processing device 235 is operably coupled to the remote information system 240, for example, when the wireless device 235 is within coverage range of the wireless information system 240, update requests are sent to the request server 225. The request server 225 then updates the data store 130.
19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 16 - Once the request server 225 receives a confirmation from the data store 130 that the update request has been successful, the request server 225 signals to the cache 210 that the update request was successful. Only then does the cache 210 update its copy of the object. Hence, advantageously, the cache 210 can be synchronized to the data store 130 on the remote information system 240. In this manner, the application 105 is able to modify objects in the cache 210 that have already been changed, during the time that change is being implemented in the data store 130.
Until this success confirmation is received, the update request is preferably marked as "in progress".
If a further update is made by the application 105 to a property that has an "in progress" update request, it is envisaged that the second update is attached to the first update request as a 'child' update request. In accordance with the preferred embodiment of the present invention, the cache 210 has been adapted to include logic that ensures that this child update request commences only after the 'parent' update request has completed successfully. If a further update is made by the application 105, whilst the current child update request has not yet been effected, the further update is preferably merged with the current child update request.
When the application 105 requests a data object from the cache 210, the cache 210 carries out the following steps: (i) Reads the properties from the cached object; 19001-4 (FS-1 D3) vfinal.doc 20J 06/2005 (ii) Applies any updates from an attached update request to the properties; (iii) Applies any further updates from an attached child update request to the properties; and (iv) Returns the updated object to the application.
More generally, it is envisaged that the aforementioned processing or memory elements may be implemented in the respective communication units in any suitable manner.
For example, new apparatus may be added to a conventional communication unit, or alternatively existing parts of a conventional communication unit may be adapted, for example by reprogramming one or more processors therein.
As such, the required implementation (or adaptation of existing unit(s)) may be implemented in the form of processor-implementable instructions stored on a storage medium, such as a floppy disk, hard disk, PROM, RAM or any combination of these or other storage multimedia.
In the case of other network infrastructures, wireless or wireline, implementation of the processing operations may be performed at any appropriate node such as any other appropriate type of server, database, gateway, etc. Alternatively, it is envisaged that the aforementioned operations may be carried out by various components distributed at different locations or entities within any suitable network or system.
It is further envisaged that the applications that use caches in the context hereinbefore described, will often 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 18 - be ones in which a human user requests information from the data store (or serving application) 130. The application 105 will then preferably display the results of the data retrieval process on a screen of the local information processing device 235, to be viewed by the user.
Referring now to FIG. 3, a message sequence chart 300 for retrieving a data list from a remote information system 240 via a cache 210 is illustrated, in accordance with the preferred embodiment of the present invention. The message sequence chart 300 illustrates messages between the software application 105, the cache 210 and the remote information system 240.
The application 105 makes a request 305 for a data object list from the cache 210. If the communication network is operational, the cache 210 makes a corresponding request 310 to the remote system 240 for the IDs of all the objects that are contained within the list. Once the cache 210 receives the ID list 315 it forwards the lD list 320 to the application 105.
For example, if the list contains three IDs then the application 105 then makes three individual requests 325, 330 and 335 to the cache 210 for each object whose ID was returned in the list. In this example, let us assume that valid copies of the first and second objects, relating to request 325 and 330 first and second objects are already in the cache 210.
19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 19 In accordance with the preferred embodiment of the present invention, the cache is configured to recognise that the first and second requested data objects are stored within the cache 210. Advantageously, the first and second requested data objects are then returned directly 340 and 345 to the application 105 from the cache 210. However, the cache 210 recognises that no valid copy of the third object is contained in the cache 210. Hence, the cache 210 requests a copy 350 of the third object from the remote information system 240.
Once the cache receives the copy 355 of the third object, the cache 210 passes the third object 360 to the application 105.
In this manner, retrieval of a desired list of objects is performed efficiently and effectively, by utilising existing data object stored in the cache 210.
Furthermore, utilisation of the communication network is kept to a minimum, where it is limited to the initial list request 310, 315, and retrieval of a data object 350, 355 that was not already stored in the cache 210.
Although FIG. 3 illustrates the first and second objects being sent to the application 105 from the cache 210 after the request 350 has been sent to the information system 240, a skilled artisan would appreciate that such transmission of data objects may be sent immediately, whilst a resource is being accessed on the communication network to request the third data object.
19001-4 (FS-1 D3) vfinal.doc ZO/06/2005 - 20 Referring now to FIG. 4, a functional block diagram of a cache management communication protocol 400 is illustrated, in accordance with the preferred embodiment of the present invention. The cache management communications protocol 400 preferably includes a variable block size and a variable re-transmit time. The cache management communications protocol 400 is also preferably symmetric between the two communicating entities.
In the following explanation, communications from the cache 210 to the request server 225 are described, for clarity purposes only. Communications from the request server 225 to the cache 210 are, substantially identical in form, except that all data flows in the opposite direction to that described here.
The cache management communications protocol 400 passes blocks of data that include one or more messages between the cache 210 and the request server 225. The cache management communications protocol 400 operates on a transport protocol 150 that runs within the communication network 155. The transport protocol 150 carries the data blocks 420 in one or more packets 430, depending on the relative sizes of the block and the packets, as shown in greater detail with respect to FIG. 5.
To use the cache management communications protocol 400 described in this invention, the transport protocol 150 and communication network components 155 preferably has one or more of the following capabilities: 19001-4 (FS-l D3) vfinal.doc 20/06/2005 - 21 (1) The ability to wrap the datablock in one packet or, if the data block is larger than the largest packet the transport protocol 150 allows, in multiple packets; (ii) Route the packets 430 from the source to the destination; (iii) If the data block 420 was passed in more than one packet 532, 534, re-assemble the data block from its constituent packets; and (iv) Detect and delete data blocks duplicated in the communication network 155.
In the preferred embodiment, the transport protocol 150 has the following further characteristics, singly or preferably in combination, in order to optimise use of the cache management communications protocol 400: (i) Packets lost from multi-packet data blocks are detected and retransmitted without involvement of the cache 210 or request server 225; (ii) The communication network components in the local informationprocessing device 235 and the remote information system 240 estimate the likely transmission time for each packet and the current communication network bit rate. The local information-processing device 235 and the remote information system 240 then pass this information to their respective users, the cache 210 or request server 225; (iii) The communication network components in the local information- processing device 235 and the remote information system 240 are configured to inform their 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 22 respective users, the cache 210 or request server 225, when transmission of a message commences.
The only protocol known to possess all these features is S the Reliable Wireless protocol developed by flyingSPARK_.
However, it is envisaged that the inventive concepts related to the cache management communication protocol, as described herein, may be applied to any transport protocol, such as the Wireless Transport Protocol (WTP), which is part of the Wireless Access Protocol (WAP) protocol suite.
For improved efficiency on the communication network 155, it is preferred that the transport protocol 150 does not IS run in an 'acknowledged' mode. In this regard, the acknowledgment of a request message from the cache 210 equates to the response message received from the request server 225. The approach to using a response message as an acknowledgement removes the need for any additional acknowledgements to be sent by the transport protocol 150.
In this regard, as the cache 210 receives no explicit acknowledgement Lhat the data block that was sent has been received at the request server 225, the cache 210 needs to track what blocks have been sent. If no response message is received within a defined time for any of the request messages within the block, then that block is identified as lost. The block is then preferably re-transmitted by the cache 210. In order for 19001-4 (FS-l D3) vfinal.doc 20/06/2005 - 23 the cache 210 not to re- transmit blocks unnecessarily, but to re-transmit them as soon as it is clear that the response has not been received by the request server 225, the cache 210 needs to estimate the time within which a response would be typically expected. In a typical data communication environment, such as a packet data wireless network, this time will depend on a number of the following: (i) The available bandwidth of the network, (ii) The loading on the channel, (iii) The size of the block of data transmitted, and (iv) The amount of processing in the remote information system 240 to retrieve the data requested.
Two preferred examples, for determining an acceptable re transmit time within the cache management communications protocol 400 are described with reference to FIG. 6 and FIG. 7. The descriptions detail information flow from the cache 210 to the request server 225. However, the same descriptions apply equally well to information flow from the request server 235 to the cache 230, albeit that data flows in the reverse direction and the actions of the cache 230 and the request server 235 are swapped.
Referring now to FIG. 6, a flowchart 600 indicating one example for determining an acceptable re-transmit time is illustrated. First, a minimum re-transmit time (Tmin), a maximum re-transmit time (TmaX), a time-out reduction factor a and a time-out increase factor A, are set in 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 24 step 605, where a and are both less than unity. When the system starts, the time-out (Tout) is set to the mid- point between TmaX and Tmin, as shown in step 610.
When notified by the local communication unit 115 that transmission of a data block has started or, in the absence of this capability, when the data block is passed to the local communications unit 115 in step 615, a timer for substantially each message (or a subset of messages) that is included in the block is commenced in the Cache 230, as in step 620. If a response for a message is received before the timer expires in step 625, the actual time, Tact' that the request- response message pair took is calculated. In addition, Tout is reduced to: (l-a).TOut + a.Tact [1] down to a minimum of Tmin, as shown In step 630.
If the timer expires in step 635, the message is re-sent in step 640. Tout is then increased to: (l+).Tout [2] up to a maximum of TmaX' as shown in step 645.
19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 25 In this manner, the retransmit timer is adaptively adjusted, using a and based on the prevailing communication network conditions.
Although not indicated in the above example, it is envisaged that a retransmit timer margin may be incorporated, whereby an increase or decrease in Tout would not be performed. In this manner, the method has an improved chance of reaching a steady state condition.
It is envisaged that Tmin, TmaX, a and may be selected based on theoretical studies of the cache management communications protocol 400. Alternatively, or in addition, they may be selected based on trial and error when running each particular implementation.
Referring now to FIG. 7, a flowchart 700 indicating a second example for determining an acceptable re-transmit time, is illustrated. This example assumes that the local communication unit 235 and remote information system 240 can provide continually-updated estimates of the transmission time in both directions (Tup and Tdown) for maximum-sized packets. Furthermore, it is assumed that the application 105 is able to provide an estimate, Tproc' of the processing time of each request type at the data store (or serving application) 130.
First, a lower bound (LB) and an upper bound (UB) are set to the acceptable levels of the proportion of packets 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 26 that are re-transmitted, where LB and UB are greater than zero and less than unity. In addition, an averaging message count M is initialized, where M is an integer greater than zero, as shown in step 705. When the system starts, a safety margin p is set to a suitable value, say 0.5, as in step 710. A successful message counter (SMC) and a failed message counter (FMC) are then set to zero, as shown in step 712.
When notified by the local communication unit 235 that transmission of a block has started or, in the absence of this capability, when the block is passed to the local communication unit 235 in step 715, a timer for substantially each message (or a subset of messages) included in the data block are commenced as shown in step 720. The timers are set separately for each message, to: (1+p)(TUp + Tdown + Tproc) [3] Where the Tproc is specific to that message type, as shown in step 722.
If a response is received in step 725 before the timer expires, the SMC value is incremented, as shown in step 730. If the timer expires in step 735, the message is re-sent in step 740 and FMC incremented, as shown in step 745.
19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 27 The sum of FMC + SMC is then calculated, and if the sum is determined to be greater than 'M' in step 750, then the success ratio (a) is set, in step 755, to: = SMC/(FMC + SMC) [4] In this regard, either FMC or SMC is incremented each time a message is sent, so FMC + SMC is the total number of messages sent (including retries) since they were zeroed. Thus, is the proportion of messages that are sent successfully.
If > UB in step 760, then p is decreased to p.UB/q, as shown in step 765. However, if c LB in step 770, then p is increased to p.LB/, as shown in step 775. The process then returns to step 712 whereby FMC and SMC are reset.
It is envisaged that the values for LB, UB, and M may also be selected based on theoretical studies of the cache management communications protocol 400.
Alternatively, or in addition, they may be selected based on trial and error when running each particular implementation.
As shown in FIG. 4, the fundamental unit of data passed between the application 105 and the request server 225 is a message. These messages may contain requests for data (an object or a list of objects), replies to requests (responses containing one or more or a list of objects), 19001-4 (FS-1 D3) viinal.doc 20/06/2005 - 28 updates of data that already exist, etc. It is envisaged that each message may be a different size. Frequently a group of messages will be sent out together, concatenated into a single block of data, as shown in FIG. 5. In this regard, the cache 210 groups messages together into the optimum size of data block.
When the communications reliability is high, large blocks should be sent in order to: (i) Minimise the overhead needed, (ii) To provide a more rapid transmission, and (iii) Provide a more efficient use of the communication network 155.
When reliability is low, and blocks need to be re- transmitted, blocks should be small to reduce the probability that an individual block is corrupted. The block size should also be kept small to reduce the amount of data that needs to be re-sent in the event of a corrupted block.
Current cache management communications protocols do not have these features. The preferred embodiment of the present invention provides a mechanism to address these deficiencies. A preferred algorithm for achieving this adaptive block size is described below.
First, let us set an upper bound (UB) and a lower bound (LB) to the number of messages that may be contained in a block. When the system starts, the Block Size (BS) is set to the mid point between UB and LB.
19001-4 (FS-1 D3) vfinal.doc 20J06J2005 - 29 - If a block is sent successfully, then the BS is increased by a Success Increment (SI) up to a maximum of UB. In this context, 'sent successfully' means one of the following: (i) A response was received for at least one of the messages in the block (this is relevant when using an unacknowledged transport protocol 150); or (ii) There was no notification from the communication network 155 that the block was not received successfully (this is relevant when using an acknowledged transport protocol 150).
If a block is re-transmitted, then the BS is reduced by a Failure Decrement (ED) value, down to a minimum of LB.
Although not indicated in the above example, it is envisaged that a data block size margin may be incorporated, whereby an increase or decrease in BS would not be performed. In this manner, the method has an improved chance of reaching a steady state condition.
When presented with a set of messages from the application 105, the cache 230 groups a BS number of messages into each block. It is envisaged that UB, LB, SI and/or FD may be selected based on theoretical studies of the cache management communications protocol and/or by trial and error in each particular implementation.
An optional enhancement to the above block size selection algorithm is to set UB as being dependent upon the 19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 30 available communication network bit rate, as notified by the local communication unit 115. When bit rates are high, UB may be set at a higher level to take advantage of the higher available bandwidth. When bit rates are low, UB should be reduced to a value that ensures the the round trip time for a request/response is sufficiently short so that the user will still experience an acceptable response time from the system.
If a large number of data requests and response messages are sent in a block, the remote information system 240 may appear to the user to be relatively unresponsive. In order to improve the responsiveness of the remote information system 240, for large collections of data request messages, the preferred embodiment of the present invention limits the first transmitted block to a small number of messages. This number may be a fixed value, defined for each implementation, or it may be specified by the application. As such, the number may be adjusted depending on, interalia: (i) The type of the request, (ii) Any preferences set by the user, and (iii) The task that the user is currently performing.
Advantageously, this technique ensures that the first few requested objects are retrieved quickly. Thus, a small part of the list appears quickly on the screen, providing the user with good feedback and a speedy indication that the system is working and is responsive.
19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 31 It will be understood that the data communication system described above provides at least the following advantages: With regard to organization of data objects within a cache: (i) All business logic is removed from the application and cache, thereby making them easier to implement and increase the portability between the cache 210 and different data stores 130; (ii) The cache and application are isolated from any changes to the structure of the data in the data store 130, thereby making it easier to upgrade the data store; and (iii) Transactional integrity is improved.
With regard to the retrieval of lists of items: (i) The amount of data sent over the communication network is kept to a minimum; and (ii) A rapid response to the user is provided, by displaying items available from the cache immediately on a communication unit's screen.
With regard to updating the cache when previous updates are being flushed: (i) The application 105 is allowed to keep on using and modifying data in the cache 210, even during extended data store update periods, by use of an attached update request; and (ii) Transactional integrity is improved.
19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 32 With regard to the provision of an improved cache management communications protocol: (i) The need for the application to recover from network problems is removed, thereby making the application easier to write; (ii) The communication demand is varied to match the communication network's capabilities; thereby maximizing data transfer performance; (iii) Ensures that the response from the request lO server occurs in a reasonable time to allow the application to provide good user feedback; and (iv) Ensures a quick turn round of the initial data items in a list, to allow the application to provide good user feedback.
Whilst the specific and preferred implementations of the embodiments of the present invention are described above, it is clear that one skilled in the art could readily apply variations and modifications of such inventive concepts.
Thus, an improved mechanism for organizing data objects within a cache has been described wherein the abovementioned disadvantages associated with prior art arrangements have been substantially alleviated.
Furthermore, an improved mechanism for retrieving data objects from within a cache has been described, wherein the abovementioned disadvantages associated with prior art arrangements have been substantially alleviated.
19001-4 (FS-1 D3) vfinal.doc 20/06/2005 - 33 Moreover, an improved mechanism for updating data objects to a remote information store has been described, wherein the abovementioned disadvantages associated with prior art arrangements have been substantially alleviated.
In addition, an improved cache management communications protocol has been described, wherein the abovementioned disadvantages associated with prior art arrangements have been substantially alleviated.

Claims (7)

19001-4 (FS-1 D3) vfinal.doc - 34 Claims 1. A cache (210) for use in a
local information processing device (235) and for operably coupling to an application (105) and arranged to store at least one new data object or at least one modification to an existing data object from said application (105), wherein said cache (210) is characterized by storing a request from said application (105) to update one or more data lo objects, as an update request attached to said one or more data objects.
2. The cache (210) according to Claim 1, wherein said cache stores at least one business object comprising a plurality of data objects as one retrievable entity.
3. The cache (210) according to Claim 1, wherein said at least one business object includes one or more of the following: one or more lists of objects, objects and/or object properties.
4. The cache (210) according to Claim 3, wherein said cache is further characterized by said at least one business object comprising the following three types of entity: one or more lists of objects, objects, and object properties, where the three types of entity are interrelated only by data contained within the entities themselves.
5. The cache (210) according to Claim l or Claim 4, wherein said cache is further characterized by storing a 19001-4 (FS-1 D3) vfiral.doc 20/06/2005 - 35 new request to update a data object, which at that time is being used to update the data store (130), as a child request of an original update request.
6. The cache (210) accordlog to Claim 5, wherein said cache is further characterized by a merging function to merge at least one additional new update request on a data object containing the original update request, provided said at least one original update request is not at that time being used to update data on said data store (130).
7. The cache (210) according to any of the preceding Claims, wherein said cache is further characterized by: means for reading the properties from the cached object; means for applying, operably coupled to said means for reading, any updates from an attached update request to the properties and any further updates from an attached child update request to the properties; and means for transmitting the updated object to the application.
GB0512432A 2002-05-29 2002-05-29 Methods and systems for using caches Expired - Fee Related GB2412769B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
GB0512432A GB2412769B (en) 2002-05-29 2002-05-29 Methods and systems for using caches

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB0212384A GB2389201B (en) 2002-05-29 2002-05-29 Methods and system for using caches
GB0512432A GB2412769B (en) 2002-05-29 2002-05-29 Methods and systems for using caches

Publications (3)

Publication Number Publication Date
GB0512432D0 GB0512432D0 (en) 2005-07-27
GB2412769A true GB2412769A (en) 2005-10-05
GB2412769B GB2412769B (en) 2006-09-13

Family

ID=34855763

Family Applications (1)

Application Number Title Priority Date Filing Date
GB0512432A Expired - Fee Related GB2412769B (en) 2002-05-29 2002-05-29 Methods and systems for using caches

Country Status (1)

Country Link
GB (1) GB2412769B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102841914A (en) * 2011-05-19 2012-12-26 巴比禄股份有限公司 File managing apparatus and file managing method

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6026413A (en) * 1997-08-01 2000-02-15 International Business Machines Corporation Determining how changes to underlying data affect cached objects
US6029175A (en) * 1995-10-26 2000-02-22 Teknowledge Corporation Automatic retrieval of changed files by a network software agent
US6119151A (en) * 1994-03-07 2000-09-12 International Business Machines Corp. System and method for efficient cache management in a distributed file system
US6185608B1 (en) * 1998-06-12 2001-02-06 International Business Machines Corporation Caching dynamic web pages
WO2001043399A1 (en) * 1999-12-10 2001-06-14 Sun Microsystems, Inc. Maintaining cache consistency for dynamic web content
WO2001088762A2 (en) * 2000-05-16 2001-11-22 Open Market, Inc. Distributed dynamic web page caching system
WO2002025496A2 (en) * 2000-09-22 2002-03-28 Clearway Acquisition, Inc. Serving dynamic web-pages

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6119151A (en) * 1994-03-07 2000-09-12 International Business Machines Corp. System and method for efficient cache management in a distributed file system
US6029175A (en) * 1995-10-26 2000-02-22 Teknowledge Corporation Automatic retrieval of changed files by a network software agent
US6026413A (en) * 1997-08-01 2000-02-15 International Business Machines Corporation Determining how changes to underlying data affect cached objects
US6185608B1 (en) * 1998-06-12 2001-02-06 International Business Machines Corporation Caching dynamic web pages
WO2001043399A1 (en) * 1999-12-10 2001-06-14 Sun Microsystems, Inc. Maintaining cache consistency for dynamic web content
WO2001088762A2 (en) * 2000-05-16 2001-11-22 Open Market, Inc. Distributed dynamic web page caching system
WO2002025496A2 (en) * 2000-09-22 2002-03-28 Clearway Acquisition, Inc. Serving dynamic web-pages

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102841914A (en) * 2011-05-19 2012-12-26 巴比禄股份有限公司 File managing apparatus and file managing method

Also Published As

Publication number Publication date
GB0512432D0 (en) 2005-07-27
GB2412769B (en) 2006-09-13

Similar Documents

Publication Publication Date Title
US6775298B1 (en) Data transfer mechanism for handheld devices over a wireless communication link
JP3953109B2 (en) Client / server communication system
KR100295003B1 (en) Time coherent caching system
EP1530859B1 (en) Heuristics-based routing of a query message in peer to peer networks
AU2007313956B2 (en) Offline execution of Web based applications
US6170013B1 (en) Method and apparatus for controlling access to network information sources
EP1659755B1 (en) Method and apparatus for pre-packetised caching for network servers
JP2004535631A (en) System and method for reducing the time to send information from a communication network to a user
US20050193083A1 (en) Method for efficient image distribution and management
JP2010244571A (en) Transaction accelerator for client-server communication system
EP1046256A1 (en) Enhanced domain name service
US7349902B1 (en) Content consistency in a data access network system
US20120117253A1 (en) Methods for reducing latency in network connections and systems thereof
US20060155819A1 (en) Methods and system for using caches
US6615275B2 (en) System for increasing data access in network having compression device for determining and controlling data/object compression based on predetermined maximum percentage of CPU processing capacity
WO2012102893A2 (en) Site-aware distributed file system access from outside enterprise network
CN101902449B (en) Computer implementation method and system for persistent HTTP connection between network devices
WO2002089000A1 (en) A system for caching data during peer-to-peer data transfer
GB2412769A (en) System for managing cache updates
Ott et al. Applying dtn to mobile internet access: An experiment with http
Miu et al. Performance analysis of a dynamic parallel downloading scheme from mirror sites throughout the internet
GB2412770A (en) Method of communicating data over a network
Mattson Enhancing HTTP to improve page and object retrieval time with congested networks
CN116827935A (en) File transmission method
Sanchez-Loro et al. A proxy-based system for dynamic content negotiation and collaborative optimization in heterogenic environments

Legal Events

Date Code Title Description
PCNP Patent ceased through non-payment of renewal fee

Effective date: 20080529