Lists

Terms and Conditions
Lists hosted on this site
Email the Postmaster
Tips for posting to public mailing lists

Re: Managing EOF caching

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Managing EOF caching

Subject: Re: Managing EOF caching
From: Michael DeMan <email@hidden>
Date: Thu, 19 Jan 2006 01:48:29 -0800

Oh, one more item.

If you can, do saveChanges() frequently. Don't sacrifice the integrity of your sandbox (EOEditingContext), but don't abuse it either.

The best I can tell, nobody at Apple has messed much with the saveChanges code and it still runs on a lot original stuff from the late 90s. In essence, my guess is that it is technically correct but nobody ever optimized it and its probably so complicated to optimize that everybody is afraid to do it. I know if I worked at Apple, the last thing I would want to be responsible for is making 'saveChanges () in EOF go faster' and some how introducing bugs.

The catch with saving changes often is transactional. Wherever possible, if you know that you can commit the sandbox to the database, do it, it will save massive CPU cycles. There is something really bad in the saveChanges() method that is at least on the order of n^2 or worse.

The way to theorize this is to break your stuff into well know transactions that have to be committed to the database completely. Also, you can avoid saveChanges *AND* blow away your cache by using stored procedures on the database side. Instead of saving changes, lock your EOEditiingContext, use the stored procedure to commit, and mark the committed objects+tables as 'dirty' in the snapshot.

If you are using nested editing contexts and a bunch of those fancy 'out of the box widgets' then even this may not help without some detailed time spent on the code.


On Jan 19, 2006, at 1:35 AM, Michael DeMan wrote:

The cheezy option is always hardware. Quick fix, and usually pretty cheap. Definitely not elegant.

If this is a production application, I would check swap utilization on the server, and if its low, and especially if there are some bucks for a few extra GB of RAM, do as follows:

1. Get the production system working, deferring the elegant work with a crappy fix. i.e., make sure you have lots of RAM and toss parameters into your shell or Monitor startups like: -Xms512m -XX:NewSize=4m -Xms256m
right along with your -DWOAutoOpenInBrowser=false statements, etc.
Your apps will get huge, but keep in mind that on a decent UNIX system with WebObjects (and nearly any server-side java app) you can expect a good 100MB+ of RAM to be unused, shared, and swapped out forever. When you look at RAM utilization, only look at whats resident.

2. In regards to more elegant solutions, look at some of the fetches and/or 1:M relationship accesses you do in the application. You can take some of those subroutines and when they're done, have them discard only those objects from your primary editing context and snapshot that they brought in for their local use and that you expect will not be needed in the near future.

3. Even if they are needed in the near future, have your database do the work and optimize your fetches to include only the columns of data you need and where possible use stored procedures. Obviously on this one 'near future' means minutes not milliseconds.
4. If you need the stuff in RAM, buy more RAM!
5. If you're running multiple instances, implement a global cross- instance EODatabaseObjectStoreCoordinator (or whatever the class is called, I forget right now), that will cache all your data in one spot across all your instances on the same host. You need to do a little bit of inter-process notification, but its a well understood methodology and you can find snippets at www.wosource.com.
My 2-cents anyway.
- mike
On Jan 18, 2006, at 10:20 AM, Ken Anderson wrote:
Dave,
It depends.  If it's a to-one relationship, and either:
a) The object is already in your current editing context from your fetch, or b) The object is in a different editing context, but the database snapshot is recent enough for your editing context to accept it

then yes, there will be no database activity. This is because the object can be found or created purely based on the global ID, which can be constructed because a to-one relationship requires you to know the entire primary key.

However, if the snapshot is old, your editing context might refuse the snapshot and request a fresh one (it depends on that context's timestamp or the default timestamp lag).

Lastly, to-many faults will always require a trip to the database, since the set information is not known. You can always improve this by setting the to-many relationship to batch fault. Don't forget though, if only one instance of a to-many fault exists at a particular time in the context, no amount of batch faulting will help you. Another way to improve this is to have a method in your class that determines the to-many grouping via an in-memory qualification (if you already have a group of objects you know are the superset). This of course gets dicey, because you're doing EOF's job...but some circumstances might warrant it.

To answer your original question, fetches from fetch specs or faulting go through the same plumbing.
Ken
On Jan 18, 2006, at 12:56 PM, Dave Rathnow wrote:
Is there a difference in the way EOF caches an object if it is fetched using an EOFetchSpecification or if it is retrieved by traversing a relationship? If I do a bulk fetch using a fetchSpec and then access these object via a relationship from a related object, will EOF use the cached object or fetch it from the database?
Thanks,
Dave.
-----Original Message-----
From: Ken Anderson [mailto:email@hidden]
Sent: January 18, 2006 08:29 AM
To: Dave Rathnow
Cc: 'email@hidden'
Subject: Re: Managing EOF caching
Dave,
What you're doing is OK, and pretty typical (at least for me). I try to limit the destruction of snapshots to the destruction of an editing context, so I don't have objects lying around that are not garbage collected, but their snapshots have been removed from the database context.

In one situation, I would have a singleton manage the 'ref data' context (which stored often used reference data). When a 'transaction' editing context was destroyed, it would invalidated all the EOs in THAT editing context that did not have a companion in the ref editing context (managed by the singleton). In that manner, we would hold on to ref data indefinitely, but only hold on to transactional data for the life of the context.
Ken
On Jan 17, 2006, at 12:07 PM, Dave Rathnow wrote:
We have an application that processes well data that is reported at regular and irregular reporting periods. The bulk of the data arrives in the morning within a two hour window. The application is headless, that is, it has no UI.

When we first wrote the program we were having problems with running out of memory and the simplest solution we found was to periodically dispose of the editing context and then nuke all snapshots from the EODatabase object. This solved the memory problems but of course we lost the advantage of any EOF caching. Unfortunately, we are now at a point where we are falling too far behind during busy periods so we have to revisit the problem and find a better solution.

This application is one part of a bigger system and it's primary job is to process incoming data according to a set of business rule. The processing involves fectching object from the DB and creating other objects that are then stored in the DB and consumed by other applications in the system. Most of the fetched objects should stay in memory since they will likely be used in the future to process data from the same location. Other objects could probably be nuked after a period of time. The newly created objects are not required and could be released as soon as they are saved to the DB.

I've been playing around with different optimization strategies including batch fetching but the one that seems to work the best is to save all inserted objects in my EC before I call saveChanges and then nuke the snapshots from the EODatabase after the saveChanges has finished. My overall memory requirements have increased but they seem to level out at an acceptable level.
So here are my question:
1. Does anyone have some ideas how to handle this type of caching requirements? Is there another/better approach or different solution?

2. Assuming what I'm doing is an acceptable approach, can I control how long objects are cached by EOF and what objects are cached?

3. Does nuking snapshots in the way I'm doing it have an bad side effects.
Thanks,
Dave.
_______________________________________________ Do not post admin requests to the list. They will be ignored. Webobjects-dev mailing list (email@hidden) Help/Unsubscribe/Update your Subscription: 40anderhome.com

This email sent to email@hidden
_______________________________________________ Do not post admin requests to the list. They will be ignored. Webobjects-dev mailing list (email@hidden) Help/Unsubscribe/Update your Subscription: 40geminisolutions.com

This email sent to email@hidden


_______________________________________________
Do not post admin requests to the list. They will be ignored.
Webobjects-dev mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden



References:  
  >RE: Managing EOF caching (From: Dave Rathnow <email@hidden>)
  >Re: Managing EOF caching (From: Ken Anderson <email@hidden>)
  >Re: Managing EOF caching (From: Michael DeMan <email@hidden>)




Prev by Date:
Re: Managing EOF caching

Next by Date:
Class Long

Previous by thread:
Re: Managing EOF caching

Next by thread:
RE: Managing EOF caching

Index(es):

Date
Thread