Lists

Terms and Conditions
Lists hosted on this site
Email the Postmaster
Tips for posting to public mailing lists

Re: Shared mmap and data consistency upon a crash

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Shared mmap and data consistency upon a crash

Subject: Re: Shared mmap and data consistency upon a crash
From: Felix Schwarz <email@hidden>
Date: Wed, 10 May 2006 23:38:20 +0200

Am 10.05.2006 um 21:28 schrieb Michael Smith:

On May 10, 2006, at 12:03 PM, email@hidden wrote:
Date: Wed, 10 May 2006 11:02:13 +0200
From: Felix Schwarz <email@hidden>
Subject: Shared mmap and data consistency upon a crash
To: email@hidden
Message-ID: <email@hidden>
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
Hello,
I am currently exploring the possibilites to use mmap to speed up
some of my most critical I/O,
Stop right here. What makes you think that mmap will "speed up" your I/O?
mmap() is a convenience interface, not a performance interface.
If you want to avoid blocking during I/O, consider the Posix asynchronous I/O system calls.

If you want to avoid the copy in/out from kernel space, consider using the F_NOCACHE fcntl(2) option. If you control your caching behaviour and don't expect another process will access your file data, this is typically the most "performant" way to go.


Thanks for the quick response!

I maybe should have added what it is that I'm writing: I'm writing a specialized, server-less database link library (similiar to SQLite by concept, my link library has only *very* basic functionality, though).

The idea behind using mmap() is to pull indexed records into and out of a file as fast as possible and take advantage of any caching by the VMM, especially regarding the table inside the file that keeps record of all the records it stores.

Since I've read elsewhere you can't mmap() files that are located on a network-volume, I've also written a fallback using read()/write()/ lseek()..

Some benchmarking gives me:

Adding 30000 sample records à 8100 byte using mmap():
0.72958 seconds

Adding 30000 sample records à 8100 byte using read()/write()/lseek():
1.24004 seconds

It appears, that, in both cases, the data in the file seems to be in sync when crashing. I could observe both a delay before any changes could be made to the file after an instant restart of the little test app plus the data that was written last could be reread correctly. With either method.

However, I'd still love an official confirmation from a kernel developer (or someone who is literate in that part of the kernel) since - I don't want to base data security on assumptions or lucky observations that may turn out wrong or may change with the next release of OS X - I'd love to use the mmap() solution - .. and then, I'm also plain curious ;-)

but am wondering about the reliability
of mmap and the durability of changes made to a file this way for one
particular edge case:
If
1) I mmap() a file using MAP_SHARED, PROT_READ and PROT_WRITE
2) I write into the returned address space
3) my app crashes prior to reaching munmap() and close()
If your app crashes, all bets are off - assuming you don't know where it crashed, you have no guarantee as to the state of anything it was doing, regardless of whether it had files mapped or not.

The way the data is organized in the file and the order in which its data is modified ensures its always in a consistent state - the actual modification to the table that keeps record of all records is written last and with one single memcpy() (or write() for the alternative version).

Typically, speaking as a developer, I find it's better to ship products that don't crash, rather than ones that are resistant to internal damage caused by crashes.
To me, it feels like you are trying to solve the wrong problems.
= Mike

In general, I do share your view. The focus in software development should always be on producing clean code in the first place, not trying to fix bad quality afterwards in other, new code. Rest assured, I spend considerable time in my projects on ensuring elegance and cleanness in the code and stability in the build.

However, a database link library is a special case (again, sorry for not previously specifying the nature of the project more clearly). You can make sure your code works rock-solid. But you don't know who will end up linking it into which kind of application. And you don't know which other APIs and link libraries play a role in that application.

Take OS X graphics APIs for example. There are a lot of ways that a user can crash your application if it - for example - uses QuickTime or Quartz ImageIO to load images and those choke on a file. You can neither prevent nor fix this. But your user's data should stay in a consistent state nonetheless.

Hope all that explains it a bit better.

Felix _______________________________________________
Do not post admin requests to the list. They will be ignored.
Darwin-dev mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden



References:  
  >RE: Shared mmap and data consistency upon a crash (From: Michael Smith <email@hidden>)




Prev by Date:
RE: Shared mmap and data consistency upon a crash

Next by Date:
re: How does one get the OS Product name programatically?

Previous by thread:
RE: Shared mmap and data consistency upon a crash

Next by thread:
Re: Shared mmap and data consistency upon a crash

Index(es):

Date
Thread