Re: strategies for working w/ large amounts of text
Re: strategies for working w/ large amounts of text
- Subject: Re: strategies for working w/ large amounts of text
- From: Ian Joyner <email@hidden>
- Date: Tue, 14 Dec 2010 09:19:25 +1100
On 13 Dec 2010, at 14:03, Shane wrote:
>> Are you building a concordance database first ?
>>
>
> I have no thoughts whatsoever on how I'm building anything as of yet.
> Just researching how something of this size would be manipulated.
In the case of large texts like the Bible, you have a data source that is pretty much static. That is, if you put it in a database you won't run update operations against it.
The big question is, if you go to the trouble of breaking it into database records - maybe one per verse, what does that get you. Even putting it into XML might be a lot of work, and what happens when the next technology comes along.
Text is fairly flexible and searchable. You don't actually need a concordance (index). Searching through text is pretty fast (probably a modern processor will have found all occurrences of the word 'the' before you have got the concordance off the shelf. Spotlight technology could also help there to generate an index for you.
That said, what can you do with Cocoa? Why don't you model what you want in Core Data. That is data store agnostic so you could use a database (SQLite) or XML later. Once you have a model, this might help you see potential applications.
Having said that, I believe we are at the beginning of rearranging many linear texts into more immediately and easily useful forms, particularly teaching texts.
Ian
_______________________________________________
Cocoa-dev mailing list (email@hidden)
Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden