Monday, 6 July 2009

I have been known to abandon blogs a month in.

In this case I'm busy, but I've been working on the code to automatically identify pericopes and correspondences in the NT. Its a pretty tough problem to tackle in odd hours here or there.

Still, the data I'm getting out is almost exactly the same as the published pericope correspondences for the synoptics, which is great. Some niggly tweaks still to go.

One in particular is tricky - where to break. I think this might be a human post-processing step. We need to put a pericope boundary sometimes between words that don't appear in either pericopes. The software can't work out if the boundary should be there, or if either or both words are just omitted words from one of the bounded pericopes. I can't see any way it could possibly do that, so I think I may have to tidy the data later.

Still the Codex Sinaiticus website might eat my time to work on this for the next few days.

0 Comments:

Post a Comment