<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss'><id>tag:blogger.com,1999:blog-3708464975805257907</id><updated>2009-12-17T15:46:10.184-08:00</updated><title type='text'>New Testament Math</title><subtitle type='html'>This is a blog that describes my experiments with using statistics and mathematical analysis on the New Testament.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default?orderby=updated'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>16</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-6634906146244376045</id><published>2009-07-06T09:47:00.000-07:00</published><updated>2009-07-06T09:51:43.868-07:00</updated><title type='text'>All Gone Quiet</title><content type='html'>I have been known to abandon blogs a month in.&lt;br /&gt;&lt;br /&gt;In this case I'm busy, but I've been working on the code to automatically identify pericopes and correspondences in the NT. Its a pretty tough problem to tackle in odd hours here or there.&lt;br /&gt;&lt;br /&gt;Still, the data I'm getting out is almost exactly the same as the published pericope correspondences for the synoptics, which is great. Some niggly tweaks still to go.&lt;br /&gt;&lt;br /&gt;One in particular is tricky - where to break. I think this might be a human post-processing step. We need to put a pericope boundary sometimes between words that don't appear in either pericopes. The software can't work out if the boundary should be there, or if either or both words are just omitted words from one of the bounded pericopes. I can't see any way it could possibly do that, so I think I may have to tidy the data later.&lt;br /&gt;&lt;br /&gt;Still the Codex Sinaiticus website might eat my time to work on this for the next few days.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-6634906146244376045?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/6634906146244376045/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=6634906146244376045' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/6634906146244376045'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/6634906146244376045'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/07/all-gone-quiet.html' title='All Gone Quiet'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-384781557998593990</id><published>2009-07-06T09:34:00.000-07:00</published><updated>2009-07-06T09:39:42.003-07:00</updated><title type='text'>Codex Sinaiticus</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://upload.wikimedia.org/wikipedia/commons/thumb/4/4a/Sinaiticus_text.jpg/220px-Sinaiticus_text.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 220px; height: 214px;" src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/4a/Sinaiticus_text.jpg/220px-Sinaiticus_text.jpg" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;I'm sure this will be all over the biblioblogs in a few hours, but the Codex Sinaiticus project has launched their new website with the complete facsimile leaves of the codex to browse through.&lt;br /&gt;&lt;br /&gt;I'm having trouble with speed at the moment, I guess everyone is rushing there. But it is beautiful.&lt;br /&gt;&lt;br /&gt;You can find it at &lt;a href='http://www.codexsinaiticus.org/'&gt;http://www.codexsinaiticus.org/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Have fun.&lt;br /&gt;&lt;br /&gt;If you're unaware of what it is, see the &lt;a href='http://en.wikipedia.org/wiki/Codex_Sinaiticus#Facsimile_of_Codex_Sinaiticus'&gt;Wikipedia article&lt;/a&gt; for an overview.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-384781557998593990?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/384781557998593990/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=384781557998593990' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/384781557998593990'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/384781557998593990'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/07/codex-sinaiticus.html' title='Codex Sinaiticus'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-8858408232010115321</id><published>2009-06-10T17:46:00.000-07:00</published><updated>2009-06-10T17:46:00.433-07:00</updated><title type='text'>Color Map With John</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_zLASOkQThSA/Si2xH72F95I/AAAAAAAAAGE/hpZ6m_uQ_eQ/s1600-h/final.png"&gt;&lt;img style="cursor:pointer; cursor:hand;width: 400px; height: 195px;" src="http://4.bp.blogspot.com/_zLASOkQThSA/Si2xH72F95I/AAAAAAAAAGE/hpZ6m_uQ_eQ/s400/final.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5345123082640422802" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;This is the colormap with John as well as the three synoptics.&lt;br /&gt;&lt;br /&gt;Because of the way color-blending works, John's colors are calculated differently.&lt;br /&gt;&lt;br /&gt;1. John only is brown.&lt;br /&gt;&lt;br /&gt;2. Any other color in John shows shared content with the same color in the other gospels, so Yellow appearing in John shows shared content in Matt + Mark. Note that the opposite isn't true: Yellow in Matt doesn't mean it is shared with John.&lt;br /&gt;&lt;br /&gt;3. There is no indication at all in the first three gospels of what is shared with John. The colors in the first three are purely indicating sharing between the 3 synoptics.&lt;br /&gt;&lt;br /&gt;Clearly John hasn't got a lot of connections with the others, except for the crucifixion narrative.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-8858408232010115321?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/8858408232010115321/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=8858408232010115321' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/8858408232010115321'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/8858408232010115321'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/06/color-map-with-john.html' title='Color Map With John'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_zLASOkQThSA/Si2xH72F95I/AAAAAAAAAGE/hpZ6m_uQ_eQ/s72-c/final.png' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-1559531083686228214</id><published>2009-06-09T17:26:00.000-07:00</published><updated>2009-06-09T17:26:01.201-07:00</updated><title type='text'>Doubting Honore</title><content type='html'>The image I included a few posts ago from Honore's paper on statistics in the NT...&lt;br /&gt;&lt;br /&gt;&lt;p class="zemanta-img" style="margin: 1em; width: 310px;"&gt;&lt;a href="http://commons.wikipedia.org/wiki/Image:Relationship_between_synoptic_gospels.png"&gt;&lt;img src="http://upload.wikimedia.org/wikipedia/commons/thumb/9/9f/Relationship_between_synoptic_gospels.png/300px-Relationship_between_synoptic_gospels.png" alt="The relationships between the three synoptic g..." style="border: medium none ; display: block;" height="390" width="300"&gt;&lt;/a&gt;&lt;span class="zemanta-img-attribution"&gt;Image via &lt;a href="http://commons.wikipedia.org/wiki/Image:Relationship_between_synoptic_gospels.png"&gt;Wikipedia&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;...doesn't agree with my coloring map. In particular I detect a &lt;em&gt;lot&lt;/em&gt; less triple tradition material than he does.&lt;br /&gt;I'm sure this is because I am tracking words, and he is tracking pericopes. I have the pericope data (I used it to generate my map), but it will need another opportunity to analyse it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-1559531083686228214?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/1559531083686228214/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=1559531083686228214' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/1559531083686228214'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/1559531083686228214'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/06/doubting-honore.html' title='Doubting Honore'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-6614042979433893727</id><published>2009-06-08T17:06:00.001-07:00</published><updated>2009-06-08T18:17:33.690-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='synoptic problem'/><category scheme='http://www.blogger.com/atom/ns#' term='visualization'/><category scheme='http://www.blogger.com/atom/ns#' term='greek'/><title type='text'>Coloring Exercise</title><content type='html'>Phew... Its been a marathon evening trying to get the code for this working well. And there's probably lots I could say about it, but for now a pretty picture.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_zLASOkQThSA/Si2yMd3EPgI/AAAAAAAAAGM/JqJHs-wSyrA/s1600-h/final.png"&gt;&lt;img style="cursor:pointer; cursor:hand;width: 300px; height: 195px;" src="http://1.bp.blogspot.com/_zLASOkQThSA/Si2yMd3EPgI/AAAAAAAAAGM/JqJHs-wSyrA/s400/final.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5345124260002414082" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;(Image is released under a Creative Commons Attribution License 3.0, as usual).&lt;br /&gt;&lt;br /&gt;The image is a colored map of the three synoptic gospels. Each pixel in the map represents one greek word, and it is colored depending on whether it is unique to that gospel, or is found in other gospels. Exactly as we colored the synoptics when first learning about them.&lt;br /&gt;&lt;br /&gt;The color scheme is: Red, Green and Blue for Matt, Mark and Luke only. Yellow (Red+Green light) is Matt+Mark, Cyan (Green+Blue) is Mark+Luke, and Magenta (Red+Blue) is Matt+Luke. Black is triple tradition.&lt;br /&gt;&lt;br /&gt;The diagram isn't sorted in any pericope-based order. Each gospel is in its correct order. I figured that would be prettier since at this scale you don't want to know what is corresponding to what.&lt;br /&gt;&lt;br /&gt;Three things that strike me immediately (all of which I knew, but the diagram states them very clearly):&lt;br /&gt;&lt;br /&gt;1. Mark is short and there's not much in there that's unique.&lt;br /&gt;&lt;br /&gt;2. Luke has the most original material (John isn't here of course, if it were it would take that prize).&lt;br /&gt;&lt;br /&gt;3. There's a lot more yellow than purple (Matt sticks to Mark more than Luke does). &lt;br /&gt;&lt;br /&gt;And one thing I didn't realise.&lt;br /&gt;&lt;br /&gt;4. There's twice as much magenta as cyan (Luke is closer to Matt than to Mark).&lt;br /&gt;&lt;br /&gt;I don't know if number 4 is just one of those things that I never twigged and is perfectly obvious to everyone else, but the whole Markan priority thing made me assume that Luke would be more similar to Mark than Matt. Anyone else surprised at that? Or is it just me?&lt;br /&gt;&lt;br /&gt;Anyway, lots of unpacking of this data to do. I'll post briefly about the techniques used to generate it, later in the week.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-6614042979433893727?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/6614042979433893727/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=6614042979433893727' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/6614042979433893727'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/6614042979433893727'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/06/coloring-exercise.html' title='Coloring Exercise'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_zLASOkQThSA/Si2yMd3EPgI/AAAAAAAAAGM/JqJHs-wSyrA/s72-c/final.png' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-5711821822994521637</id><published>2009-06-08T05:59:00.000-07:00</published><updated>2009-06-08T05:59:00.381-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='synoptic problem'/><title type='text'>Synoptic Continua</title><content type='html'>Thinking about the Synoptic problem from a systems approach at the weekend, I came up with a relatively simple model:&lt;br /&gt;&lt;br /&gt;For each pair of extant documents there is a separate synoptic problem. The overall synoptic problem is the combination of these individuals.&lt;br /&gt;&lt;br /&gt;Each pair of documents (call them A and B) sits on a continuum. &lt;br /&gt;&lt;br /&gt;At one extreme B copied directly from A, possibly omitting some material and adding additional content.&lt;br /&gt;&lt;br /&gt;At the other extreme A copied directly from B, again with possible omissions and additions.&lt;br /&gt;&lt;br /&gt;In the middle of the continuum, A and B are independent and their agreements are due to some additional source.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_zLASOkQThSA/SivECKGwEYI/AAAAAAAAAFs/iQ-BxcLAPAY/s1600-h/Synoptic+Pairs.jpg"&gt;&lt;img style="cursor:pointer; cursor:hand;width: 400px; height: 400px;" src="http://2.bp.blogspot.com/_zLASOkQThSA/SivECKGwEYI/AAAAAAAAAFs/iQ-BxcLAPAY/s400/Synoptic+Pairs.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5344580924156678530" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;These three cases are black-and-white and easy to draw pictures of. Filling in the continuum are other scenarios. Maybe A and B had some common sources (they'd both heard similar sermons from visiting preachers, they'd both got access to some of the same letters), &lt;em&gt;and&lt;/em&gt; B copied from A. Now there's a blend in the relationship between both extremes of the continuum. I've tried to show this in shades of grey in the diagram.&lt;br /&gt;&lt;br /&gt;Assuming there are 3 pairs of synoptic gospel texts - the statistical analysis of the synoptic problem should provide a point on the continuum for each of the three pairs. And (more importantly) an estimate of the error for each. It may be that the relationship between Matthew and Luke, for example, is placed somewhere between a common source (Q hypothesis) and Luke copying Matthew (Farrer hypothesis), with error margins that are significant enough that both sides can claim victory. &lt;br /&gt;&lt;br /&gt;Would they? Or would a died in the wool Q fan think of anything less than smack in the middle as being a contradiction? From what I've read most researchers seem to pay only lip-service to the possibility of the solution being a shade of grey.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-5711821822994521637?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/5711821822994521637/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=5711821822994521637' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/5711821822994521637'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/5711821822994521637'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/06/synoptic-continua.html' title='Synoptic Continua'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_zLASOkQThSA/SivECKGwEYI/AAAAAAAAAFs/iQ-BxcLAPAY/s72-c/Synoptic+Pairs.jpg' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-7753802068881919883</id><published>2009-06-07T05:45:00.000-07:00</published><updated>2009-06-07T05:59:22.304-07:00</updated><title type='text'>Signal to Noise Ratios</title><content type='html'>In many things in science it is more important to know what you &lt;em&gt;can&lt;/em&gt; know, than to come up with a conclusion prematurely.&lt;br /&gt;&lt;br /&gt;One common heuristic in operations research (and science generally) is the signal to noise ratio. Data is inherently noisy. And everyone knows that the noise can obscure the signal.&lt;br /&gt;&lt;br /&gt;What isn't widely known, is that this isn't the whole picture. In order to detect signals in the midst of lots of noise, you have to be so sensitive to patterns that you'll start to detect patterns that aren't there. In other words: to avoid false negatives (not finding something that is there), you'll be forced to used techniques that can give false positives (finding things that aren't there).&lt;br /&gt;&lt;br /&gt;Strikes me that something of the same occurs in several scientific subdisciplines of theology. In the search for the historical Jesus, for example, we have to be so tuned to detecting theological bias in the scant records we have, that it is easy to spot features that aren't there. In the quest to solve the synoptic problem, similarly, we find features that are probably not there, which can lead to people declaring 'solutions' without understanding how likely they are to be true.&lt;br /&gt;&lt;br /&gt;I'm not a scholar, but I'm trying to read as much on the synoptic problem as possible. So far I've not read anything that tries to seriously understand how much we can possibly know from the source text. There are various references to "of course, we'll never really know" - but nobody trying to say how much we &lt;em&gt;could&lt;/em&gt; know.&lt;br /&gt;&lt;br /&gt;Seems to me that is a crucial question.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-7753802068881919883?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/7753802068881919883/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=7753802068881919883' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/7753802068881919883'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/7753802068881919883'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/06/signal-to-noise-ratios.html' title='Signal to Noise Ratios'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-964829373888020800</id><published>2009-06-04T04:18:00.000-07:00</published><updated>2009-06-04T04:30:50.556-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='synoptic problem'/><title type='text'>The Search for Synopsis</title><content type='html'>When I was thinking about starting this project, I emailed Mark Goodacre and asked some very nooby questions.&lt;br /&gt;&lt;br /&gt;One of which was, if there is digital data anywhere on the synopses of the first three gospels. In other words, is there data on what sections in each gospel correspond to which other sections (all 4 gospels + 1 Cor I guess would be cool too). After a little bit more digging I've found various sites with summaries of the major pericope correspondences, but no science-quality data.&lt;br /&gt;&lt;br /&gt;&lt;p class="zemanta-img" style="margin: 1em; width: 310px;"&gt;&lt;a href="http://commons.wikipedia.org/wiki/Image:Relationship_between_synoptic_gospels.png"&gt;&lt;img src="http://upload.wikimedia.org/wikipedia/commons/thumb/9/9f/Relationship_between_synoptic_gospels.png/300px-Relationship_between_synoptic_gospels.png" alt="The relationships between the three synoptic g..." style="border: medium none ; display: block;" height="390" width="300"&gt;&lt;/a&gt;&lt;span class="zemanta-img-attribution"&gt;Image via &lt;a href="http://commons.wikipedia.org/wiki/Image:Relationship_between_synoptic_gospels.png"&gt;Wikipedia&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;This was a little frustrating as I've been planning to do some cool stuff with that data. But then I figured it was a bit lightweight to be looking for this data anyway. In addressing the &lt;a class="zem_slink" href="http://en.wikipedia.org/wiki/Synoptic_problem" title="Synoptic problem" rel="wikipedia"&gt;synoptic problem&lt;/a&gt;, one of the problems I perceive (albeit as an outsider to its scholarly study) is that it is so thoroughly overlaid with preconceived ideas. A correspondence to one person may not be to another. So I need to go synopsis hunting myself, I think, with as unbiased a statistical tool as possible.&lt;br /&gt;&lt;br /&gt;This, unfortunately, is a lot harder that producing pretty graphs from word frequencies that I've been doing for the last week. So that's why there's a gap. And there's likely to be a bigger gap until I can spend some time at the weekend playing with it more too.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-964829373888020800?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/964829373888020800/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=964829373888020800' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/964829373888020800'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/964829373888020800'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/06/search-for-synopsis.html' title='The Search for Synopsis'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-5680225102146653246</id><published>2009-05-25T14:04:00.000-07:00</published><updated>2009-06-04T04:29:55.560-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='synoptic problem'/><category scheme='http://www.blogger.com/atom/ns#' term='manifesto'/><category scheme='http://www.blogger.com/atom/ns#' term='visualization'/><category scheme='http://www.blogger.com/atom/ns#' term='learning'/><category scheme='http://www.blogger.com/atom/ns#' term='greek'/><title type='text'>Motives</title><content type='html'>This blog is intended to hold notes and result from my dalliances with computational linguistics and the New Testament.&lt;br /&gt;&lt;br /&gt;My aim at this point is to play around with the NT in order to practice my greek and the learn more about the text. There are a couple of areas I'm interested in:&lt;br /&gt;&lt;br /&gt;First is learning more NT greek. I learned greek in my theology degree, and have kept up with it to a modest level. But I'm still a real amateur, so I'm interested in whether mathematical engagement with the text can help me master the language quicker or more fully.&lt;br /&gt;&lt;br /&gt;Secondly I'm interested in the Synoptic Problem, and in particular whether there are analyses that can shed light on them. I think there are some techniques from bioinformatics that could directly illuminate these problems, but getting to the point where I can do that analysis would be tricky.&lt;br /&gt;&lt;br /&gt;Thirdly I'm perpetually interested in visualization in lots of contexts. Visualization is time consuming, but if I get time it would be good to post some graphs, charts and other images here.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-5680225102146653246?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/5680225102146653246/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=5680225102146653246' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/5680225102146653246'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/5680225102146653246'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/05/motives.html' title='Motives'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-8732422214794542649</id><published>2009-05-25T16:05:00.000-07:00</published><updated>2009-06-02T12:37:13.865-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='learning'/><category scheme='http://www.blogger.com/atom/ns#' term='greek'/><title type='text'>How much vocab do you need to read The Reader's Greek Bible</title><content type='html'>So the first question I looked at (a little warm-up really) is: what is the frequency-ordered word-list of NT greek?&lt;br /&gt;&lt;br /&gt;The motivation for this was partly so I could get a sense of how much I'd need to know for the different 'hint' levels at John Dyer's &lt;a href="http://bible.johndyer.name/"&gt;Readers Greek Bible&lt;/a&gt;. If you haven't seen the site, it allows you to get footnote hints for words that appear fewer than X times in the NT. I wanted to know roughly what my vocab size would need to be to read at each level.&lt;br /&gt;&lt;br /&gt;The answers are:&lt;br /&gt;&lt;br /&gt;&lt;table&gt;&lt;tr&gt;&lt;th&gt;Frequency&lt;/th&gt;&lt;th&gt;Number of Words&lt;/th&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;100&lt;/td&gt;&lt;td&gt;171&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;90&lt;/td&gt;&lt;td&gt;194&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;80&lt;/td&gt;&lt;td&gt;212&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;70&lt;/td&gt;&lt;td&gt;238&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;60&lt;/td&gt;&lt;td&gt;271&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;50&lt;/td&gt;&lt;td&gt;310&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;45&lt;/td&gt;&lt;td&gt;338&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;40&lt;/td&gt;&lt;td&gt;377&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;35&lt;/td&gt;&lt;td&gt;416&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;30&lt;/td&gt;&lt;td&gt;463&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;25&lt;/td&gt;&lt;td&gt;545&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;20&lt;/td&gt;&lt;td&gt;636&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;15&lt;/td&gt;&lt;td&gt;809&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;10&lt;/td&gt;&lt;td&gt;1126&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;This analysis ignores word forms (so all declensions of a noun count as the same word, for example). There are 5463 different words in the NT using this method of counting.&lt;/p&gt;&lt;p&gt;These numbers seemed very low to me. I think they make it seem simpler than it really is - with 200 greek words you would struggle to read even with help set at level 100, because you are likely to come across novel &lt;span style="font-style:italic;"&gt;forms&lt;/span&gt; of words you know. And you'd have to know the words in their strict order of occurrence, which nobody does. I'm pretty sure my greek vocab has more than a thousand words in it, but I struggle to read comfortably below level 30.&lt;/p&gt;&lt;p&gt;But still it is interesting.&lt;/p&gt;&lt;p&gt;One of the questions this raises for me, is how much vocab you'd need to know to start reading the NT.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-8732422214794542649?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/8732422214794542649/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=8732422214794542649' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/8732422214794542649'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/8732422214794542649'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/05/how-much-vocab-do-you-need-to-read.html' title='How much vocab do you need to read The Reader&apos;s Greek Bible'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-1188305394988731015</id><published>2009-05-30T15:32:00.000-07:00</published><updated>2009-06-02T12:33:37.449-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='learning'/><category scheme='http://www.blogger.com/atom/ns#' term='reading list'/><title type='text'>Most readable chapters</title><content type='html'>This is starting to push the bounds of usefulness, I realise, but I applied the same analysis of the most readable verses to chapters.&lt;br /&gt;&lt;br /&gt;By the time you get to reading chapters, you need a lot of words, and it is unlikely you are going to worry about having to look up a few in the lexicon. So I realise this is artificial, but still, here are the results. The most readable chapters in the NT are:&lt;br /&gt;&lt;br /&gt;* John 17 (the 'High Priestly Prayer' of Jesus before his arrest).&lt;br /&gt;&lt;br /&gt;* John 16&lt;br /&gt;&lt;br /&gt;* Rev 15 (not surprising - a very short chapter about the seven plagues).&lt;br /&gt;&lt;br /&gt;* Rom 14&lt;br /&gt;&lt;br /&gt;* 1 John 1 (the start of a book - many of the simplest chapters start books, as much of the language is formulaic - 1 John 1 is also very short).&lt;br /&gt;&lt;br /&gt;Now, the thing that is most surprising about this list is just how far John 17 is ahead of the others. It it not a particularly short chapter, but is highly repetitious, and scores only 1000. John 16, coming in second, scores 2216, and then there is another gap before we get lots of chapters around the 3000 mark.&lt;br /&gt;&lt;br /&gt;Reading through the greek of John 17, it is remarkably simple.&lt;br /&gt;&lt;br /&gt;I think there's probably very little information in the result (no speculating that it was written by a different hand or anything like that), but I do find it interesting. And it is useful to now have a list of chapters that are pretty easy to read and that (importantly) any words I don't already know will be as useful as possible for my future reading.&lt;br /&gt;&lt;br /&gt;FYI, the top 20 with scores are:&lt;br /&gt;&lt;br /&gt;&lt;table&gt;&lt;tr&gt;&lt;th&gt;Position&lt;/th&gt;&lt;th&gt;Difficulty Score&lt;/th&gt;&lt;th&gt;Chapter&lt;/th&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;1&lt;/td&gt;&lt;td&gt;1000&lt;/td&gt;&lt;td&gt;John 17&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;2&lt;/td&gt;&lt;td&gt;2216&lt;/td&gt;&lt;td&gt;John 16&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;3&lt;/td&gt;&lt;td&gt;2753&lt;/td&gt;&lt;td&gt;Rev 15&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;4&lt;/td&gt;&lt;td&gt;2893&lt;/td&gt;&lt;td&gt;Rom 14&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;5&lt;/td&gt;&lt;td&gt;3033&lt;/td&gt;&lt;td&gt;1John 1&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;6&lt;/td&gt;&lt;td&gt;3033&lt;/td&gt;&lt;td&gt;1John 3&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;7&lt;/td&gt;&lt;td&gt;3096&lt;/td&gt;&lt;td&gt;John 1&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;8&lt;/td&gt;&lt;td&gt;3191&lt;/td&gt;&lt;td&gt;Rev 11&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;9&lt;/td&gt;&lt;td&gt;3211&lt;/td&gt;&lt;td&gt;Rev 17&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;10&lt;/td&gt;&lt;td&gt;3314&lt;/td&gt;&lt;td&gt;Matt 20&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;11&lt;/td&gt;&lt;td&gt;3417&lt;/td&gt;&lt;td&gt;1John 2&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;12&lt;/td&gt;&lt;td&gt;3417&lt;/td&gt;&lt;td&gt;1John 4&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;13&lt;/td&gt;&lt;td&gt;3424&lt;/td&gt;&lt;td&gt;2Cor 5&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;14&lt;/td&gt;&lt;td&gt;3431&lt;/td&gt;&lt;td&gt;2Thes 2&lt;/td&gt;&lt;/tr&gt;&lt;br /&gt;&lt;tr&gt;&lt;td&gt;15&lt;/td&gt;&lt;td&gt;3478&lt;/td&gt;&lt;td&gt;Rom 5&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;16&lt;/td&gt;&lt;td&gt;3563&lt;/td&gt;&lt;td&gt;Matt 10&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;17&lt;/td&gt;&lt;td&gt;3580&lt;/td&gt;&lt;td&gt;Matt 7&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;18&lt;/td&gt;&lt;td&gt;3696&lt;/td&gt;&lt;td&gt;Rom 6&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;19&lt;/td&gt;&lt;td&gt;3781&lt;/td&gt;&lt;td&gt;Matt 28&lt;/td&gt;&lt;/tr&gt;&lt;br /&gt;&lt;tr&gt;&lt;td&gt;20&lt;/td&gt;&lt;td&gt;3966&lt;/td&gt;&lt;td&gt;Rev 1&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-1188305394988731015?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/1188305394988731015/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=1188305394988731015' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/1188305394988731015'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/1188305394988731015'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/05/most-readable-chapters.html' title='Most readable chapters'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-44598249565160936</id><published>2009-05-28T09:34:00.000-07:00</published><updated>2009-05-28T09:34:01.029-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='visualization'/><category scheme='http://www.blogger.com/atom/ns#' term='learning'/><title type='text'>Verb Table Heat Map</title><content type='html'>So I started playing with Grammatical analyses, as I suggested in previous posts, and I had a go at a first visualization.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://sites.google.com/site/ntmathfiles/Home/heatmap-usletter.pdf"&gt;&lt;img style="cursor:pointer; cursor:hand;width: 400px; height: 310px;" src="http://1.bp.blogspot.com/_zLASOkQThSA/Sh3EWhY3ssI/AAAAAAAAAFk/mJ7XtoWEaBI/s400/Picture+3.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5340640624330126018" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;It is a simple heat-map of a verb table showing what conjugations of the verb are most common and which are rare. Interestingly a whole slew of conjugations that appear in several grammars and other verb tables I have never appear at all in the NT. The optative future is the main offender, but there are several others.&lt;br /&gt;&lt;br /&gt;If you are primarily interested in reading the GNT, this map shows what conjugations to focus on, but there are limitations, as described on the chart.&lt;br /&gt;&lt;br /&gt;You can download it in either &lt;a href='http://sites.google.com/site/ntmathfiles/Home/heatmap.pdf'&gt;International A4&lt;/a&gt; or &lt;a href='http://sites.google.com/site/ntmathfiles/Home/heatmap-usletter.pdf'&gt;US Letter&lt;/a&gt; format. Both are pdf files and both are around 10K in size, so they're tiny.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-44598249565160936?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/44598249565160936/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=44598249565160936' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/44598249565160936'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/44598249565160936'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/05/verb-table-heat-map.html' title='Verb Table Heat Map'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_zLASOkQThSA/Sh3EWhY3ssI/AAAAAAAAAFk/mJ7XtoWEaBI/s72-c/Picture+3.png' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-5904827226479456233</id><published>2009-05-27T14:55:00.000-07:00</published><updated>2009-05-27T15:59:28.777-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='visualization'/><category scheme='http://www.blogger.com/atom/ns#' term='learning'/><title type='text'>A Graph</title><content type='html'>In my last post I talked about the number of headwords you'd need to know to start reading the NT, if you learned the words in decreasing order of frequency. (You'd need 33 to read Matt 16:15, as it turns out).&lt;br /&gt;&lt;br /&gt;We can continue the same process for all the verses in the bible. For each verse assigning a score based on where in the frequency word list the rarest word occurs. If we order all verses in the bible using this technique we end up with this graph.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_zLASOkQThSA/ShxDjJnXMjI/AAAAAAAAAFc/bUQjdLG6hzA/s1600-h/Picture+2.png"&gt;&lt;img style="cursor:pointer; cursor:hand;width: 400px; height: 288px;" src="http://2.bp.blogspot.com/_zLASOkQThSA/ShxDjJnXMjI/AAAAAAAAAFc/bUQjdLG6hzA/s400/Picture+2.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5340217529310065202" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Initially in the bottom left you need to learn quite a few words before you can read another verse of the bible (it takes 33 to get to verse one, another 2 words for the next verse, then 15 more before you can read number three). Over time though the payoff begins to show. In the middle of the graph, each new word you learn gives you the ability to read between two and three more verses. Then towards the end we end up in the territory of words that only appear once, and so learning them only helps us read one extra verse.&lt;br /&gt;&lt;br /&gt;You can think of this as three zones of learning:&lt;br /&gt;&lt;br /&gt;1. Early undergraduate study. Everything is tough - everything seems like a special case, and nothing joins up.&lt;br /&gt;&lt;br /&gt;2. Late undergraduate study. Suddenly you find yourself reading bits of the NT with minimal help from a lexicon. It is a great feeling.&lt;br /&gt;&lt;br /&gt;3. The point beyond where I am. You can read about 2/3 of the NT without reference, but the words you don't know really are a special case here.&lt;br /&gt;&lt;br /&gt;I expected the graph to have roughly this shape (flat initially, then steepening before flattening again), but I thought that the steep slope would be even steeper. It turns out that even at best (with a vocab around 1000 words) you get less than three new verses for every word you learn. On the positive side, however, the initial flat bit where progress is slow is far smaller than I expected, at this scale it is really only the smallest of ticks. After a couple of hundred words, you are into the most productive zone.&lt;br /&gt;&lt;br /&gt;This is deeply encouraging for students of the language. Just a couple of hundred words and you can do a lot.&lt;br /&gt;&lt;br /&gt;Of course, as I've said several times, this applies to vocab only. At some point I'll do a similar analysis of grammatical features.&lt;br /&gt;&lt;br /&gt;&lt;small&gt;PS: In case you're wondering, the little kinks at 2655 and 3495 words are where the words start to appear twice in the NT and once in the NT respectively. &lt;/small&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-5904827226479456233?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/5904827226479456233/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=5904827226479456233' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/5904827226479456233'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/5904827226479456233'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/05/graph.html' title='A Graph'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_zLASOkQThSA/ShxDjJnXMjI/AAAAAAAAAFc/bUQjdLG6hzA/s72-c/Picture+2.png' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-1137609055669127895</id><published>2009-05-27T00:44:00.000-07:00</published><updated>2009-05-27T00:57:07.048-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='manifesto'/><title type='text'>Types of Math</title><content type='html'>There are a few different types of math that we could do with the NT. If things go well, I might even get round to doing them.&lt;br /&gt;&lt;br /&gt;* Firstly I'm going to be doing mostly counting stuff: frequencies, broad patterns, distributions, and so on. These are easy, because they only involve single quantities. We can look at the distribution of a particular verb form across the NT books, for example, or find the easiest verse to read (as in the previous post).&lt;br /&gt;&lt;br /&gt;* Secondly we could look at correlations. These are data that are derived from more than one quantity, and specifically look at how they vary in combination with one another. Strictly the distribution of verbs across NT books is a correlation measure, but here I'm particularly thinking about more text-text correlations. We could look at how pairs of unusual greek forms appear together, or determine just how similar are the same pericope in different synoptic accounts. &lt;br /&gt;&lt;br /&gt;* Thirdly we can look at models of the texts. This involves determining which of a number of models underlying the texts is more likely. So we could build a model of the synoptics with a hidden source, (Q for example), and look at whether that model is more likely than one with no hidden source. This final step is unlike the previous two because we need to construct and justify our models before we begin. Math can't magically tell us history, but it can help us understand which of several well-defined historical possibilities might be more likely, given a set of explicit (and normally quantified) assumptions.&lt;br /&gt;&lt;br /&gt;So I'm aiming for 3, but for a while all my results will be at level 1. Mostly because I'm just getting use to the text and starting to write the code that I'm using to do these calculations. At some point between 2 or 3 I suspect I'll also need to learn a good deal more textual criticism too, but then I'm in no rush to get anywhere with this.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-1137609055669127895?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/1137609055669127895/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=1137609055669127895' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/1137609055669127895'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/1137609055669127895'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/05/types-of-math.html' title='Types of Math'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-2782574157749131640</id><published>2009-05-26T14:40:00.000-07:00</published><updated>2009-05-26T14:40:00.785-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='learning'/><category scheme='http://www.blogger.com/atom/ns#' term='greek'/><title type='text'>Word Thirty Three</title><content type='html'>Following on from my previous post, I got to thinking about a "mathematically optimal greek course". One that casts aside any concept of pedagogy and optimizes for a hypothetical learning machine. &lt;br /&gt;&lt;br /&gt;We'd want to teach this machine its vocab in roughly decreasing order of frequency. So words that appear in the NT most often are taught first. We know from the last post that after about 200 words, it would know every word that appears 100 or more times. But what does this mean.&lt;br /&gt;&lt;br /&gt;In particular, if we learned words this way, when would we be able to read our first NT verse? And after learning 200 words, how many verses would we be able to read without help?&lt;br /&gt;&lt;br /&gt;We can calculate this very easily. Using our frequency ordered word list we can assign each verse in the bible a score based on the lowest item in the word list. So a verse with no rare words will have a low score, and a verse containing a very rare word will score highly. &lt;br /&gt;&lt;br /&gt;It turns out the easiest verse to read, on this scoring system, is Matt 16:15 '"But what about you?" he asked. "Who do you say I am?"' (λέγει αὐτοῖς ὑμεῖς δὲ τίνα με λέγετε εἶναι) It scores 33. So its most uncommon word (τίνα, from τίς "who") is 33 on the frequency word list. Actually all the other words in the verse are in the top 10, even in English you can see that most of them are very common (the verb λέγω "to say" is at number 9, but is the most common verb in the NT).&lt;br /&gt;&lt;br /&gt;After Matt 16:15 comes 1 Cor 3:23 'as we are to Christ, Christ is to God' (ὑμεῖς δὲ Χριστοῦ Χριστὸς δὲ θεοῦ) with a score of 35. Then there's a small gap before we get several with scores in the 50s and 60s (some are just short, like John 10:30, others are surprisingly complex, like 1 Cor 8:6, with 27 greek words in the verse, but a score of just 50 - coming in joint 3rd overall).&lt;br /&gt;&lt;br /&gt;Now, as in the previous post, I've ignored morphology, which skews these results. You couldn't just teach someone 33 words and have them read Matt 16:15. To get a proper curriculum for the hypothetical greek learning machine, we'd need to include grammar, and that is the subject for another post.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-2782574157749131640?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/2782574157749131640/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=2782574157749131640' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/2782574157749131640'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/2782574157749131640'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/05/word-thirty-three.html' title='Word Thirty Three'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3708464975805257907.post-3083897132722861099</id><published>2009-05-26T12:15:00.000-07:00</published><updated>2009-05-26T12:15:01.012-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='resources'/><category scheme='http://www.blogger.com/atom/ns#' term='greek'/><title type='text'>A Note on Resources</title><content type='html'>In case you're wondering at any point. I'm using the NA27 text, as marked up and published by &lt;a href="http://jtauber.com"&gt;James Tauber&lt;/a&gt; at his now defunct MorphGNT site (the source texts are still available at &lt;a href="http://files.morphgnt.org/ccat-morphgnt/"&gt;the site&lt;/a&gt;, however).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3708464975805257907-3083897132722861099?l=ntmath.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://ntmath.blogspot.com/feeds/3083897132722861099/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3708464975805257907&amp;postID=3083897132722861099' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/3083897132722861099'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3708464975805257907/posts/default/3083897132722861099'/><link rel='alternate' type='text/html' href='http://ntmath.blogspot.com/2009/05/note-on-resources.html' title='A Note on Resources'/><author><name>Ian</name><uri>http://www.blogger.com/profile/01054171196833894625</uri><email>idmillington@gmail.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='05880288900846240754'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry></feed>