MySQL
  Home arrow MySQL arrow Page 7 - Lord Of The Strings Part 2
Dev Articles Forums 
ADO.NET  
Apache  
ASP  
ASP.NET  
C#  
C++  
ColdFusion  
COM/COM+  
Delphi-Kylix  
Design Usability  
Development Cycles  
DHTML  
Embedded Tools  
Flash  
Graphic Design  
HTML  
IIS  
Interviews  
Java  
JavaScript  
MySQL  
Oracle  
Photoshop  
PHP  
Reviews  
Ruby-on-Rails  
SQL  
SQL Server  
Style Sheets  
VB.Net  
Visual Basic  
Web Authoring  
Web Services  
Web Standards  
XML  
Mobile Linux 
App Generation ROI 
IBM® developerWorks 
Sun Developer Network 
Weekly Newsletter
 
Developer Updates  
Free Website Content 
 RSS  Articles
 RSS  Forums
 RSS  All Feeds
Write For Us Get Paid 
Request Media Kit
Contact Us 
Site Map 
Privacy Policy 
Support 
 USERNAME
 
 PASSWORD
 
 
  >>> SIGN UP!  
  Lost Password? 
MYSQL

Lord Of The Strings Part 2
By: Simon White
  • Search For More Articles!
  • Disclaimer
  • Author Terms
  • Rating: 5 stars5 stars5 stars5 stars5 stars / 26
    2004-03-17

    Table of Contents:
  • Lord Of The Strings Part 2
  • The Size of the Problem
  • Discovering String Similarities
  • A Class Called Compare
  • Analyzing the Results
  • The Finnish Line
  • Differences on the Table

  • Rate this Article: Poor Best 
      ADD THIS ARTICLE TO:
      Del.ici.ous Digg
      Blink Simpy
      Google Spurl
      Y! MyWeb Furl
    Email Me Similar Content When Posted
    Add Developer Shed Article Feed To Your Site
    Email Article To Friend
    Print Version Of Article
    PDF Version Of Article
     
     
    ADVERTISEMENT


    Lord Of The Strings Part 2 - Differences on the Table


    (Page 7 of 7 )

    Computing the expected number of hits is the statistical equivalent of a control experiment in the physical sciences - in other words, what would be the outcome if there were no interesting behavior to study? You might think such an exercise to be of little use, as we're pretty sure there is something interesting going on. The point is that we can now compare the actual results that we obtained to the expected results, and look at the differences. The following table orders the languages according to the size of the difference between actual and expected number of hits.

    LanguageExpected HitsActual HitsActual-Expected
    ENGLISH

    19.7

    64

    44.3

    SPANISH

    30.1

    62

    31.9

    HUNGARIAN

    6.2

    18

    11.8

    JAPANESE

    40.3

    52

    11.7

    DANISH

    8.9

    19

    10.1

    SWAHILI

    6.4

    7

    0.6

    SWEDISH

    4.2

    4

    -0.2

    FINNISH

    100.5

    100

    -0.5

    LATIN

    27

    26

    -1

    NORWEGIAN

    21.6

    19

    -2.6

    POLISH

    38.3

    25

    -13.3

    DUTCH

    62.4

    40

    -22.4

    FRENCH

    48.4

    18

    -30.4

    GERMAN

    56

    16

    -40

    Now we're getting somewhere. The languages with differences that are greater than zero may have had an influence on Tolkien. Furthermore, the size of the difference is also an indication of the level of influence. So we're beginning to see that Tolkien's mother tongue of English seems to have had the most profound influence on him. My reaction to this was, at first, one of surprise, then of reassurance. I was surprised because of the apparent dissimilarity of Tolkien's invented words to English, and the fact that the Tolkien words matched only three English words exactly. But I was reassured because Tolkien was, after all, English, and you would expect him to be heavily influenced by his native language.

    Note also the particularly strong result for Hungarian, which received nearly three times as many hits as expected. Finnish performed almost exactly as expected, indicating no appreciable influence on Tolkien, whilst French and German performed well under expectations, perhaps indicating that Tolkien was deliberately avoiding the influences of these languages.

    Conclusions

    When I started this investigation, I had no idea what the result would be. I just clung firmly onto the belief that my string similarity metric, together with a simple algorithm to iterate over the set of possible word pair comparisons, would provide an interesting result. In fact, the results are very satisfying. I found that English had a profound effect on Tolkien's invented languages, with perhaps further influences from Hungarian and Spanish. This is satisfying because it is entirely reasonable (at least the part about English!), though not exactly what I expected after reading about the (apparently unfounded) claims for the influences of Finnish. It is also satisfying because it increases my confidence in the string similarity method. And as developers, we like to have confidence in our methods.


    DISCLAIMER: The content provided in this article is not warranted or guaranteed by Developer Shed, Inc. The content provided is intended for entertainment and/or educational purposes in order to introduce to the reader key ideas, concepts, and/or product reviews. As such it is incumbent upon the reader to employ real-world tactics for security and implementation of best practices. We are not liable for any negative consequences that may result from implementing any information covered in our articles or tutorials. If this is a hardware review, it is not recommended to open and/or modify your hardware.

     

    MYSQL ARTICLES

    - MySQL and BLOBs
    - Two Lessons in ASP and MySQL
    - Lord Of The Strings Part 2
    - Lord Of The Strings Part 1
    - Importing Data into MySQL with Navicat
    - Building a Sustainable Web Site
    - Creating An Online Photo Album with PHP and ...
    - Creating An Online Photo Album with PHP and ...
    - PhpED 3.2 – More Features Than You Can Poke ...
    - Creating An Online Photo Album with PHP and ...
    - Creating An Online Photo Album with PHP and ...
    - Security and Sessions in PHP
    - Setup Your Personal Reminder System Using PHP
    - Create a IP-Country Database Using PERL and ...
    - Developing a Dynamic Document Search in PHP ...






    © 2003-2008 by Developer Shed. All rights reserved. DS Cluster 2 hosted by Hostway
    Stay green...Green IT