Twitturly is giving away $1,000

Tuesday, June 3rd, 2008

So why are we giving away $1,000? Well, we need help making Twitturly even better. We want the results on Twitturly to be grouped together automatically based on their semantics, meaning what they are actually about. To claim the $1,000 prize, you need to be a programmer with the skills to pull this off the majority of the time. It doesn’t need to be perfect, but it should be close.

Here is the problem:

Currently, the number one and number two results are both “goosh.org - the unofficial google shell”.

Goosh @ #1 and #2

Now, you and I can tell that both of these links are going to take you to a site that shows either exactly the same thing or close to the same thing. So what I want to do is make it so that the second result appears underneath the first result as a sub-item.

To help with this programming challenge, I have decided to export the final array (slightly modified to remove proprietary ranking information) and serialize this for you guys to play with. It can be accessed here and put into a variable in your PHP scripts. To get the array how it is supposed to be, you must run PHP’s unserialize function on it. Here is the more readable and prettified version.
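As a rough sketch of that loading step: the snippet below unserializes an export and lists what it contains. The helper name `load_results`, the stand-in data, and the property names `title` and `description` are assumptions made for illustration; only `urlid` is named in this post, so check the prettified version for the real field names.

```php
<?php
/**
 * Turn the serialized export back into a PHP array.
 * load_results() is a hypothetical helper name, not part of Twitturly.
 */
function load_results($serialized)
{
    // unserialize() returns false (and raises a notice) on malformed input,
    // so anything that is not an array is treated as a failure here.
    $results = unserialize($serialized);
    if (!is_array($results)) {
        throw new RuntimeException('Export did not unserialize to an array.');
    }
    return $results;
}

// Stand-in data so the sketch runs on its own. With the real export you
// would read the downloaded file into $export instead, for example with
// file_get_contents() on wherever you saved it.
// Property names other than urlid are assumed.
$export = serialize(array(
    (object) array('urlid' => 101, 'title' => 'First example title', 'description' => 'First example description'),
    (object) array('urlid' => 102, 'title' => 'Second example title', 'description' => 'Second example description'),
));

foreach (load_results($export) as $index => $result) {
    echo $index, ': [', $result->urlid, '] ', $result->title, "\n";
}
```

Only unserialize data you trust; the contest export is fine, but the same call on arbitrary user input is not.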

What I am hoping for is a function that I can send the array to. For each result, the function will look at the title and description to determine whether it is similar to any of the results before it. It only needs to check whether the item is related to items that appear before the current position in the array. When it finds a match, the function should modify that result’s object and add the parent’s urlid as an object value named “parent” (the parent should be the first related result). When the function is done checking, just send back the entire modified array and we’ll do the rest.
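To make the expected shape concrete, here is a minimal sketch of such a function. It is not a contest-quality answer: it compares text with PHP’s built-in `similar_text()`, which measures character overlap and not semantics, so treat the comparison as a placeholder for something smarter. The function names, the 80% threshold, and the `title` and `description` property names are all assumptions.

```php
<?php
/**
 * Placeholder similarity score from 0 to 100 using similar_text().
 * This is character-level matching only; a real entry would swap in
 * a semantic comparison here.
 */
function result_similarity($a, $b)
{
    // Property names title/description are assumed, not confirmed.
    $textA = strtolower($a->title . ' ' . $a->description);
    $textB = strtolower($b->title . ' ' . $b->description);
    similar_text($textA, $textB, $percent);
    return $percent;
}

/**
 * Tag each result with the urlid of the first earlier result it resembles.
 * The threshold is an arbitrary starting point and needs tuning.
 */
function group_similar_results(array $results, $threshold = 80.0)
{
    $results = array_values($results); // make sure keys run 0..n-1

    foreach ($results as $i => $current) {
        // Only look at results that appear before the current position.
        for ($j = 0; $j < $i; $j++) {
            $candidate = $results[$j];

            // Skip results that are already children, so every group
            // points at its first (top-most) member.
            if (isset($candidate->parent)) {
                continue;
            }

            if (result_similarity($current, $candidate) >= $threshold) {
                // Objects are handles in PHP, so this updates the
                // object stored in the array.
                $current->parent = $candidate->urlid;
                break;
            }
        }
    }

    return $results;
}
```

Results with no match are returned without a “parent” value, so the caller can treat anything lacking it as a top-level item.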

In addition to grouping Goosh, there are a few others in the sample array above that should be grouped. For example, #4 and #6 (#3 and #5 if you are looking at the PHP array IDs) should be grouped. #8, #16, and #38 should all be grouped together as well (in this case, #8 should be the “parent” for both #16 and #38).
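If you want a quick way to check your function against the groupings listed above, something like the following could work. It is only a sketch: the function name `find_missed_groupings` is made up, and the table uses zero-based PHP array IDs, so #2 becomes index 1, #6 becomes index 5, and so on.

```php
<?php
// Groupings named in this post, as zero-based array IDs: child => parent.
$expectedParents = array(
    1  => 0, // #2 under #1 (Goosh)
    5  => 3, // #6 under #4
    15 => 7, // #16 under #8
    37 => 7, // #38 under #8
);

/**
 * Return the array IDs of children whose "parent" value is missing or
 * does not match the urlid of the expected parent result.
 * find_missed_groupings() is a hypothetical helper name.
 */
function find_missed_groupings(array $results, array $expectedParents)
{
    $missed = array();

    foreach ($expectedParents as $childIndex => $parentIndex) {
        $child  = $results[$childIndex];
        $wanted = $results[$parentIndex]->urlid;

        if (!isset($child->parent) || $child->parent != $wanted) {
            $missed[] = $childIndex;
        }
    }

    return $missed;
}
```

Run it on the array your grouping function returns; an empty result means all of the groupings listed here were found. It does not catch false positives, so it is worth also eyeballing which other results picked up a parent.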

All we need is the PHP function. Our GUI guy can take care of making it look presentable (Tony, that’s you!).

How to solve this issue:

That’s up to you! If you want to use one of the various APIs online, it should be free or really close to it, since we don’t have the backing to pay for an expensive API. Ideally it will all be done in your PHP code; however, we understand that there may be APIs that we can tap into to help out on this, and we are willing to give them a try if they do a good job. Your solution should look at the semantics of the text because we want all similar items to be grouped together, even if the words are not exactly the same.

About the prize:

After all entries have been reviewed, we will pick the one that returns the best matches most often. Once we have selected a winner (by July 7th, 2008), they’ll get the $1,000. We understand that it’s not $100,000 like some of the challenges from companies with venture capital, but Twitturly has had no financing at all and hey, if you add the decimals at the end, $1,000.00 kind of looks like $100,000! ;-) In addition to the $1,000 that we are giving away, the programmer will also get the warm fuzzy feeling of having helped make Twitturly better for everyone.

When it ends:

This contest expires on July 1st, 2008. The winner will be chosen by July 7th, 2008.

Please post a comment

If you are interested in participating in this contest or you have any questions, please comment below. Those who say that they are going to work on a solution will be emailed instructions on how to submit their code.

Thank you and good luck!

—-

UPDATE (06/19/2008):
Since many people are asking for the URLs to be included, I have attached two more files that can be used. One is an updated serialized results file; the other contains a few PHP functions that can be used to get the data in the new results file into your PHP script.

New Results File: Download
New PHP Functions: Download