The Starburst Challenge

Reply
 
LinkBack Thread Tools
  #1 (permalink)  
Old 07-12-2005
Creating
Hypography Staff Member
Administrator
Editor

Join Date: May 2005
Location: Silver Spring, MD, USA
Posts: 4,492
CraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond repute
Lightbulb The Starburst Challenge

In 1968, in the SF story “Gold at the Starbow's End" (which was expanded in 1983 to become the novel “Starburst”), the late Fred Pohl makes an interesting mathematical conjecture.

With its fictional context parenthesized, here is what I term “the Starburst Conjecture”:
1) A lengthy, meaningful text is written in everyday, understandable English. (the text, presumable several tens of pages long, explains how to solve many pressing engineering problems, including controlled, small-scale fusion power)

2) This text is encoded into a large integer using an easily decodable scheme (“godelization” – see below)

3) This integer is written as an expression using ordinary arithmetic operators. The expression is much shorter than the original text. ( The “Starburst number”, (3.875*12^26)! + 1973^854 +331^852 + 17^2008 + 3^9606 + 2^88 – 78, where “!” is the factoral unary operator, e.g: 5! = 1*2*3*4*5 = 120)

According to accepted information theory, this may be impossible. Roughly stated, information theory says that an arbitrary random integer cannot be “compressed” into an shorter integer enough that the length of the compressed integer + the length of the program required to decompress it back into the original integer are less than the original integer. This is commonly called “the Compression Challenge”, and has withstood decades of attempts to refute it. It’s important to note that the Compression Challenge doesn’t say that no integer can be compressed, just that one with a lot of randomness – and thus, a lot of information, can’t. For example, the integer 10^(10^100), a huge integer, can obviously be described by this very short expression, and, like the Starburst Number, decompressed using a program consisting of just the rules of arithmetic. The integer in the Starburst Conjecture is not an arbitrary random number – the author can carefully compose and tweak the text and encoding scheme that generates it - but neither is it as “easy” number like 10^(10^100).

Here’s my challenge to number-crunching-inclined Hypographers: prove the Starburst Conjecture by providing an example of any meaningful English text, encoding scheme, and expression satisfying the rules. Strange spacing, punctuation, capitalization, and spelling is OK. All that’s required to win is that the final expression be many times shorter than the original text. Of course, a formal proof or disproof wins, too.

PS:
1) Nobody claims that Pohl’s “Starburst number” can really be expanded into a document containing the secret of controlled fusion, or any readable text – it’s fictional, intended for illustration purposes only. This is not a challenge to discover any hidden information in the number (3.875*12^26)! + 1973^854 +331^852 + 17^2008 + 3^9606 + 2^88 - 78.

2) “Godelization” is a symbol-to-integer encoding scheme. A strength is that it can be used with any number of symbols, which you don’t have to know in advance. A weakness is that it quickly generates very large integers. I neither recommend not don’t recommend using it for this challenge, but include it in this PS to provide background on Pohl’s original example.

It works as follows:
1. Beginning with 1, assign a number to each symbol used. For example, 1=SPACE, 2=A … 27=Z, 28=PERIOD is a valid assignment. Any order can be used, and new symbols may be added as needed.
2. Initialize G = 1
3. For the first symbol, multiply G by the first prime raised to the power of the symbol (G=G*2^symbol).
4. Repeat for the 2nd symbol and prime, and so on, until all symbols in the message have been encoded.
Example: “HELLO WORLD” encodes to 2^9*3^6*5^13*7^13*11^16*13*17^24*19^16*23^19*29^13 *31^5 = 16201674618135353117026999186798460736594220862673 25990498526101998386676287418792874442737803105156 98642043413401302718344893318901988199466356250000 00000
Reply With Quote
  #2 (permalink)  
Old 07-13-2005
Turtle's Avatar
Pasquinader
Latest blog: Meh
Platinum Subscription
Sponsor
Arrow Re: The Starburst Challenge

___The challenge is dealt & I meet the small blind. Now I need to firm up a few points before the flop. (Forgive the obscure poke references; it's late).
___First if we may, I call for a set message length in characters, rather than the general "tens of pages". A double space page is rougly 250 words & say an average word is 5 letters = 1250 characters. Since spaces count, add 250 for 1500 characters per page. Now let's use 20 pages for a message length of 30,000 characters for encryption.
___Sooooo tired...Stopping here for the flop.
__________________
Nemo me impune lacesset. ~Unattested
Reply With Quote
  #3 (permalink)  
Old 07-13-2005
Creating
Hypography Staff Member
Administrator
Editor

Join Date: May 2005
Location: Silver Spring, MD, USA
Posts: 4,492
CraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond repute
Post The Starburst Challenge - clarification

Quote:
Originally Posted by CraigD
Here’s my challenge to number-crunching-inclined Hypographers: prove the Starburst Conjecture by providing an example of any meaningful English text, encoding scheme, and expression satisfying the rules. Strange spacing, punctuation, capitalization, and spelling is OK. All that’s required to win is that the final expression be many times shorter than the original text. Of course, a formal proof or disproof wins, too.
Quote:
Originally Posted by Turtle
First if we may, I call for a set message length in characters, rather than the general "tens of pages".
Please feel free to use a starting text of any length, the only restriction being that it is something that a reasonable English-speaking reader would recognize as meaningful. Let’s take that to imply a minimum length of about 250 characters.

Let “the final expression be many times shorter than the original text” be taken to mean “the final expression be less than 1/5 the length of the original text”, but let’s be flexible with this requirement. I’d be excited to see any result where the final expression is even about the same size as the starting text.

An important freedom is that the starting text may be altered – spaces or nonsense characters inserted, words strangely capitalized, slightly misspelled, etc. – to yield an encoded integer better suited for reduction to a shorter expression.

I suspect this is a very hard challenge, so let’s impose a minimum of rules.
Reply With Quote
  #4 (permalink)  
Old 07-13-2005
Turtle's Avatar
Pasquinader
Latest blog: Meh
Platinum Subscription
Sponsor
Thumbs up Re: The Starburst Challenge

___ Just rying to get my shell around it. Do I properly understand that besides the 1/5 which is the compressed message, you then add the characters in the encryption program too?
___Another question I have; how important is the difference here between the information in the message & the message itself?
___I seem to recall an anecdote of similar circumstance to this; a professor assigned a hard problem in compression to a bunch of young students who didn't know it was a hard problem. As it turned out, one of these "naive" young guys produced a solution by using a binary probability tree, thus solving a long standing problem.
___I'm all in!
__________________
Nemo me impune lacesset. ~Unattested
Reply With Quote
  #5 (permalink)  
Old 07-13-2005
Creating
Hypography Staff Member
Administrator
Editor

Join Date: May 2005
Location: Silver Spring, MD, USA
Posts: 4,492
CraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond reputeCraigD has a reputation beyond repute
Lightbulb Re: The Starburst Challenge

Don’t worry about the size of the encoder/decoder program – just assume it’s trivially small compared to the original text. Just don’t cheat by having a “decoder” that can produce the starting message with little or no input.

The difference between the common-sense information in the starting message and its formal information (entropy) is unimportant. The message can contain lots of extra “noise” information, as long as a reasonable reader can distinguish the noise from the “signal”. For example: “Mary had a little lamb”; “mARy haD a LiTtLe lamB”; and “mary … had …… a litle … lamb” are all legitimate.
Reply With Quote
  #6 (permalink)  
Old 07-13-2005
Turtle's Avatar
Pasquinader
Latest blog: Meh
Platinum Subscription
Sponsor
Thumbs up Re: The Starburst Challenge

___Acknowledged. To the bat cave! Er... to the turtle cave.
__________________
Nemo me impune lacesset. ~Unattested
Reply With Quote
  #7 (permalink)  
Old 07-13-2005
Turtle's Avatar
Pasquinader
Latest blog: Meh
Platinum Subscription
Sponsor
Arrow Re: The Starburst Challenge

___In rereading the original description, "It’s important to note that the Compression Challenge doesn’t say that no integer can be compressed, just that one with a lot of randomness – and thus, a lot of information, can’t"; I have to say I agree.
___For example, I believe it's impossible to compress anomalous Strange Numbers & their ilk.
___In any case the written numerals , which we casually refer to as numbers, themselves constitute a compression algorithm for representing amount. Staying with position based systems using zero, the higher the base you use, the fewer positions required to encode the amount. Conversley, the lower the base, the more positions required.
___Just thinking out loud
__________________
Nemo me impune lacesset. ~Unattested
Reply With Quote
  #8 (permalink)  
Old 02-09-2006
Turtle's Avatar
Pasquinader
Latest blog: Meh
Platinum Subscription
Sponsor
Lightbulb Re: The Starburst Challenge

Quote:
Originally Posted by CraigD
In 1968, in the SF story “Gold at the Starbow's End" (which was expanded in 1983 to become the novel “Starburst”), the late Fred Pohl makes an interesting mathematical conjecture.


Here’s my challenge to number-crunching-inclined Hypographers: prove the Starburst Conjecture by providing an example of any meaningful English text, encoding scheme, and expression satisfying the rules. Strange spacing, punctuation, capitalization, and spelling is OK. All that’s required to win is that the final expression be many times shorter than the original text. Of course, a formal proof or disproof wins, too.
___I saw this thread mentioned elsewhere & thought to revisit it. I have in mind something Katabatak. Cogitating.
__________________
Nemo me impune lacesset. ~Unattested
Reply With Quote
  #9 (permalink)  
Old 02-05-2007
Turtle's Avatar
Pasquinader
Latest blog: Meh
Platinum Subscription
Sponsor
Arrow Re: The Starburst Challenge

Something new in this perhaps, by way of a fractal compression scheme developed -as best I can tell - by Prof. Ian Stewert. I heard it described in a recent PBS show entitled Colours of Infinity, hosted by Arthur C. Clarke. In short, a given image (your information) is subjected to an analysis that gives a fractal equation specific to that image. Then, the equation is given back the initial conditions and it generates missing detail. For a spacecraft for example, this means less data is required without loss of detail and so image compression. Or so they said...
Here is a link to Ian's homepage, and a Google of 'Prof. Ian Stewart' turns up many resources to his work.
Ian Stewart Home Page v 2.1
__________________
Nemo me impune lacesset. ~Unattested
Reply With Quote
Reply

Bookmarks


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On

Similar Threads
Thread Thread Starter Forum Replies Last Post
1,000 post challenge Tormod Competitions 168 10-31-2005
TM, I challenge YOU! GAHD Chess Forum 22 04-19-2005
Bandwidth speed challenge record broken again and is now set at 101 gigabytes/s alexander Computer Science 4 12-06-2004
How many of you can put 2&2 together? This is an all out challenge! Sharky Physics and Mathematics 19 08-13-2003

» Current Poll
Favorite James Bond?
Sean Connery - 63.64%
7 Votes
George Lazenby - 0%
0 Votes
David Niven - 9.09%
1 Vote
Roger Moore - 9.09%
1 Vote
Timothy Dalton - 9.09%
1 Vote
Pierce Brosnan - 0%
0 Votes
Daniel Craig - 9.09%
1 Vote
Hate 'em all - 0%
0 Votes
Who's James Bond? - 0%
0 Votes
Total Votes: 11
You may not vote on this poll.

All times are GMT -8. The time now is 03:28 PM.


Powered by vBulletin® Version 3.7.2
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
SEO by vBSEO 3.2.0 ©2008, Crawlability, Inc.
Copyright © 2000-2008 Hypography
Part of the Hypography - Science for Everyone Network