Eureqa II, aka Formulize

Some­how in the last few months I’d missed the seri­ous update of Michael Schmidt’s Eureqa pack­age for sym­bolic regres­sion. Now avail­able for Win­dows, Linux and Mac plat­forms, and fea­tur­ing a very nice cloud inte­gra­tion for addi­tional pro­cess­ing, this looks like a ground-breaker for usabil­ity and under­stand­abil­ity in Sym­bolic Regres­sion applications.

Here’s an intro­duc­tion video. Michael does a much bet­ter job explain­ing it suc­cinctly than I’ve seen before:

Inter­est­ingly, I stum­bled across Michael’s update because I was writ­ing to Cosma Shal­izi, fish­ing for feed­back from the Very Impor­tant Thinkers attend­ing the Ockham’s Razor Work­shop regard­ing (mod­ern, multi-objective) sym­bolic regres­sion of the style Michael’s project embod­ies. A lot of these same notions of Pareto Sym­bolic Regres­sion were devel­oped orig­i­nally by Mark Kotanchek, Katya Vladislavl­eva and Guido Smits from Dow Chem­i­cal and Evolved Ana­lyt­ics, recently released in Data­Mod­eler for Math­e­mat­ica.

It’s fas­ci­nat­ing to me how lit­tle pro­fes­sional atten­tion crosses in either direc­tion between the machine learning/statistical the­ory folks and genetic pro­gram­ming folks. Impor­tant work in both fields is essen­tially invis­i­ble across that divide. As the years go by the dichotomy is get­ting some­how deeper, to the point where I expect they’re just going to run into one another headed the other way, when they both cir­cum­nav­i­gate the net­work of All Pos­si­ble Approaches to Sci­ence and Engi­neer­ing in their rush apart….

Note, as clar­i­fi­ca­tion: I am not includ­ing any chap­ters on sym­bolic regres­sion in the book. Sym­bolic regres­sion is an amaz­ing and rapidly matur­ing field, and I count it pretty much “done” with the release of Data­Mod­eler and For­mulize. From here on, it’s a field in its own right, not least because the tricks and tech­niques use­ful in address­ing quan­ti­ta­tive mod­el­ing projects like these are quite dif­fer­ent from the qual­i­ta­tive and struc­tural mod­el­ing projects we’re doing in the book.

Still, down­load and play with Eure­qua II, and see what you can understand.

4 thoughts on “Eureqa II, aka Formulize

  1. Of course I’ll dis­agree that sym­bolic regres­sion is “pretty much done” — there are quite a few years of effort in the Data­Mod­eler devel­op­ment pipeline already laid out … assum­ing we don’t have any other bright ideas in the meantime.

    I’ll also point out that we have fully-capable free trial ver­sions of Data­Mod­eler for those inter­ested in putting it through its paces.

  2. Thank you very much for the tip,
    wasn’t acquainted with the pro­gram
    and now that I down­loaded it, I find it pretty use­ful
    for sim­ple exploratory analy­sis and lit­tle pre­dic­tion prob­lems
    even if I have now idea how to model them (but when you do, it also can use that knowl­edge quite nicely).

  3. Hi
    I am a M.S stu­dent and I choose sym­bolic regres­sion via GP for my project, but unfor­tu­nately I can’t find good source espe­cially about the SR and it’s impor­tance and fea­tures and … about it. I really need help. please help me and intro­duce good sources.
    thanks

  4. Samira, you should prob­a­bly look at ECJ for an open-source GP project with a focus on sym­bolic regression.

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>