[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: We need to be 1,000 times faster.



I was just talking to someone about the Sloan.  The Fermilab computing 
group designed the data acquisition system.  They nearly met the 
schedule.  As a result, they were 3-5 years too early and the system is 
designed around computers so obsolete that they have to be special ordered 
for spares.  They find that they can spend x dollars and get 20% spares for 
the old design.  The same x dollars would buy 1/5 the number of computers 
that would run 5x as fast and do the whole job.  So they are faced with 
buying something slightly different and upgrading software, etc., or buying 
old stuff for spares and replacements.

It seems like it will always be thus in the computer biz.

I say don't worry about it.  We can now just barely do the Mark III 
job.  By the time that the Mark IVs are spitting out all that data, we will 
just barely be able to handle it.  Note that I thought about all this when 
I started this project.  If anything the computer cost for the project had 
decreased relatively from what I expected.  When I started this, I was 
running with 20Mbyte disk drives.

What we don't want to do is to be tied to any particular bit of 
hardware.  Sloan has a lot of special hardware.  We only have the need for 
a single ISA slot.  You can still buy computers with ISA slots (probably 
not for long).  We only need this on the very front end so we should be 
able to make do with old PCs as the telescope controller.

So I say don't be afraid of the size of the problem.  It will get smaller 
with time.  Let's take good data and do what we can to reduce it - at least 
to calibrated star lists.  I think each of us can do that at our sites with 
present technology.  We can then always process some of it to a data 
base.  We just do what we can.  If this data base is useful, pressure will 
grow to process all the data.  A way will be found.  Data in astronomy has 
time value.  You can never again take data for the year 2000.  So if we 
have it in an archive, some of it will be wanted.

The big problem is to figure out how to store the data so that it can be 
processed at a later date.

Tom Droege

At 11:52 AM 8/4/00 -0700, you wrote:
>Some problems with Chris' scheme, though in general
>I agree that distributed database processing is what
>will be needed.  First, a site such as NOFS or Tom's
>6X will produce more data per year than is currently in Michael's
>entire database.  If Michael has problems updating it, then
>at the same level each site will have problems in maintaining
>their own individual database.  Second, while
>FITS tables format is certainly a convenient,
>standard way to keep data available that might be
>needed for queries, such tables are not SQL-accessible
>and I wonder how a centralized server could handle them
>in an efficient manner.  The total database is not that
>much different in size from SDSS.  Let's see what ARNE
>brings to the table.  Perhaps the time really has come
>to get Microsoft, etc. involved.
>Arne