MPC Status Page: Archive (2007 January-June)
This page describes enhancements to or problems that have occurred with the MPC and scripts and the fixes that have been made.Recent problems are listed elsewhere. Index of other older problems..
Older Enhancements and Resolved Problems
- Special-epoch elements for NEAs
2007 June 29: 10:35. A number of NEAs are in the special daily-epoch elements files under both their numbered and unnumbered designations. A problem that prevented the removal of certain old sets of elements has been identified and fixed. The files are being regenerated. - Numbered elements in MPES
2007 June 24: 16:35. A problem that caused the elements of numbered minor planets to be unavailable in the MPES since noon was detected and fixed. - CF mail server outage (June 20)
2007 June 15: 15:55. We have been informed that the CF's SMTP server will be rebooted at 08:00 EDT on Wednesday, June 20. During the expected 30-minute outage, incoming mail (e.g., to mpc@cfa) will not be received, but the mail should be resent by the remote systems when the SMTP server is restarted. - Anonymous ftp server downtime (June 1)
2007 May 29: 12:35. We have been informed that the CF's anonymous ftp server will be offline for approximately 30 minutes starting at 09:00 EDT on June 1. Access to files stored there will obviously not be possible during this period. - Magnitudes in MPES and MP/NEOChecker
2007 May 19: 22:00. A user reported a difference in the predicted magnitude of a numbered object as displayed in the MPES and in the MP/NEOChecker services. A fix for this discrepancy has been put in place. - MPC Cluster problem
2007 May 7: 10:30. It seems that one of the machines in the cluster rebooted itself earlier this morning and did not come back up. This machine stores the quick-access indexed observation and orbit files used internally and by some of the web services. The machine will be rebooted as soon as possible.- 11:30. The affected machine has been restarted.
- Date of Last Observation of numbered objects in MPES
2007 May 5: 17:47. A problem with the generation of a datafile used by the MPES is causing the dates of last observation of certain newly-numbered objects not to be displayed by the MPES. The MPES did not expect this situation and would stack dump if an affected object was selected. The code has been tighten to prevent the stack dump and output a meaningful informational message. The problem datafile has been regenerated, recent observations will be reflected in this file following the next DOU MPEC. - Sky Coverage plots
2007 May 5: 08:30. A problem related to both the retirement of the old webserver and incorrect directory access permissions on the new webserver caused the script that updated the main Sky Coverage page to not copy newly-generated versions to the webserver. This has been fixed. - Sluggish response of webserver overnight
2007 Apr. 23: 08:15. Two separate events overnight caused the response of the webserver to be extremely sluggish (to the point where requests would timeout). The first was a result of a disk being found to be missing to be missing and its reinsertion. The second was a runaway cgi script that clogged up the webserver. We are checking to see why the process that kills runaway cgi scripts was not triggered.- 13:00. The process that kills runaway cgi scripts was restarted. The execution batch queue on which it ran originally disappeared when the old webserver was shutdown. It has been restarted on a execution batch queue on the new webserver.
- Moving of MPC/CBAT/ICQ webpages II
2007 Apr. 19: 12:30. The transfer of the MPC/CBAT/ICQ webpages to a new location will begin at 14:00 today. During the move, our pages should remain available, but certain files may revert to older copies when the new alias is activated. We do not expect to update any web pages during the move. Cgi scripts served from the VMS webserver will continue to be available as normal.- 18:30. The webpages have been moved and our internal procedures updated.
- MPC Cluster problems
2007 Apr. 14: 23:20. It seems that one of the machines in the cluster rebooted itself earlier this evening and has not come up cleanly. For reasons that are not yet understood, this is causing problems across the cluster, including the webserver. We are investigating...- 23:36. It seems the problem is that the quick-access indexed observation and orbit files are stored on the machine that has not rebooted. The unavailability of the latter files does impact extraction of orbits and generation of ephemerides in the webserver.
- Apr. 15: 00:30. We have abandoned tonight's DOU MPEC, postponed the mid-month MPS batch and paused the automated processing procedures. The machine will be rebooted in the late morning.
- 10:30. The affected machine has been rebooted. Normalcy should be returning.
- Upcoming Network Maintenance (Apr. 29)
2007 Apr. 13: 14:15. We have been informed that Harvard will be performing hardware upgrades on one of its gateway machines on Sunday, April 29. The work will begin at 01:00 EDT and should be completed by 07:00. During this maintenance window there will be a loss of connectivity into and out of the observatory. - Moving of MPC/CBAT/ICQ webpages
2007 Apr. 13: 10:40. We have been informed that the Computation Facility needs to move the MPC/CBAT/ICQ webpages to a new location. This move will take place sometime in the week April 16-20. During the move, our pages should remain available, but certain files may revert to older copies when the new alias is activated. Cgi scripts served from the VMS webserver will continue to be available as normal. - "File locked" messages from web scripts
2007 Apr. 6: 14:10. To try and remove the "File locked" messages some users have reported (a problem we've been unable to reproduce internally) a small change to a webserver configuration file has been made and the webserver has been restarted. Use the feedback form to report any occurrences of the "File locked" message. - MPC Webserver machine
2007 Apr. 4: 12:30. We are shifting the MPC webserver on to a different machine. When the new webserver is set up, we will shutdown the current webserver. The name "scully" will be retained as an alias for the new webserver, so that no changes to web forms are necessary. There will be a brief period sometime in the next few days when we disable the current webserver (including AUTOACK) to allow us to do a clean copy of data to the new webserver. This may be done with little warning.- Apr. 5: 11:38. The cgi scripts that run on the current webserver, as well as the AUTOACK service, are being paused to allow the new webserver to be configured. Copying of the data from current to new webserver, followed by testing of new webserver, may take a few hours.
- 14:39. The files have been shifted, access to the webserver has been enabled on the network, but the required aliasing of cgi.minorplanetcenter.net to the new machine has not yet been made. Access to the cgi scripts will be possible after this aliasing has occurred. AUTOACK should be working normally.
- 15:57. Correction: AUTOACK isn't working as the new machine is not currently allowed to receive SMTP mail. We have been informed that the request for the alias has been made, but it may be tomorrow morning before the change is made.
- 17:00. The alias is now visible externally, but not internally.
- 17:30. The alias is now visible internally.
- 18:09. The new machine can now receive SMTP mail, but the changes needed to allow the mpc@cfa alias to forward mail to the correct machine do not seem to have been made (i.e., delivery is still going to Scully). Until this is fixed we will forward observation batches manually to the AUTOACK routine.
- 20:27. From external sites, the web services appear to be accessible again. AUTOACK also seems to be working.
- MPC Webserver machine
2007 Apr. 1: 11:00. It appears that the MPC webserver machine hung again around 21:06 last night. The machine will be rebooted as soon as possible.- 14:57. Webserver has been rebooted.
- 20:00. Webserver has hung again.
- Apr. 2: 09:52. Webserver rebooted.
- Apr. 3: 21:40. Webserver has hung again.
- Apr. 4: 10:35. Webserver rebooted.
- cgi scripts on CF machines
2007 Mar. 30: 12:30. Effective immediately, all cgi scripts used by MPC/CBAT that run on CF machines are disabled. This includes the "display IAUC" script and the "show all designations" option in the MPES. Replacements will be worked on, but it may take some time to replace them all. - MPC Webserver machine
2007 Mar. 30: 11:22. It appears that the MPC webserver machine hung again around 00:26. The machine is being rebooted. - MPC Webserver machine
2007 Mar. 29: 17:48. It appears that the MPC webserver machine hung again around 15:30. The machine has been rebooted. - MPES bug fixes
2007 Mar. 29: 00.15. A user reported some non-documentation compliant behavior in the MPES. One problem has been fixed: the non-acceptance of valid dates such as "2007 March 28". The range of allowable east longitudes was extended to the range +/- 360 degrees, rather than the previously allowed 0-360 degrees. - New Feedback Form
2007 Mar. 23: 15:20. A new version of the feedback form is now on-line. Pages that reference the old version are gradually being changed to use the new version. At some point in the near future the old version will be removed. - CMTChecker and NEOChecker
2007 Mar. 14: 16:00. Two new services that use the MPChecker script to check for just comets and NEOs are now on-line. - MPC Webserver machine
2007 Mar. 14: 13:55. It appears that the MPC webserver machine hung again around 11:20. It is being rebooted.- 14:03. Turns out not to be a problem with SCULLY. The AUTOACK procedure log file had reached its maximum version number. A fix will be put in place to ensure this doesn't happen again.
- DNS Issues
2007 Mar. 8: 18:13. It seems that there have been intermittent failures of DNS over the past 24-36 hours at the CfA. This impacted issuance of a CBET last night and the issuance of an MPEC this morning. Both circulars have been sent out manually. We have received no word on whether this is going to continue. - MPC Webserver machine
2007 Mar. 8: 13:59. It appears that the MPC webserver machine hung again around 10:43. It is being rebooted.- 14:16. The webserver machine has been rebooted.
- CfA Mail Gateway Down Time
2007 Mar. 6: 11:40. We have just been informed that the CfA mail gateway machine will be shutdown at 06:00 EST tomorrow (March 7) in order to install "some important OS patches". The outage is expected to last 40-60 minutes. During this time, incoming mail routed via the mail gateway will not be received, but should queue up on the sending computer for subsequent resend attempts once the gateway is back on-line. - MPC Webserver machine
2007 Mar. 6: 10:03. It appears that the MPC webserver machine hung again around 05:45. The machine has been rebooted. - MPC Webserver machine and tonight's DOU MPEC
2007 Feb. 28: 23:25. It appears that the MPC webserver machine is hanging again. The machine will be checked out tomorrow morning. We are hoping that the machine will reboot itself, so that we get a crash dump log to help diagnose where the problem lies. Tonight's DOU MPEC is being canceled.- Mar. 1: 10:05. The webserver machine didn't reboot itself, so it has been rebooted manually.
- 18:05. The webserver machine was taken down for a few moments to check a hardware item at the request of the service organization. The machine has been restarted.
- OS patch installation
2007 Feb. 27: 14:36. We are planning on installing a number of OS patches (none security related) on the cluster machines starting tomorrow. Each machine will be patched and rebooted at a time that is most convenient for that machine. When the webserver machine is patched, web services will be off-line for the duration of the installation (probably under 30 mins, assuming no surprises). All the machines should be patched by March 4.- 16:48. The webserver machine is currently being patched. It will be rebooted in a few minutes. Intermittent web problems may occur while other machines are being patched over the next few days.
- Feb. 28: 12:29. The patching of the cluster machines has been completed.
- MPC Webserver machine
2007 Feb. 12: 21:25. It appears that the MPC webserver machine is hanging again.- Feb. 13: 10:09. Webserver machine restarted. Restart of ACK procedure complicated by submission of e-mail not formatted properly: this caused multiple ACKs to be sent out. Normalcy has resumed.
- MPC Webserver machine
2007 Feb. 11: 19:48. It appears that the MPC webserver machine is hanging again.- Feb. 12: 08:41. Heading in shortly to fix problem.
- 10:49. Webserver machine rebooted. Backlog of e-mail will be processed as soon as CF mail node retries delivery.
- Upcoming Network Maintenance (Feb. 6)
2007 Jan. 29: 15:01. We have just been informed that Harvard will be performing maintenance on another of its core routers on Tuesday, February 6. The work will begin at 04:00 EST and should be completed by 06:00. During this maintenance window there may be intermittent losses of connectivity into and out of the observatory. - Upcoming Network Maintenance (Jan. 30)
2007 Jan. 29: 14:43. We have just been informed that Harvard will be performing maintenance on one of its core routers on Tuesday, January 30. The work will begin at 04:30 EST and should be completed by 06:00. During this maintenance window there may intermittent losses of connectivity into and out of the observatory. - MPC Webserver machine
2007 Jan. 26: 09:20. It appears that the MPC webserver machine is hanging again. A system configuration modification put in place after the last hang, which we hoped would solve the hanging problem, doesn't seem to be working. Heading into office shortly to fix problem.- 11:00. The webserver machine was rebooted.
- 17:30. The webserver machine was rebooted again.
- Lists of NEOs/TNOs/etc.
2007 Jan. 25: 18:03. The lists of unusual objects of various kinds are off-line until a new version of the program that prepares said lists is put on-line. - MPC Webserver machine and tonight's DOU MPEC
2007 Jan. 21: 15:03. It appears that the MPC webserver machine is hanging again. Heading into office shortly to fix problem.- Jan. 21: 16:00. Scully has been rebooted.
- Jan. 21: 22:40. Hanging again. Reboot has been postponed until tomorrow morning, as there is no guarantee that a fix now will last until the morning. Tonight's DOU MPEC has been abandoned.
- Jan. 22: 09:52. Scully apparently rebooted itself around 03:20. Queues were just restarted.
- MPC Webserver machine
2007 Jan. 13: 23:03. It appears that the MPC webserver machine is hanging again. Heading into office to fix problem.- Jan. 14: 00:03. Scully was rebooted.
- MPCORB datafile
2007 Jan. 7: 09:48. A large number of elements are missing from the MPCORB files on the ftp site. Examination of the overnight DOU MPEC logfile shows errors on the ftp server side: apparently the disk used to store the directory /pool (used as an intermediate storgae location while transferring and appending files) filled up. The next update of MPCORB should contain all the elements as normal.