Boy, I haven't had that bad a day as a developer in a LONG time...
Category Software Development
We moved a new Notes app onto the server Friday, and we opened it up for the registration function this morning at around 7 am. Within about 10 minutes, the CPU of the server was pegged at 100%. Not good...
We modified a keyword document to close off the registration, dropped the users from the applications, and everything went back to normal (10% - 20% utilization).
Now, I would be OK with this if I could pinpoint a reason WHY this new app was going crazy. Functionality-wise, it was working fine in development. Obviously, I hadn't tested it under load of a large number of users starting to register. I streamlined a few lookups, changed a few graphics to be image resources, dropped a couple of views that had some level of calculation in a column, and dropped a field (that I didn't need) that was being built as a text list upon saving a document.
We'll try again tomorrow morning with the new changes, as well as with the server and Notes admins monitoring things when we open it back up again.
I'd understand it if I had done some Java agent that was not releasing memory, or if I had agents stuck in a loop. But there is no Java code in there, and there's only one agent scheduled to run once an hour (and I can tell in OpenLog it ran all of one second during our crisis). I'm wondering if the @PickList function used to pick people for your "team" was having fits since the view of eligible people was being added to (new registrations) and deleted from (people chosen for a team) heavily during that initial registration.
Sigh... I don't like being incompetent...
We moved a new Notes app onto the server Friday, and we opened it up for the registration function this morning at around 7 am. Within about 10 minutes, the CPU of the server was pegged at 100%. Not good...
We modified a keyword document to close off the registration, dropped the users from the applications, and everything went back to normal (10% - 20% utilization).
Now, I would be OK with this if I could pinpoint a reason WHY this new app was going crazy. Functionality-wise, it was working fine in development. Obviously, I hadn't tested it under load of a large number of users starting to register. I streamlined a few lookups, changed a few graphics to be image resources, dropped a couple of views that had some level of calculation in a column, and dropped a field (that I didn't need) that was being built as a text list upon saving a document.
We'll try again tomorrow morning with the new changes, as well as with the server and Notes admins monitoring things when we open it back up again.
I'd understand it if I had done some Java agent that was not releasing memory, or if I had agents stuck in a loop. But there is no Java code in there, and there's only one agent scheduled to run once an hour (and I can tell in OpenLog it ran all of one second during our crisis). I'm wondering if the @PickList function used to pick people for your "team" was having fits since the view of eligible people was being added to (new registrations) and deleted from (people chosen for a team) heavily during that initial registration.
Sigh... I don't like being incompetent...





Comments
Posted by Rob McDonagh At 19:20:02 On 05/05/2008 | - Website - |
Posted by Duffbert At 19:31:06 On 05/05/2008 | - Website - |
Posted by Rob McDonagh At 20:30:20 On 05/05/2008 | - Website - |
Posted by Nathan T. Freeman At 04:01:41 On 06/05/2008 | - Website - |
@4... That's not the problem. It's not "registration" as in directory stuff. It's "registration" as in "set a field in a document that says you're now part of this event, and create a logsheet for them."
Posted by Duffbert At 04:20:17 On 06/05/2008 | - Website - |
I had one yesterday where a developer added the refresh property to the form. The users picked up on it immediatly even thought the delay was not real significant. Performance issues are tough to troubleshoot sometimes.
Posted by Curt Stone At 06:52:06 On 06/05/2008 | - Website - |
Posted by Duffbert At 10:01:29 On 06/05/2008 | - Website - |
Posted by Sean Burgess At 11:55:02 On 06/05/2008 | - Website - |
Prob checked for this already but it has bitten me on the arse a couple of times.. an un-intentional LS recursive call, or a recursion that can't find a way out.
Steve
Posted by Steve McDonagh At 14:17:55 On 06/05/2008 | - Website - |