Other projects.

Message boards : Cafe Rosetta : Other projects.

To post messages, you must log in.

AuthorMessage
Bill Swisher
Avatar

Send message
Joined: 10 Jun 13
Posts: 80
Credit: 61,510,726
RAC: 3,340
Message 113049 - Posted: 31 Aug 2025, 19:33:27 UTC

I might as well ask here...

Anybody got an idea about what's going on over at WCG? Seems to be TU.
ID: 113049 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Dr Who Fan
Avatar

Send message
Joined: 28 May 06
Posts: 102
Credit: 289,080
RAC: 8
Message 113050 - Posted: 31 Aug 2025, 19:52:26 UTC - in response to Message 113049.  

I might as well ask here...

Anybody got an idea about what's going on over at WCG? Seems to be TU.


Follow along in the WCG FORUM @ BOINC message boards starting with Message 116748
Grumpy Swede wrote:
New update: https://www.cs.toronto.edu/~juris/jlab/wcg.html (click operational status heading) - August 29.
Also pushed to the BOINC client.

August 29, 2025
Full migration of WCG from the Graham to Nibi cloud facilities will be completed between 3:00-5:00 p.m. on August 31st, 2025
Sharcnet will then power down all hardware at Graham.
We have put in a ticket with UHN Digital to move our DNS records to the new IP addresses we have been allocated in Nibi cloud, and all storage, networking, and compute resources are already provisioned at Nibi.
We continue testing QA and Prod on the new infrastructure.
We will experience some downtime as *.worldcommunitygrid.org URLs switch over. We will be bringing down workunit creation scripting, BOINC server components, and upload/download servers in sequence, halting the database, performing a final rsync and then bringing down the website, forums, and internal services over the next 48h.
In the best case, our DNS records will be switched over on the 31st and everything behind the load balancer will be up and running. However, we want to prepare users for the possibility of additional downtime as we stand up prod on Nibi.(
ID: 113050 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bill Swisher
Avatar

Send message
Joined: 10 Jun 13
Posts: 80
Credit: 61,510,726
RAC: 3,340
Message 113051 - Posted: 31 Aug 2025, 21:16:44 UTC - in response to Message 113050.  

Thanks.

They seem to be overachievers. :-) The site disappeared sometime around Friday. I got lots of jobs stacked up waiting to upload.
ID: 113051 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bill Swisher
Avatar

Send message
Joined: 10 Jun 13
Posts: 80
Credit: 61,510,726
RAC: 3,340
Message 113058 - Posted: 6 Sep 2025, 1:13:58 UTC - in response to Message 113050.  

[Follow along in the WCG FORUM @ BOINC message boards starting with...


Thanks for that link, I've been following it. I can only say the "96 hours" added has come and gone. They might consider adding at least another 96 hours, maybe 192. :-)
ID: 113058 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2451
Credit: 46,464,996
RAC: 3,971
Message 113059 - Posted: 8 Sep 2025, 22:47:22 UTC

Another update today.
Sounds positive - just taking a little longer than I'd like
September 8, 2025
Over the weekend we were able to restore the DB2 databases for the website and forums.
It was a redirected restore that first required a fully containerized instance of DB2 running the same OS as we were in Graham cloud, and we ran into issues attempting the restore of the final backups. Both databases are now successfully restored, and we have moved on to containerizing Websphere and IBM MQ.
We were able to restore the BOINC database.
As part of our work on MAM1 we developed an integration testing environment and containerized the BOINC database.
We also did this for the BOINC server components (scheduler, upload and download servers with file_upload_handler, transitioner/validators/assimilators etc.).
Once we get IBM MQ and Websphere up, we will be able to bring the entire system online shortly afterwards.

ID: 113059 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2451
Credit: 46,464,996
RAC: 3,971
Message 113061 - Posted: 10 Sep 2025, 1:23:55 UTC - in response to Message 113059.  

Another update today.
Sounds positive - just taking a little longer than I'd like
September 8, 2025
Over the weekend we were able to restore the DB2 databases for the website and forums.
It was a redirected restore that first required a fully containerized instance of DB2 running the same OS as we were in Graham cloud, and we ran into issues attempting the restore of the final backups. Both databases are now successfully restored, and we have moved on to containerizing Websphere and IBM MQ.
We were able to restore the BOINC database.
As part of our work on MAM1 we developed an integration testing environment and containerized the BOINC database.
We also did this for the BOINC server components (scheduler, upload and download servers with file_upload_handler, transitioner/validators/assimilators etc.).
Once we get IBM MQ and Websphere up, we will be able to bring the entire system online shortly afterwards.

And another - tantalisingly close
September 9, 2025
We are finalizing IBM MQ <-> DB2 <-> BOINC db <-> website axis, which will allow us to bring up the website. If all goes to plan now - we should have the website up tonight.
Once that is solved - we will go through the BOINC stack to ensure nothing catastrophic will happen when once we let traffic through to the scheduler, upload/download servers. Then we can finally start letting the BOINC daemons manipulate state in the BOINC db.

ID: 113061 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Cafe Rosetta : Other projects.



©2025 University of Washington
https://www.bakerlab.org