17 Commits

Author SHA1 Message Date
4045473e65 Gzip forums.tar to save space and time. Remove verbose output to save time. 2020-08-16 18:51:29 -05:00
161dd19d36 Increas spacing between board elements & increase padding of sub-boards 2020-08-16 18:50:59 -05:00
f709ce4810 split archiving wiki and forum into separate commands, only run the forum one by default for now 2020-08-16 04:54:27 -05:00
bf4a5f2b5d Order categories and boards correctly; correctly organize them by nesting level 2020-08-16 04:28:07 -05:00
dc0191a04a Rudimentary support for building wiki archives. The content is dumped to html but the wikitext isn't parsed yet.
mwparserfromhell is used for parsing wikitext but it has no support for rendering to HTML so we'll have to build it manually.
2020-08-11 10:44:06 -05:00
3cb08e2d2f Add bbcode library and some bbcode parsing. SMF's bbcode is somewhat different to what this library expects so some of it does not come out correctly. 2020-08-10 02:00:00 -05:00
0116646dd9 Add script to deploy the generated html archive to a server. 2020-08-10 01:59:48 -05:00
f3b0319d5a Add rudimentary script for generating the forum archive. 2020-08-10 01:21:33 -05:00
7026bc2b34 Add mysql2sqlite submodule and script for making a sqlite database out of the mysql dumps. This will be used to create html archives of the forum. 2020-08-09 04:01:19 -05:00
51e9782c7e refine email regex to not include whitespace 2020-08-04 10:10:48 -05:00
73dae54cf1 Filter out sensitive information (emails, IP addressses) from messages table. 2020-08-04 02:29:42 -05:00
eac65f6bda Add dump_forum_data script: This connects to a GCL database and creates SQL dumps that can be distributed publicly.
Specifically, it creates these scripts:
- structure.sql: Database structure for all tables.
- misc_data.sql: Misc. data that may be of interest, including legacy GCLF features such as the shop and the arcade, and The Orange Glove content.
- categories.sql: Category data for public categories.
- boards.sql: Board data for public boards.
- threads.sql: Thread (topic) and message data. This is the meat of the forums.

What's NOT included:
- Personal/Private messages
- Member accounts
- Administrative stuff like error logs
- Private categories and boards
- Garbage category (Lab Omega)
2020-08-03 05:02:43 -05:00
d4002ec592 Add additional categories for gens 1 - 4. 2020-07-30 04:16:26 -05:00
191e1ebd13 Add find_pages script to determine through the MediaWiki API which pages need to be looked at. 2020-07-30 03:46:02 -05:00
1a2dbbe65b Fix up PM export script 2020-07-27 02:57:19 -05:00
6716f274db Initial stab at PM export script. It just prints out all PMs to/from current logged-in user. 2020-07-27 01:48:59 -05:00
90887491c7 add README 2020-07-26 22:36:12 -05:00