50 Commits

Author SHA1 Message Date
eff22ff325 Bolden names of successor websites. 2020-09-20 05:48:15 -05:00
6ab90d5fff Add link to GCW on the archive index page. 2020-09-20 05:47:35 -05:00
d98354046b Prefix image paths with images subdirectory (which does not yet exist) 2020-09-20 05:45:49 -05:00
f1ae73b737 Correctly parse out fragment from html archive link and append it in the correct place. 2020-09-20 05:40:57 -05:00
d062ca6787 Add anchors to headings. 2020-09-20 05:31:55 -05:00
a9adf51453 Add GCW link to archived wiki pages, for the live version of said archived article. 2020-09-20 05:28:37 -05:00
7157757d43 redirector: handle case where thread id can have an .html extension, because of course it can... 2020-09-03 04:12:54 -05:00
5859ee0408 Add notice to the top of archived pages that this is an archive. 2020-09-03 03:34:22 -05:00
b33ea016d5 convert all f-strings to old-style format strings for python 3.5 compatibility (since that is what's available on the server) 2020-09-01 02:46:37 -05:00
ab29250b74 add "added submissions" boards from Submit-A-Glitch to exclusion filter 2020-09-01 02:25:32 -05:00
167a03be3c fix syntax error 2020-09-01 01:31:39 -05:00
f65361e06b Implement forums redirector and default archive urls (since it's unlikely these will be changed). Also add redirectors for the index/main pages since those don't get handled by default. 2020-08-30 16:51:55 -05:00
c37cf4fc44 Implement thread index for mapping thread ids back to board ids, for use with the redirector.
The archive domain (archives.glitchcity.info) will host this file and the redirector will pull and unpack it when it starts up.
2020-08-30 16:50:21 -05:00
ef3f3dd60c Add pagination to forums archives. 2020-08-28 14:19:09 -05:00
0e3f1274cc Exclude links and submit-a-glitch archives from the archive. 2020-08-28 02:29:30 -05:00
1b7e3ce08b Since the wiki linker code is modularized the redirector can just import it 2020-08-27 02:24:24 -05:00
646b840be4 Use '+' as the substitution for '/' rather than '%2F' since nginx seems to want to normalize the %2F back into / which would defeat the purpose. 2020-08-27 02:11:56 -05:00
a382e6d4fd Extract more archive generator specific functionality from linker (e.g. the .html suffix, / -> %2F replacement) 2020-08-27 02:02:43 -05:00
ade44491d4 Extract archive generator specific linker functionality into ArchiveLinker subclass. 2020-08-27 01:52:17 -05:00
2e73ecd59f Begin work on redirector webapp. The logic for redirecting wiki pages -seems- consistent with how the archives are generated but... nginx is normalizing the %2Fs into slashes and therefore not able to access any files with that escape sequence (e.g. Gold%2FSilver). Might need to find another character to escape / with.
+ might work, it's semantically acceptable (e.g. "Gold+Silver" in place of "Gold/Silver") although this character is sometimes interpreted equivalently to a space. Regardless, nginx seems to be happy with it so might go with it.

May also need to test on a web host e.g. my old tripod account to see if assumptions hold up there too.
2020-08-24 00:24:45 -05:00
2aa1a7cf47 improved handling of categories and refactored Linker stuff into a class, also use ./ as default base path so Category: links will work correctly 2020-08-23 06:40:23 -05:00
77b160a35b Add all dex types and make wiki render aware of them, so that dex subpages will be rendered as GlitchDex/RB:000 while phrases like G/S will be rendered as G%2FS.
This unfortunately means the redirector will need to have special logic to be aware of dexes too, but it could probably be as simple as "if directory name contains 'Dex' then treat it as a directory name"
2020-08-23 06:02:28 -05:00
ea7e1294b1 Escape / with %2F in wiki URLs. For now this includes things which actually should be subdirectories, such as the various dexes. 2020-08-23 05:01:11 -05:00
df25b09eb7 improvements to template substitution, begin parsing out and dumping category links 2020-08-23 04:48:38 -05:00
43a36ba730 (Attempt to) strip out noinclude tags. 2020-08-21 02:52:43 -05:00
d19ca39838 Add a (very rudimentary) template transcluder. Some work will have to be done before it is ready. 2020-08-21 02:39:50 -05:00
38cdb811b0 Specify that charset is UTF-8. On the server side we are specifying the charset as UTF-8 so the encoding is already correct on the archives.glitchcity.info site, but the intent is for this to be portable to any web host. 2020-08-17 01:57:27 -05:00
54fa852897 Implement rudimentary wikitext renderer and buff deploy_archives script to be able to deploy the wiki archive, as well as sql/sqlite/xml data dumps. 2020-08-17 01:51:09 -05:00
0b1320a9da Add archives index page, and deploy wiki archives. 2020-08-16 21:11:59 -05:00
6f05f05410 Re-enable archival of wiki by default. 2020-08-16 18:53:10 -05:00
31bdd806ad Allow "forums" as a synonym of "forum" 2020-08-16 18:52:45 -05:00
943563a8db Use new field names. 2020-08-16 18:52:32 -05:00
23f4789599 Fix broken utf-8 encoding and unescape html entities. 2020-08-16 18:52:05 -05:00
4045473e65 Gzip forums.tar to save space and time. Remove verbose output to save time. 2020-08-16 18:51:29 -05:00
161dd19d36 Increase spacing between board elements & increase padding of sub-boards 2020-08-16 18:50:59 -05:00
f709ce4810 split archiving wiki and forum into separate commands, only run the forum one by default for now 2020-08-16 04:54:27 -05:00
bf4a5f2b5d Order categories and boards correctly; correctly organize them by nesting level 2020-08-16 04:28:07 -05:00
dc0191a04a Rudimentary support for building wiki archives. The content is dumped to html but the wikitext isn't parsed yet.
mwparserfromhell is used for parsing wikitext but it has no support for rendering to HTML so we'll have to build it manually.
2020-08-11 10:44:06 -05:00
3cb08e2d2f Add bbcode library and some bbcode parsing. SMF's bbcode is somewhat different to what this library expects so some of it does not come out correctly. 2020-08-10 02:00:00 -05:00
0116646dd9 Add script to deploy the generated html archive to a server. 2020-08-10 01:59:48 -05:00
f3b0319d5a Add rudimentary script for generating the forum archive. 2020-08-10 01:21:33 -05:00
7026bc2b34 Add mysql2sqlite submodule and script for making a sqlite database out of the mysql dumps. This will be used to create html archives of the forum. 2020-08-09 04:01:19 -05:00
51e9782c7e refine email regex to not include whitespace 2020-08-04 10:10:48 -05:00
73dae54cf1 Filter out sensitive information (emails, IP addresses) from messages table. 2020-08-04 02:29:42 -05:00
eac65f6bda Add dump_forum_data script: This connects to a GCL database and creates SQL dumps that can be distributed publicly.
Specifically, it creates these scripts:
- structure.sql: Database structure for all tables.
- misc_data.sql: Misc. data that may be of interest, including legacy GCLF features such as the shop and the arcade, and The Orange Glove content.
- categories.sql: Category data for public categories.
- boards.sql: Board data for public boards.
- threads.sql: Thread (topic) and message data. This is the meat of the forums.

What's NOT included:
- Personal/Private messages
- Member accounts
- Administrative stuff like error logs
- Private categories and boards
- Garbage category (Lab Omega)
2020-08-03 05:02:43 -05:00
d4002ec592 Add additional categories for gens 1 - 4. 2020-07-30 04:16:26 -05:00
191e1ebd13 Add find_pages script to determine through the MediaWiki API which pages need to be looked at. 2020-07-30 03:46:02 -05:00
1a2dbbe65b Fix up PM export script 2020-07-27 02:57:19 -05:00
6716f274db Initial stab at PM export script. It just prints out all PMs to/from current logged-in user. 2020-07-27 01:48:59 -05:00
90887491c7 add README 2020-07-26 22:36:12 -05:00
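
Commits ea7e1294b1 through 646b840be4 describe how wiki page titles are mapped to archive file names: '/' is replaced with '+' (after '%2F' proved unworkable under nginx), except for dex subpages, which stay in a real subdirectory. A minimal sketch of that rule follows. The function name `archive_filename` is hypothetical, and the "directory name contains 'Dex'" test is only the heuristic floated in commit 77b160a35b, not necessarily what the repository does. It uses old-style `str.format` calls in keeping with the Python 3.5 constraint noted in b33ea016d5.

```python
def archive_filename(title):
    """Map a wiki page title to an archive file name (hypothetical helper)."""
    directory, sep, rest = title.partition("/")
    if sep and "Dex" in directory:
        # Assumed heuristic: dex subpages keep a real subdirectory,
        # e.g. GlitchDex/RB:000 stays under GlitchDex/.
        return "{}/{}.html".format(directory, rest.replace("/", "+"))
    # Any other '/' is part of the title itself, so substitute '+',
    # e.g. Gold/Silver -> Gold+Silver.
    return "{}.html".format(title.replace("/", "+"))


if __name__ == "__main__":
    for example in ("Gold/Silver", "G/S", "GlitchDex/RB:000"):
        print(example, "->", archive_filename(example))
```

A redirector would need to apply the same mapping to incoming wiki URLs so that its targets match the files the archive generator wrote.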