Geocities Japan Archive
Welcome to the Geocities Japan Archive! ジオシティーズ ジャパン アーカイブへようこそ。
Just two guys that met in Uni over their love for Japanese cigarettes-- working to archive and restore Geocities Japan.
This is a recreation of the Geocities Japan based on archival efforts to revive the American version of Geocities.
Due to the use of old school, pre-unicode Shift_JIS by Japanese web makers some of the sites may be broken.
This is simply a test repository before the entire 4 terabytes of data is uploaded.
We hope to add multi-langugage functionality for pages, and fix older Japanese text encodings so that the sites can work with modern browsers.
The archive will also serve as a way for me to learn Front-end development along with various other archival methods.
Under Construction
Everything lmao.
Test hyperlinks and folder structure
Test website anchor tag.
Converting weird Japanese encodings to UTF-8 using the iconv command seems to have worked. iconv -verbose -f EUC-JP -t UTF-8 index(1).html
Now need to make a bash script or some python mane to just go through all the html files and convert them to UTF-8.
Test website in regards to Shift_JIS.
List of Things to Do Now
- 1. Figure out storage and add the 4 tbs of data to the server. ✅
- 2. Run scripts for getting rid of 301, 302 errors along with NKF conversion script on all the data. ✅
- 3. Run the GIF script and then seperate all gifs into its own folder. Begin preparations for a gif search engine.
- 4. Once data is added and scripts ran, begin seperating data into open directories on the website according to neighborhood classification. WIP
- 5. Once basically everything else is done, tweak gallery script and begin construction of galleries. Need to figure out how to run on local host files or through open directories of the files if possible.
- 6. Add neocities fork to the website, translated into japanese.
- 7. Improve SEO to help make our project available to more people.
Neighborhoods & Their Abouts
- WallStreet (ウォール街, Wōrugai): Finance and business
- Epicurean Table (エピキュリアンテーブル, Epikyurian Tēburu): Dining
- Colosseum (コロシアム, Koroshiamu): Outdoor sports and health
- SiliconValley (シリコンバレー, Shirikon Barē): Computers and the internet
- SilkRoad (シルクロード, Shiruku Rōdo): Travel
- Technopolis (テクノポリス, Tekunoporisu): Science and high technology
- Berkeley (バークレイ, Bākurei): Education and student life
- Heartland (ハートランド, Hātorando): Family and pets
- Hollywood (ハリウッド, Hariuddo): Films and performing arts
- Playtown (プレイタウン, Pureitaun): Video games
- Broadway (ブロードウェイ, Burōdowei): Pop, rock music, and concerts
- Milano (ミラノ, Mirano): Fashion, design, and shopping
- Milkyway (ミルキーウェイ, Mirukīwei): Dating
- MotorCity (モーターシティ, Mōtā Shiti): Automobiles and motorcycles