Help Wanted: Delete 1 Language, Keep the Rest from a Text Dump
SoloNerfherder - Apr 2, 2025
 SoloNerfherder Apr 2, 2025
Hi!
First time posting here, so please don't pelt me with rotten tomatoes if this is the wrong forum or I made some other faux pas!
I'm working on localising Terra Phantastica with Malenko.

I have a text dump of the game kindly provided by Conan (not Derek, sorry, I guess I'm just enamored with Derek).
But, it's basically 90% control code gobbledegook (that's the technical term for program code, right?) and 10% Japanese.
I just want the Japanese so I don't have to sift through the codes when I want to go to the next line of JP when translating.

Is there an easy way that's not mega technical of deleting all the codes so I have just the JP left?

I've been Googling for hours looking for search and replace tricks in Word or something, but couldn't find anything that works.
Saw some Python scripts and stuff too, but I have no idea what to do with those.

Maybe there's some experienced Saturn localiser who's been in a similar sitch who could help?

 nando Apr 2, 2025
it would be pretty easy to strip the text out with a regex capture. but don't you kind of need the identifiers and other information to put it back into the game? otherwise you just have random text strings? I could write something in 10 minutes to do it, but I don't understand what you're trying to accomplish.
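
For anyone wondering what a "regex capture" looks like in practice, here is a rough Python sketch. It assumes each string sits between a start tag and an end tag; the names `<TEXT>` and `</TEXT>` are made-up placeholders, since the real dump's tags aren't shown in this thread, and the sample line is invented too. It keeps the character offset of each string as a stand-in identifier, so the text could in principle be matched back to its place in the dump.

```python
import re

# Hypothetical delimiters: swap in whatever tags the real dump uses.
START_TAG = "<TEXT>"
END_TAG = "</TEXT>"

# Non-greedy capture of everything between a start tag and the next end tag.
# re.DOTALL lets a single string span several lines of the dump.
PATTERN = re.compile(
    re.escape(START_TAG) + r"(.*?)" + re.escape(END_TAG), re.DOTALL
)


def extract_strings(dump: str) -> list[tuple[int, str]]:
    """Return (character offset, captured text) pairs in order of appearance."""
    return [(m.start(1), m.group(1)) for m in PATTERN.finditer(dump)]


# Invented sample line, only to show the shape of the input.
sample = "[0x1F]<TEXT>こんにちは</TEXT>[WAIT][0x02]<TEXT>さようなら</TEXT>[END]"

for offset, text in extract_strings(sample):
    print(offset, text)
```

Anything inside the tags is kept as-is, so control codes that sit inside a string (like a player-name code) would still come along with the text.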

 paul_met Apr 2, 2025
The Japanese text is between the and tags. You can simply jump to the next pair of tags using search.

 Malenko Apr 2, 2025

nando said:

He's just translating the text, I'm going to worry about adjusting control codes to get the text to fit. There are control codes that print strings from memory, like the player name, but everything is encapsulated by the text decode/end text decode controls.

Also that dump is from @Mr Conan... but @derek (ateam)... also had a more manual dumping tool that was a big help in getting started with this. Continuing thanks to the both of them for all their help.

 SoloNerfherder Apr 2, 2025

nando said:

Don't know about other people who do this a lot, this is only my 2nd time doing a full game fan translation after Bomberman Fight, and I didn't need to deal with codes for that, praise Malenko.
I do this for a living, and we never have to deal with codes like this (just the odd HTML tag or variables in the text, which is easy enough).

The idea is, I want the text just laid out in a column I can plop into Excel so I can see easily how one string might relate to another for general understanding/context purposes.

I've tried feeding the dump files as-is into OmegaT (software I use in order to keep translation memory and terminology organised), and I can translate that way, but the codes are still there, so it's still a PITA to sift through that stuff.

Also, I don't know what a regex edit is...
Total non-code guy here

 SoloNerfherder Apr 2, 2025

Malenko said:

Apologies for the name mix up, I totally brain farted there!
Thanks indeed to both Conan and Derek!

 Mr Conan Apr 3, 2025
I can just adjust the text dumper to output an additional CSV file with JP text only that you could import into Excel. Does that work?
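
To give an idea of what a JP-only CSV could look like, here is a minimal Python sketch. It is not the actual dumper, just an illustration under one assumption: "Japanese" means any run of kana, kanji, CJK punctuation, or full-width characters, and everything else counts as control code. The sample lines and the column names `line` and `japanese` are invented for the example.

```python
import csv
import io
import re

# Assumption: Japanese text = runs of CJK punctuation, hiragana, katakana,
# common kanji, and full-width forms. Everything else is treated as code.
JP_RUN = re.compile(r"[\u3000-\u303F\u3040-\u30FF\u4E00-\u9FFF\uFF00-\uFFEF]+")


def jp_rows(dump_lines):
    """Yield (line number, Japanese run) for every run found in the dump."""
    for line_no, line in enumerate(dump_lines, start=1):
        for match in JP_RUN.finditer(line):
            yield line_no, match.group(0)


def write_csv(dump_lines, out_file):
    """Write a header plus one row per Japanese run to an open text file."""
    writer = csv.writer(out_file)
    writer.writerow(["line", "japanese"])
    writer.writerows(jp_rows(dump_lines))


# Invented sample lines, only to show the shape of the input.
sample = ["[0x1F]こんにちは[WAIT]", "[0x02][0x03]", "[NAME]さん、さようなら。[END]"]

buffer = io.StringIO()
write_csv(sample, buffer)
print(buffer.getvalue())
```

To write a real file instead of printing, pass something like `open("jp_only.csv", "w", newline="", encoding="utf-8-sig")` (the filename is just an example); the `utf-8-sig` encoding adds a byte-order mark, which helps Excel read the Japanese correctly. Note that a code sitting in the middle of a sentence, such as a player-name code, would split that sentence into two rows with this approach.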

 SoloNerfherder Apr 3, 2025

Mr Conan said:

@Mr Conan...
Yes! That sounds perfect!
Thanks!

 Mr Conan Apr 5, 2025

SoloNerfherder said:

Let me know if this looks good. I can adjust it more to make it easier.

 SoloNerfherder Apr 6, 2025

Mr Conan said:

Thanks, this looks great!
Could you pass me the rest of the files on Discord, either to me directly (SoloNerfherder) or via Malenko?