Hello everyone, I thought it might be nice to make a nice reference for people who don't know how to extract kanji from a site like Uta-Net. I will make this guide with reference to uta-net, but similar websites will work reasonably similarly.
Step 1: Find your song
Simple sounding at first, admittedly, but Japanese lyrics websites are a bit more specific right off the bat than English sites, in that you have to specify if your query is for a song or for an artist. It usually defaults to the artist name (歌手名), and if you want to search for the song you will have to switch it in the menu to song name (曲名)
From here, you can find the song. Navigate to the song's lyrics page.
Step 2: Extracting the HTML data
OK admittedly this is kinda dubious on a copyright front just taking it from the site, but at the end of the day, even if you copy the lyrics character for character by eye it'll give you the same result.
First, open up the inspector by using the command CTRL-SHIFT-C (works with both Chrome and Firefox; I will use Firefox for this tutorial-- Chrome is a bit trickier overall but it's mostly the same process and I'll point out when things vary). Now hover over the lyrics and click. This is what it should look like:
Note the highlighted section in the inspector window at the bottom (your inspector window may be to the right of the screen instead).
Now click on the little triangle next to something that'll look like:
Code:
<div itemprop="lyrics" itemscope="" itemtype="http://schema.org/CreativeWork">
And you will find the following element inside:
Code:
<div id="kashi_area" itemprop="text">
Right click on the text in this element and select the following option if you are in Firefox:
Or the following one if you are in Chrome:
You have now copied the HTML code into your clipboard and can move to processing it. I recommend you use MS Word for this. Paste it into a blank MS Word document
Step 2.5: If you're a Chrome user, read this
When you copy from Chrome, it leaves behind the outer tags you need to remove. Remove the highlighted section from your page:
If you're a Firefox user, this step doesn't apply, as the highlighted section will not be present on your copied HTML data.
Step 3: Cleaning up the output
You can see in the resulting data that wherever there's a line break, it's been replaced by a <br> and so you're going to have to replace every instance of <br> with a new line.
The reason I say use MS Word for this is because you can just use the Replace function (CTRL-H) to replace all instances of <br> with ^p (line break) as follows:
Once you've hit replace all, you will be greeted with a nicely formatted copy of the kanji lyrics that you can post or whatever.
Bookmarks