Pinyin Named Entity Recognition
You can train a character level sequence tagger (such as BiLSTM) to extract Chinese names from the sequence. And you need to make some difficult cases (such as some words look similar with names) for the model. You can easily find a lot of Chinese names from here and then use some Hanzi2Pinyin Tools (such as python-pinyin) to convert Chinese names into their pinyin form.