I decided to run a re-test of my initial list of blocked words this morning. Below you’ll find 378* keywords that are blocked as of March 13, 2012. (Note: these are words blocked on Sina Weibo; this is not a list of words blocked by the Chinese government. Please read this Disinfo article before re-using content in this post.)
Of the 1300 mostly unique words I found to be unsearchable in my initial test in Nov/Dec 2011, 933 were subsequently unblocked some time in late-January to early-February 2012. But apparently, that was an overreach and as of this morning, 393 of those 933 have been re-blocked (words which include 五毛 [Fifty Cent Party], 轮奸 [gang rape/gangbang], and 梯恩梯 [TNT], among others). I want to double-check and confirm that some of the longer length words are indeed unique (that is, verifying what the root words are that cause them to be blocked), so in this list you’ll only find words that are four characters and less (though I noticed after the fact there are some non unique words; for instance there are a few with 八八). I added in a few longer English words that I thought were of note along with some others from another final Wikipedia list that I generated, giving us the above 378 words that are blocked as of this morning. Please note, these are terms that when you try to search for on Weibo, you receive an error message. As far as I know, you are free to post these words in a message. (Of course, there is the potential for censoring after the fact…**)
For more about this project and how the Chinese government persuades Internet companies to self-censor, you can read my article up at Waging Nonviolence.
*Update: Forgot a few numbers like 64, 八八, and 1989. I’ve appended them to the bottom, but also removed a number of non-unique words I spotted after the fact (I left a few of the more interesting ones in) so this list now comprises 343 words. This list is a filtered subset of the total 1,574 words I uncovered to be blocked in early-2012, which you can view/download.