- #meta
<Back from China>
Hi, I’ve just returned from overseas and once I (hopefully) get over my jet lag this weekend, I’ll start making weekly posts again, probably starting Monday. I look forward to continuing the project. Thanks.
6/19: Listed in 2012 Danwei Model Worker Award round-up ■ 4/4: Asia Pacific Forum radio show with Rebecca MacKinnon and interview in The Next Web ■ 3/13: List of 343 words blocked on Weibo as of March 13 ■ 3/12/12: Article explaining my project in more detail: How China Gets the Internet to Censor Itself ■ 3/11/12: Words posted may no longer be blocked (they were blocked at the time of their posting or as noted in post). If you find any errors, contact me and I'd be happy to correct.
Hi, I’ve just returned from overseas and once I (hopefully) get over my jet lag this weekend, I’ll start making weekly posts again, probably starting Monday. I look forward to continuing the project. Thanks.
I’ll be in Hong Kong and Beijing during the next three months so posts will be fewer and farther between. If you happen to be in one of those cities and would like to treat me to a healthy lunch, get in touch! I’ll try to put up an occasional post, but for now, please go to China Digital Times (among others) for the latest in Weibo blocking news.
I’ll leave for the summer with this: looking through my latest set of logs, I noticed in particular two words which have been unblocked: 女同 (lesbian / nǚtóng) and Cai Fuchao (蔡赴朝), the current director of SARFT, the administrative body in charge of regulating China’s radio, film, and television content. As evidenced by the number of search results, 女同 isn’t a particularly common term, perhaps owing to the fact that it’s been blocked for so long as well as the fact that 拉拉 (lālā) is the preferred self-adopted word for a homosexual female in China. As for Cai, there’s been no major news about him recently, so who knows why they decided to unblock him again (he was momentarily unblocked back in February along with many other names before being re-blocked in March). [Status of 女同 and 蔡赴朝, data courtesy of GreatFire]
Last two, I promise:
It’s been busy these past few days traveling home for the Qingming festival. I’ll get back to the usual posts later in the week. For now, I figured I’d indulge myself and collect a few of the write-ups the site has gotten recently.
Update 3/22/12: Read this Disinformation article on my concerns with people misinterpreting this site.
To those who are new to the site, please wander over to the About section to get a better sense of what this site is tracking (or better yet read this article). I am NOT uncovering words that are blocked by the Chinese government. These are words that are voluntarily self-censored by one Internet company in China. I’m a bit dismayed at the various sites which are using the words I’ve uncovered as merely punchlines to the “How crazy is China these days?” question. China is not crazy. It’s a fascinating and interesting country with flaws like any other country.
There are usually specific reasons why a word is blocked (on this, I stress again, one site) and my goal is to provide the context for why. The generalizations that I’ve seen from people who’ve picked up this site have been scarily ignorant, and I guess I’m partly to blame for allowing my list to get so easily misinterpreted. My apologies. So as of now I’m removing my untranslated full list of blocked words (though I’ll leave up two samples which I have translated) in order to reduce the chance that someone will misinterpret my results.
Ok, on to the fun stuff. A few quick explanations for some words I’ve seen floating around:
Will return with typical blog entries in the coming days. Thanks for reading!
For those with more questions about how China censors and how I conducted this project of mine, I have an article now up on Waging Nonviolence entitled “How China Gets the Internet to Censor Itself.”
Thanks to the various blogs and Twitter folks who picked up the site over the weekend and today (including Hacker News, The Next Web, UNWIRE.HK, this Polish blog I can’t read [I hope it’s positive!], and others).
Hi all, I finished my search of 700,000 terms a couple weeks ago and I’ve started analyzing the data. Look for more regular posts in the coming weeks. In light of Twitter’s recent announcement that it may allow censorship as it intends to more aggressively expand overseas and Weibo’s much-debated push for real-name registration on the site (and the backlash), I’ll be writing up a short article on my results soon. Of interest is that a number of the bans on Weibo (like tank and Muslim, which were blocked at the time I searched them in November) no longer seem to be in effect. So please note that blocked words posted before February 2012 may no longer be blocked (they were blocked as of the time of their posting).
UPDATE 3/21/12: Because the initial Google translations provided for these words have been misinterpreted as their actual meanings, I’m taking down the full list of words I’ve uncovered thus far. If you still really want to see them, you can maybe do a Wayback search for this page. Otherwise, please read my blocked word posts wherein I’ve provided proper translations and context for why they are blocked on Weibo. The sample of words presented here are words blocked on Sina Weibo; these are not a list of words blocked by the Chinese government. The list also changes frequently, so the words posted here may have since been unblocked. This is not a list of ALL words blocked on Weibo, merely the ones I found in my searches. Please read this Disinfo article before re-using content in this post.
UPDATE 2/25/12: I finally finished searching through the 700,000 Chinese Wikipedia keywords last month and have found roughly 1000 words to be blocked, but the posting of logs and lists of banned words are temporarily on hold as I try and sort through the data and clean it up. It might take a few months before I post everything.
I’m planning on searching through a lot of words to see if they are blocked on Weibo. Ideally, going forward, I’d also be able to check the status of previously banned words to see if they’re still blocked. As this is a rather large undertaking (think SETI :), if you’d like to help and are fairly handy around a computer* (or simply good at following directions), let me know and I’ll send you a chunk of words along with instructions on how to check them. It does require a modicum of effort, but hopefully, as I refine my script, it’ll be mostly hands-free.
If you read about or discover a banned word, or if you want to write up a short summary of one of already found blocked words for me to post, feel free to submit it to me. Alternatively, you can message me on Twitter @jasonqng.
*You’ll need to be able to install a program on your computer, copy and paste files, type some words in a command line, and then send me the finished file when it’s all done. The rest is mostly automated (you can let it run all night while you sleep), but you may need need to intervene at various points. Don’t worry, it’s a lot less complicated than it sounds. Hopefully, as I play around with this some more, the majority of these tasks will become automated.