selberg.org Home Home

Archive for April, 2007
4/30/07
12:20 am
Davies sentencing delayed until June 15th

According to the Redditch Advertiser, the Lilly case has been adjourned until June 15:

Lilly case adjourned

THE sentencing of a woman in connection with the case of Baby Lilly, the infant whose body was found by the banks of the River Alne, has been adjourned.

Rachel Davies, 26, of Wharrage Road, Alcester has pleaded guilty to concealing the birth of a child she gave birth to by secretly disposing of the child’s dead body.

She was due to be sentenced on Friday at Warwick Crown Court.


The hearing is now expected to take place on Friday, June 15.

I previously discussed the guilty plea and arrest. There are a fair number of comments from people in the area on the arrest post that present some differing viewpoints, and I encourage you to read them.

4/23/07
11:55 pm
Closer on Virginia Tech…

This editorial from the WSJ says some good things about what to do to prevent another Virginia Tech massacre (h/t John Cole):

Diagnosis from afar is the purview of talk-shows hosts and other charlatans, and I will not attempt to detail the psyche of the Virginia Tech slaughterer. But I will hazard that much of what has been reported about his pre-massacre behavior–prolonged periods of asocial mutism and withdrawal, irrational anger and hatred, bizarre writing and speech–is not at odds with the picture of a fulminating, serious mental disease. And his age falls squarely within the most common period when psychosis blossoms.

No one who knew him seems surprised by what he did. On the contrary, dorm chatter characterized him explicitly as a future school-shooter. One of his professors, the poet Nikki Giovanni, saw him as a disruptive bully and kicked him out of her class. Other teachers viewed him as disturbed and referred him for the ubiquitous “counseling”–an outcome that is ambiguous to the point of meaninglessness and akin to “treatment” for a patient with metastasized cancer.

But even that minimal care wasn’t given. The shooter didn’t want it and no one tried to force him to get it. While it’s been reported that he was involuntarily committed to a “Behavioral Health Center” in December 2005, those reports also say he was released the very next morning. Even if the will to segregate an obvious menace had been in place, the legal mechanisms to provide even temporary “warehousing” were absent. The rest is terrible history.

That is not to say that anyone who pens violence-laden poetry or lets slip the occasional hostile remark should be protectively incarcerated. But when the level of threat rises to college freshmen and faculty prophesying accurately, perhaps we should err on the side of public safety rather than protect individual liberty at all costs.

If the Virginia Tech shooter had been locked up for careful observation in a humane mental hospital, the worst-case scenario would’ve been a minor league civil liberties goof: an unpleasant semester break for an odd and hostile young misanthrope who might’ve even have learned to be more polite. Yes, it’s possible confinement would’ve been futile or even stoked his rage. But a third outcome is also possible: Simply getting a patient through a crisis point can prevent disaster, as happens with suicidal people restrained from self-destruction who lose their enthusiasm for repeat performances.

This is good. Ever since the 70s, there’s been a clear relationship between the number of mentally ill people and the homeless, which in large part is due to closing down state hospitals. At my daughter’s preschool, we pulled up the carpets in the entry this weekend because it smelled like urine. The reason it smells is that there’s a transient who lives in the area who, among other issues, has bladder and bowel control issues, and will walk into the entryway during the day and soil himself. We’re getting the funding for a keypad for the exterior, but the reality is that this guy doesn’t belong wandering the streets near UW. I doubt he’s a threat, but I’d like to think that our society could at least make him comfortable and put him out of harm’s way.

The article then goes south in a hurry:

The best predictor of future violent behavior is past violent behavior, yet we regularly grant parole to murderers, serial rapists, chronically assaultive individuals and habitual pedophiles. Even when we do attempt to segregate low-impulse multiple offenders with effective tools such as with three-strikes laws, liberationist clamor never ceases.

At some point, if an inmate has done his or her time, then we let them go. And for parole, there are a number of criteria that must be met before someone is paroled. Does it always work? No. I read? heard? somewhere that a third of felons end up committing another crime. But then again, if that’s true, this means two-thirds don’t. If the author wants to argue that the sentences are too short, that’s one thing. But in any society governed by laws, once a punishment is meted out, society has to live with that punishment, even if after the fact people want more.

Talk to anyone who’s tried to commit a dangerously violent child or parent for even a few days: A stranger with a law degree will show up at the hearing and paint you as a fascist. So it’s far too much to expect anything resembling a decisive approach to those whose level of threat remains at the verbal level.

Given the excesses of the past–husbands committing troublesome wives, involuntary sterilization of those judged defective–extreme caution is warranted. But like drunk drivers, we sway from one side of the legal road to the other and find the sensible center lane elusive.

The problem here isn’t that it’s too hard to commit someone, or that there’s too much abuse when it’s too easy to commit someone. It’s that as a society, we don’t care that much about mental health. We are chronically building more jails instead of closing them, meaning we’re putting more people away. However, we aren’t doing nearly enough to help people that need it. And that’s what needs to be fixed.

4/17/07
12:00 am
Sonics stadium falls down, goes boom

Apparently, the Sonics’ stadium is dead. As a longtime listener of KJR Sports Talk Radio, 950 on YOUR AM dial, I’ve been following the latest Seattle stadium initiative with some interest. For the most part, there are two arguments, and they’re the same arguments that were made about a decade ago when the decision was made to level the Kingdome and build Safeco Field and Qwest Field for the Mariners and Seahawks.

Pro public funding of a stadium: Professional sports teams generate revenue, and thus the city / county / state should invest in a stadium.

Against public funding of a stadium: The public should not give millions to billionaires. If a stadium is such a good investment, then private financing should be readily available.

The argument against is stronger, in my opinion. As near as I can see, stadiums don’t by their nature generate a positive cash flow to a locality. Sure, there’s an influx of people to the stadium and a resulting tax dollar increase. OK, let’s do some quick math for a basketball team. 82 games per year, 41 home games. Say an average of 15,000 people attend each game (Key Arena holds a bit over 17,000). Say the average spend per person is $100 total, and we’ll say the tax is 10%, so $10 per person. So 41 * 15,000 * $10 = $6,150,000 per year. So, about $6 million per year. Say I’m off by a factor of 3… OK, that’s $18,450,000 per year. Call it $20 million. King County’s 2007 proposed budget is $507 million, so that’s about 4% of the yearly budget, being very generous. Less generous, and it’s 1-2%. However, when the initial outlay is $300 million or so (or perhaps just $250… the news reports $150 from King County and $100 from Renton), it doesn’t seem like a great investment.

Personally, I think the pro argument is the wrong one. Governments spend money on public buildings for professionals all the time — opera houses, symphony halls, theaters, and such, not to mention college stadiums where everyone makes money except the athletes. For the most part, buildings like symphony halls aren’t great investments. Symphonies and the like don’t make a lot of money. However, I find that they’re vital as far as the culture they bring to the community. Performing arts are critical to a community’s culture. And just like arts, sports are also critical to a community’s culture. Yes, there is a ton of money involved. Pop culture can do that. But take it away, and the community loses something.

That being said, it seems to me that the entire Sonics debacle is a crisis brought on by short-term greed. It seems the way to make money as a professional sports team owner is to buy, run the team for a number of years, and sell at a huge profit. Many owners can profit while they own the team, but some can’t, or don’t care to. Such is life. Cities and counties can also pay for stadiums, but they can’t necessarily pay for it on any given year. Sometimes a region feels relatively generous and can pay for things, other times it will feel pinched. Like if major bridges and freeways are in need of repair, and two other stadiums have been recently built. Now, the Sonics have been in Seattle for over 40 years, and have a huge fan base, so at some point the city or county will be up for building a stadium. But not now. So, a rational plan is to wait and keep pushing, and in a few years, something will go through.

Instead, a deal has to get done now, otherwise, the new owners will move ‘em out. It’s classic blackmail. Sadly, at this point the city / county / state isn’t in the mood to cave. Again, there are times when investing in culture makes sense, and other times when it doesn’t. Now, it doesn’t.

So, looks like the Sonics are moving to Oklahoma. Bah.

4/16/07
1:35 am
The joy of taxes

Well, it’s that time of year again, when the 300 million or so Americans file their federal, and in most cases state and local income tax returns. This year, like the past few years, I’ve used TaxCut, mostly because I’m getting pretty good at weird cap gains issues and because TurboTax annoyed me a few years ago with their overly aggressive product activation thing. What I’ve observed over the years is that even though I’m using the same program that loads in data from the previous return, I seem to be rooting through my yearly files more for random documents than before. Sure, there are the receipts for charities, and in WA state sales tax (but hey, unless it’s a big-ticket item, just go with the standard deduction). But I gotta keep track of the WA state tab cards that state how much tax I pay, and various transaction details and such. But the big takeaway is that I’m realizing that it’s becoming much tougher to not have software to fill out a modest 1040. I’m not talking about the simple math. Instead, it’s the ton of speculative calculations and mini-worksheets that need to be filled out.

At times like this, I often wonder about the difference between the “progressive” income tax and a “regressive” sales tax. At least in the US, income tax is used both to collect revenue in a way as people can afford (e.g. rich pay more than poor), as well as to encourage behavior via credits and deductions. For example, if both parents work, then some child care expenses (daycare) can be deducted. But if one spouse doesn’t work, then no deduction. The idea clearly is to encourage spouses, in particular mothers, to either stay home with the kids or work. However, putting the kid in daycare or a preschool environment without working (such as you might do when one kid is getting older and needs to socialize with other kids, and there’s a newborn that needs lots of mom time)… well, that’s just to be discouraged.

In Washington state, every now and then people grumble that we have a very regressive sales tax and no income tax, instead of what Oregon has, which is a high income tax and no sales tax. However, what a lot of people fail to recognize is that approximately 1/3 of purchases are made by businesses, thus businesses pay a third of the sales tax burden. Meanwhile, with an income tax, there’s all sorts of ways to avoid it. So, even though it normally runs against my normal instincts, I sometimes do give pause to wonder about how the economy of Washington seems to be doing much better than Oregon, and whether it makes sense to try and simplify things a bit. But that’s just crazy talk!

4/13/07
2:25 pm
AIRWeb 2007 Accepted Papers out

AIRWeb ‘07, to be held on May 8th in Banff as part of WWW 2007, published the list of accepted papers today. We’ve got 10 great full papers and three short papers. Hopefully, links to the actual papers will be coming soon, but until then, be sure to make those reservations for Banff!

Full papers

  • Kumar Chellapilla and Alexey Maykov: A Taxonomy of JavaScript Redirection Spam.
  • Debora Donato, Mario Paniccia, Maddalena Selis, Carlos Castillo, Giovanni Cortese and Stefano Leonardi: New Metrics for Reputation Management in P2P Networks.
  • Ye Du, Yaoyun Shi and Xin Zhao: Using Spam Farm to Boost PageRank.
  • Georgia Koutrika, Frans Effendi, Zoltán Gyöngyi, Paul Heymann and Hector García-Molina: Combating Spam in Tagging Systems.
  • Yu-Ru Lin, Hari Sundaram, Yun Chi, Junichi Tatemura and Belle Tseng: Splog Detection Using Self-similarity Analysis on Blog Temporal Dynamics.
  • Xiaoguang Qi, Lan Nie and Brian Davison: Measuring Similarity to Detect Qualified Links.
  • Krysta Svore, Qiang Wu, Chris Burges and Aaswath Raman: Improving Web Spam Classification using Rank-time Features.
  • Baoning Wu and Kumar Chellapilla: Extracting Link Spam using Biased Random Walks from Spam Seed Sets.
  • Josiane Xavier Parreira, Debora Donato, Carlos Castillo and Gerhard Weikum: Computing Trusted Authority Scores in Peer-to-Peer Web Search Networks.
  • Dengyong Zhou, Christopher Burges and Tao Tao: Transductive Link Spam Detection.

Short papers

  • András A. Benczúr, István Bíró, Károly Csalogány and Tamás Sárlós: Web Spam Detection via Commercial Intent Analysis.
  • Qingqing Gan and Torsten Suel: Improving Web Spam Classifiers Using Link Structure.
  • Hiroo Saito, Masashi Toyoda, Masaru Kitsuregawa and Kazuyuki Aihara: A Large-Scale Study of Link Spam Detection by Graph Algorithms.

4/13/07
12:35 am
Upgrading your OS

So, lots of people know that at Microsoft, we’ve recently shipped the latest version of Windows. No, not Windows Live, but Windows Vista. And in the blogosphere, there’s a fair amount of questioning about whether or not to upgrade. I’ve skipped the debate, largely because it puts me in a difficult position. If I list a bunch of reasons to upgrade, I’m just being a corporate shill. If I list a bunch of reasons not to upgrade, I’m biting the hand that feeds me. Luckily, however, the fine folks in charge of Debian just released Etch, or Debian v4.0. I’ve been running Debian for years now on my servers at home, so I thought I’d post a bit about my thoughts on upgrading Debian. You can extrapolate this to other operating systems at your discretion, and as always, your mileage may vary.

I run Debian on two boxes that act as my utility servers; they manage my mail, Web server (including this blog), DNS entries for various domains I run, backups (fully mirrored!), and archival storage. I initially picked Debian for two reasons:

  1. It was the most stable Linux distribution;
  2. The Debian package management system (apt-get) was infinitely better than the RPM hell of Redhat and related distros.

First question - why not run Windows? Well, at the time, I had recently graduated from UW, which at the time was still a UNIX shop (mostly Digital UNIX from DEC), and I was very familiar with administering UNIX type OSes. Next question - why not move to Windows? Well, simple answer — there’s no reason to. Really, I don’t want to “maintain” standard services, like Web, Mail, and DNS. The protocols are standard and new features are few and far between, and frankly I don’t care that much. I just want mail to come to me, meaning it arrives on my server, gets piped through SpamAssassin, and gets dumped in my INBOX. I want my DNS server to resolve the various domains to the right IP address.

And this brings me to Debian 4.0… will I upgrade my servers? It’s nice and stable, with lots of bug fixes, security upgrades, and new features.

Probably not.

Again, at the end of the day, I don’t want to spend time mucking about with upgrading and likely breaking something that works. I just want the services to work, and they do. I run a number of applications on the OS, and they serve my needs. The likelihood of downtime and spending hours hunting for a misconfiguration isn’t something I’m looking to spend tons of time on.

But what about all the security features?

Well, yeah, OK. I guess my system is hackable on the ports I have open (mail, dns, web, ssh… but all you port-scanning script kiddies already know that, dontcha?). I run a service called DenyHosts that blocks IPs after 3 failed attempts, and I get 1-3 blocked IPs per day. It also automatically contributes these bad hosts to a central DB, so we can all share the, uh, love.

Ultimately, when running key services, the goal is stability. Change, meaning new applications, OS, or whatnot, all bring a risk of downtime. So, my mantra is not to change anything unless absolutely necessary. While I used to be all about the latest version of Postfix, well, I’ve discovered new hobbies to occupy my time. Not to mention my day job. ;)

4/04/07
5:05 pm
RIP Karen Sparck Jones

Jamie Callan forwarded this sad news to the SIGIR List today:

Karen Sparck Jones, a pioneer in automatic language processing and information retrieval, has died of cancer.  Karen was a good friend to many within the IR community, and a leader in the best sense of the word.  She was known for a commitment to excellence, support of junior researchers, and an influential and productive research career that spanned six decades.  She was a Fellow of the British Academy, the AAAI, and the ECAAI and received numerous awards for her research, including in the last year two awards from ACM (Athena Lecturer, ACM-AAAI Allen Newell Award) and one from the BCS (Lovelace Medal).  She was the second recipient of ACM SIGIR’s Gerard Salton Award.

For additional information about Karen’s life and distinguished research career, please see her web page (http://www.cl.cam.ac.uk/~ksj21/), and the announcement of her death from Cambridge University (http://www.admin.cam.ac.uk/news/dp/2007040403).

I met Karen a few times at various conferences. She was clearly one of the great minds of search, and was one of the key people who ensured the tradition of rigor in detailing experimental results at conferences like SIGIR and RIAO. She will be missed.