Zevils

Quick Ranked-Choice Elections with Google Forms

2021-12-03T10:47:00.000-08:00

Want to run a really quick ranked-choice election, like "which restaurant should we go to" or "where should we ask the city to build a crosswalk" ? See here for an example:

Here's one way to do it:

Create a new Google Form.
In the form description, explain each of the choices.
Add a "multiple choice grid" question.
In the "rows" of the question, add one row for each choice: "Chocolate", "Vanilla", etc.
In the "columns" of the question, add a "rank number" for each choice: "1st", "2nd", etc.
In the "three dots" menu at the bottom-right of the question, turn on "limit to one response per column":
Send out the form and wait for people to vote.
Once the votes are in, go to the "Response" tab of the form and export the ballots to a CSV using the option under the "three dots" menu:
Download ballots.py and pip install pyrankvote.
Adjust NUMBER_OF_SEATS in ballots.py to be the number of candidates you want to elect, e.g. how many flavors are you going to buy?
Unzip the ballot CSV run ballots.py with the CSV on standard input: ./ballots.py < ballots.csv

Maslow's Hierarchy of Engineering Team Needs

2021-09-08T16:19:00.006-07:00

Management is the continuation of prioritization by other means. —Carl von Clausewitz

Maslow's Hierarchy of Needs is the idea that people have an inherent and universal set of priorities. If you don't have enough air to breathe or water to drink, you'd better prioritize solving that problem, or you won't be around for very long to solve any other problems. Once you have that sorted out, you can focus on loftier goals, such as "feeling loved", "experiencing beauty", and "living a meaningful life". Software engineering teams have a similar natural hierarchy, and if you're leading one of them, thinking about the lowest point in the hierarchy where your team is struggling is a good way to decide how to invest your time.

This post is also available as a Twitter thread.

As the overriding goal of life is to maximize the spread of its genes and ideas, the overriding goal of an engineering team is to maximize the value it creates for its users. Considering the sub-goals on the way to achieving this from lowest to highest priority:

Existence

Congratulations, you have an idea, or an organizational charter, or some other mandate to do something! If you don't have a team to do it with, you're probably not going to do that something. Maybe that team is just you, maybe it's scads of highly specialized non-you people... But a team of zero is not much of a team at all.

Culture

If your team is a terrible place to work, nobody's going to work there for long, and those who do aren't going to deliver very good work. Create an environment where people can succeed, where it's "safe to take risks, and so on. If you want people to accomplish anything, create the conditions that enable it.

Engineering Velocity

Look upon thy works ye mighty and despair, you have a team of more than zero people, and they're trying to write software instead of stabbing each other in the back and hunting for a less-crappy job! Are they successfully writing software? If it's impossible to get anything done because you have no tests, or your codebase is sued by Olive Garden for theft of trade secrets, or your documentation aspires to Finnegan's Wake levels of clarity or The Winds of Winter levels of existence... your team is not an effective deliverer of value for your users.

Reliability

Yay, you can write software! I tried to give you a medal, but you were busy fighting production fires. And, oddly, I couldn't find a customer to write a testimonial, in spite of the large numbers of them amassed outside your headquarters with torches and pitchforks. (Next time you're going to leak and delete all of their data, try not to do it in that order.) Unreliable software doesn't deliver much value.

Market Exists

Ok, now we're actually getting somewhere. You have a team, they're writing software and doing it well. Does anyone care? Unless you're solving a problem that someone actually has, the answer is no.

Product-Market Fit

You're attempting to solve a problem that people actually have. Does your solution actually solve the problem? If so, you've achieved the vaunted product-market fit. If not, congratulations on identifying a problem that needs solving, but your value comes from solving it, and you're not there yet.

Out-Compete

If you're effectively solving an important problem, but someone else is solving it better, why would anyone use your solution? If the answer is "they don't", then you're not actually creating value for anybody.

Delight

If you do this well, people will be happy that they're using your software. They'll want to use more of it, they'll want to tell other people to use it, and so on. And so more people will use it. How can you fail here if you're doing everything else on this list? Maybe your solution works pretty well but is awkward to use. Maybe it has a price or licensing terms or other cost that people will only just barely tolerate. Maybe your reputation or that of your company is terrible, so folks hate that your solution is the best option for them.

Act IV, Scene I

2011-06-23T19:59:00.000-07:00

- what wizardry is this!? svn supports symlinks?

A dark Filesystem. In the middle, a Repository boiling. Thunder.
  Enter the three Programmers.
   1 PROGRAMMER.  Thrice the padded buf hath oe'r runn'd.
   2 PROGRAMMER.  Thrice and once, the platter spun.
   3 PROGRAMMER.  Hexate cries:—'tis time! 'tis time!
   1 PROGRAMMER.  Round about the repo go;
In spaghetti'd source code throw.—
Functors, that on blackest ARM,
Caused a user grievous harm;
Refactor'd business logic got,
Compile first i' the charmed pot!
   ALL.  Double, double toil and trouble;
Cycles burn, and repo bubble.
   2 PROGRAMMER.  Mock-up of an inode struct,
In the repo run amock;
Superblock, corrupt extent,
File flat, and file bent,
Meta data, magic prop,
B-tree hash, and sign that's dropped,—
For VCS of powerful trouble,
Like a hell-broth boil and bubble.
   ALL.  Double, double toil and trouble;
Cycles burn, and repo bubble.
   3 PROGRAMMER.  Shard of cluster; meg of RAM;
ASIC ripped from Don Knuth's pram;
Recursive matrix transform hack;
Toggle switch that won't switch back;
Heap address that ain't been writ,
Yet has a quine contained in it;
NSA encryption key;
Source code for an AI bee;
Lambda of Alonzo Church
Found by Turing's A* search;—
Document this noxious gruel
With Microsoft's new WinWord tool.
ALL.  Double, double toil and trouble;
Cycles burn, and repo bubble.
   2 PROGRAMMER.  Cool it with a peltier,
Then the code can ship, hooray!

Multithreaded Python, extensions, and static data

2010-11-14T08:09:00.000-08:00

The GIL

The GIL and Context Switching

I use Boost.Python to write some C++ extensions for Python. I work on an I/O-bound Python program; Python has a global interpreter lock ("the GIL") which means that in a multi-threaded program, only one thread can be executing code inside the Python interpreter at once. Now, a thread can drop the GIL, and the built-in Python read and write routines do this so that while one thread is doing I/O, another thread can run. However, due to a peculiarity in how the GIL is implemented,¹ even though the actual I/O takes place during system calls that drop the GIL, the need to re-acquire the GIL after every I/O operation was killing our performance.

For instance, one of the things that the application does a lot of is logging. Doing the logging synchronously -- as the code is executing and it wants to write something to its log, it needs to wait for the write to the logfile to finish before it can continue going about its business -- turned out to be a bottleneck. My first attempt to do something about that was to spawn off a separate thread for the logfile, and have logging look something like this:

class Logger:
    def __init__(self):
        self._buffer = []
        self._file = open("foo.log", "w")
        self._hasMessages = threading.Condition()
        self._lock = threading.Lock()

    def writeEntriesForever(self):
        while True:
            with self._lock:
                while len(self._buffer) == 0:
                    self._hasMessages.wait()
                messages = self._buffer
                self._buffer = []

            self._file.write("".join(messages))

    def log(self, message):
        with self._lock:
            self._buffer.append(message)
            self._hasMessages.notify()

def main():
    logger = Logger()
    t = threading.Thread(target = logger.writeEntriesForever)
    t.start()

    logger.log("foo")
    logger.log("bar")

That did get the I/O off of the main thread. However, while the write call in Logger.writeEntriesForever would make the logging thread drop the GIL, allowing the main thread to continue executing, the logging thread would need to reacquire the GIL when write returned. Now, it'd drop the GIL again when while waiting, which is where the thread would spend most of its time, but then it'd need to acquire the GIL again between the end of wait and the start of write. All of these context switches were almost completely negating any performance win from offloading the actual I/O to a separate thread.

Boost.Python

Enter Boost.Python. The GIL is only needed when using the Python interpreter, so if the entire body of writeEntriesForever doesn't need the interpreter, the thread can drop the GIL as soon as it enters that method and never reacquire it. This means writing that method in some language other than Python, which is what Boost.Python makes it easy to do.

The way Boost.Python works, you wind up compiling and linking your C++ code into a dynamic library, and that library is a Python extension. In the example above, the new Python code would look like:

from logging_extension import Logger

def main():
    logger = Logger()
    t = threading.Thread(targer=logger.writeEntriesForever)
    t.start()

    logger.log("foo")
    logger.log('bar")

and then you'd have C++ code that would look something like:

#include 
#include 
#include 
#include 
#include 
#include 
#include 
class ScopedGILRelease {
// The GIL will be released when an instance of this class goes in-scope
// and reacquired when it goes out of scope.
public:
    inline ScopedGILRelease() { m_thread_state = PyEval_SaveThread(); }
    inline ~ScopedGILRelease() {
        PyEval_RestoreThread(m_thread_state);
        m_thread_state = NULL;
    }
private:
    PyThreadState *m_thread_state;
};

class Logger {
public:
   Logger();
   ~Logger();
   void log(boost::python::str message);
   void writeEntriesForever();

private:
    typedef std:list LogBuffer;
    std::auto_ptr buffer;
    std::filebuf fb;
    std::ostream file;
    boost::condition hasMessages;
    boost::mutex mutex;
};

Logger::Logger() {
    fb.open("foo.log", ios::out);
    file = std::ostream(&fb);
    buffer = std::auto_ptr(new LogBuffer);
}

void Logger::log(boost::python::str message) {
    ScopedGILRelease noGIL; //Drop GIL before acquiring mutex to avoid deadlock.
    boost::scoped_lock lock(mutex);
    buffer.push_back(boost::python::extract(message);
    hasMessages.notify_one();
}

void Logger::writeEntriesForever() {
    ScopedGILRelease noGIL;
    while(true) {
        std::auto_ptr messages;
        {
            boost::scoped_lock lock(mutex);
            while(!buffer.size()) hasMessages.wait();
            messages = buffer;
        }
        for(LogBuffer::iterator i = messages->begin(); i != messages->end(); i++) {
            file << *i;
        }
    }
}

BOOST_PYTHON_MODULE(logging_extension)
{
    using namespace boost::python;
    class_("Logger")
        .def("log", &Logger::log)
        .def("writeEntriesForever", &Logger::writeEntriesForever)
    ;
}

Python Extensions and Static Data

So, that all worked fine and dandy until I did three things.

Multiple Extensions

The first thing that caused a problem is I also decided to move the code for interacting with Oracle into a Boost.Python extension. For the usual reasons, I didn't want to have that code and the logging code in one big honking library of doom, so I put it in its own extension; there was now logging.so and oracle.so.

Static Data

The second thing that caused a problem is that our logging code is actually more complicated above. We have a syslog-like framework where there are different categories of log message, and the app can be configured so that different categories have different log levels. There are a lot of LOG_DEBUG statements in the application, but if none of the logging categories are configured to be that verbose, those statements will never actually make it into the log.

Since logging settings are application-wide, and it'd be ugly to have to pass around a "logging state" object everywhere (or for that matter, an instance of Logger), I used static data for that:

    static std::map theLoggers; //Map logging destination to the logger object.
    static std::map theSettings; //Map logging category to its log level.

Using One Extension From Another

The third thing was that I wanted to actually log things from inside the database interaction code. Simply including the logging headers from inside the DB sources didn't work:<

#include "../logging/logging.hpp"
logging::doLog(logging::LEVEL_DEBUG, logging::CATEGORY_DATABASE, "Hello, world!");

It would compile, but it wouldn't run because the symbols from logging.so were unresolved. Okay, easy enough to fix. I added -l:logging.so to the link line for oracle.so and went about my merry business.

Symbol Visibility

This looked like it worked, but none of the messages from oracle.so were actually making it into the log! I thought I must be doing something threading-related incorrectly, or something. But, eventually, while debugging in GDB I noticed an odd message.

(gdb) b
Breakpoint 1 at 0x1234567: file logging.cpp, line 6. (2 locations)

2 locations? Oh. Well, that was the problem. To load the dynamic library at runtime on Linux, Python uses the dlopen function. The documentation for dlopen mentions, in the description of the flag argument, that the RLD_LOCAL flag (the default) means that "symbols defined in this library are not made available to resolve references in subsequently loaded libraries." This meant that when Python loaded oracle.so, ld.so would map in a new copy of logging.so (because oracle.so was linked to it), ignore the copy pulled in when Python did dlopen("logging.so", RTLD_LOCAL); . This meant that when Python routines called logging functions, they got one copy of the static data, while when oracle routines called logging functions, they got their own copy of the static data! So, the database code wasn't seeing any of the logging settings changes made from the Python code.

My solution was to stop linking oracle.so against logging.so, and to create a new pre_c_import.py, with a dire warning in a comment that before importing any Boost.Python extensions, one must import this file:

###---*** IMPORTANT! Before importing any Boost.Python modules, you *must*
###---*** import this, e.g.:
###---***    import pre_c_import
###---***    import c_extensions.logging
###---***    import c_extensions.oracle
###---*** If you don't, your import will fail with unresolved symbol errors.
###---*** Whenever you add a new extension module, you should add it to
###---*** the import below.
import sys, DLFCN
current = sys.getdlopenflags()
sys.setdlopenflags(current | DLFCN.RTLD_GLOBAL)

import c_extensions.logging
import c_extensions.oracle

sys.setdlopenflags(current)

RTLD_GLOBAL is the opposite of RLTD_LOCAL, so now the extensions were able to see each other's symbols, and everything was happy.

Dave Beazley has a good explanation and hour-long video describing the issues. ↩

Sloppy Graph, Sloppy Design

2009-09-16T06:21:00.000-07:00

I was spending time trying to fine-tune a graphviz file documenting the call graph of a piece of code and describing some of the critical functions. graphviz isn't really designed for the kind of long node labels I wanted to give it, so it would do things like put nodes in places which made it have to draw arrows reaching clear across the page.

Finally I realized that rather than trying to talk graphviz into reordering its nodes, I could just refactor the thing I was graphing so that the flow wasn't so darned convoluted in the first place.

Before (image links to full size version):

After (image links to full size version):

Corollary? If it's hard to get your call flow graph to look pretty, well, the graph isn't the only thing that's ugly...

An Avocado in the Snow

2008-12-22T01:54:00.001-08:00

An avocado, found in the snow near Portland St and Broadway, Cambridge, MA

An avocado in the snow.
Who left it there? I do not know.
Not Father, Son, nor Ghost so holy,
Rebirths you into guacamole.
Did leaping from some wretched fate
Allow you to feel special, great?
Or did, cast down like ancient foe,
You weep from terror, weep from woe?
But lie here now, near Portland Street,
And rest, green flesh and tasty meat.

Switching Finks

2008-01-05T17:59:00.001-08:00

One of the open-source projects I contribute to is Fink, a package manager for OS X; if you've used apt-get or yum on Linux, it provides a similar facility, allowing you to install, say, GnuPG by running fink install gnupg. It installs things into its own directory tree, rooted at /sw by default, to avoid interfering with things shipped by Apple (/, /usr) or manually installed by the user (/usr/local.) That is, if you have Fink installed, your system will have /sw/bin, /sw/lib, /sw/etc, /sw/share/man, &c.

So that you can run things installed in these nonstandard locations, Fink provides some shell commands in /sw/bin/init.sh which edit environment variables like PATH and MANPATH to include the /sw/* directories. Most Fink users have . /sw/bin/init.sh in their ~/.profile, so these commands will be invoked when their shell starts.

Having my shell automatically pull in Fink at startup doesn't work for me, though. It's important to me to have a clean environment available. For instance, when I'm contributing to non-Fink open-source projects, trying to help someone who doesn't have Fink installed troubleshoot something, or submitting a bug report for a program that interacts with other programs where I have the Fink version installed, but Apple ships a different version with the system. (Note that this is only an issue if program A interacts with program B by invoking it as a standalone process without using an absolute path.)

Also, as a Fink developer, I actually have multiple Fink installations at different paths, and I only want one loaded at a time; I don't want to activate /Volumes/SandBox/fink/dev-sw in an environment where /Volumes/SandBox/fink/sw has already been pulled in!

It's much easier to pull Fink stuff in later when I need it than to undo the changes that /sw/bin/init.sh makes to my environment. My solution for making it easy to activate a particular Fink installation was to add the following to ~/.bashrc:

if [ -n "$SW" ]
    then export CFLAGS="-I$SW/include"
    export LDFLAGS="-L$SW/lib"
    export CXXFLAGS="$CFLAGS"
    export CPPFLAGS="$CXXFLAGS"
    export ACLOCAL_FLAGS="-I \"$SW/share/aclocal\""
    export PKG_CONFIG_PATH="$SW/lib/pkgconfig"
    export PS1="[$SW_DISPNAME \\W@$(hostname -s)]\\\$ "
    . "$SW/bin/init.sh"
    export PATH=~/bin:"$PATH"
fi

What this does is arranges it so that if I start a new shell with SW and SW_DISPNAME set, it'll pull in the Fink installation rooted at the directory $SW and put $SW_DISPNAME in my shell prompt so that I can see which environment I'm using. The extra environment variables before . $SW/bin/init.sh set things up so that if I compile things by hand, they'll find and link against Fink-installed libraries; the PATH setting at the end is because init.sh places the Fink bin directory at the front of the PATH, and I want my personal bin directory to come before it.

I run the following script (saved as ~/bin/finkinit) when I want to pull in Fink:

#!/bin/bash

FINK=${1:-main}

case "$FINK" in
    main)
        SW=/Volumes/SandBox/fink/sw
        SW_DISPNAME="fink"
        ;;
    dev)
        SW=/Volumes/SandBox/fink/dev-sw
        SW_DISPNAME="fink-dev"
        ;;
    *) echo "Unknown fink install '$FINK'" >&2 ; exit 1
esac

export SW SW_DISPNAME
exec /bin/bash

This gives me a subshell with Fink turned on, which I can exit out of when I want to return to a clean environment. If I run it as finkinit, I get my main Fink installation, or I can run finkinit dev to get an alternate Fink.

For All Your Finger-Pointing Needs

2008-01-01T15:59:00.001-08:00

While working with a large codebase, I often want to find the origin of a particular line. Subversion offers a tool, annotate (aka blame, aka praise), which displays the author and revision for every line in a file, indicating who made the last change to a line. However, the last change is often not very useful; it was a minor change as a result of some other change you're not interested in, or the code was moved around due to refactoring, and you need to go back even further.

When I need to do this, I find myself doing a sequence of:

1. svn blame FILE | less; find the revision N where the line was last changed
2. svn log -rN FILE | less; if the change is interesting, read the commit log for the file
3. svn blame FILE@N-1 | less; using Subversion's little-known pinned revision syntax, find the previous time the line was changed
4. Using N-1 as the new N, return to step 2.

: Pretty much any Subversion command that takes a path argument can be given PATH@REVISION instead to use the version of the path at a particular revision. This is great for diff and cat as well as blame. I use it for working with deleted files and branches and diffing a branch against trunk.

I've put together a rough version of a tool to make this easier; it's at /trunk/blamegame in my repository, which is here for browsing with ViewVC, or it can be checked out with svn co http://zevils.com/svn/trunk/blamegame blamegame . It still needs some fine-tuning and documentation, but invoke it like blamegame FILE LINE (where FILE is a URL or the path to a file in a Subversion working copy) to start looking at a particular line of a file. You can navigate and search the file using a less-like interface. To drill down to the previous change to a line, hit r and then enter the line number. l, o, n, and m switch between viewing the commit log, the changed parts of the old file, the changed parts of the new file, and (the default) the diff. If you need to change the path you're looking at (for instance, to jump inside a branch), use the p command. h will show the available commands.

Let me know what you think.

Wrong Dates in iCal Birthday Calendar

2007-12-31T07:12:00.000-08:00

To keep track of people's birthdays, I use Mac OS X's Birthday Calendar feature of Address Book/iCal. I was going through my calendar the other day, and I noticed that a birthday which I knew was sometime in January wasn't showing up. It was on the corresponding Address Book contact, though. I deleted the birthday from this contact and reentered it, which fixed that entry, but on the suspicion that more birthdays might be missing, I flipped through my calendar and found:

The Address Book birthday field has the misfeature that it forces a year to be specified. What a rude thing for Address Book to be asking! Anyway, I'd arbitrarily picked year 1 for the year for any contacts whose birth years I didn't know. Maybe, I thought, the Gregorian reform was throwing things off. However, changing the year to 1900 didn't help matters, and in fact made them worse:

Turning the birthday calendar off (which wipes out iCal's backing store for the calendar) and on didn't help matters. A web search turned up some other people having the same problem, but the only useful solution they came up with was deleting and recreating entire contacts by hand.

I wanted to see if the raw data was wrong in Address Book's database. Address Book uses Core Data in a way that makes the database difficult to work with at the SQLite command-line level, so instead I hacked /Developer/Examples/Python/PyObjC/AddressBook/Scripts/exportBook.py to emit the birthday field by adding ('Birthday', AddressBook.kABBirthdayProperty) to FIELD_NAMES and the following to encodeField:

    elif isinstance(value, AppKit.NSCalendarDate):
        return value.descriptionWithCalendarFormat_("%Y-%m-%d")

It turns out that a number of entries had negative years, e.g. -1900-03-23 instead of 1900-03-23. I'm not sure how this happened, but here's a script (which you can download) to fix it:

#!/usr/bin/python
"""
Fix negative birthday years in Address Book.
This work is hereby released into the Public Domain.
"""
import AddressBook
import AppKit

def personName(person):
    return "%s %s" % (
        person.valueForProperty_(AddressBook.kABFirstNameProperty),
        person.valueForProperty_(AddressBook.kABLastNameProperty)
        )

def formatDate(date):
    return date.descriptionWithCalendarFormat_("%Y-%m-%d")

def fixBirthday(birthday):
    year = int(birthday.descriptionWithCalendarFormat_("%Y"))
    if year < 0:
        return birthday.dateByAddingYears_months_days_hours_minutes_seconds_(
            -year * 2, 0, 0, 0, 0, 0)
    else:
        return None

def fixPersonBirthday(person):
    birthdayProp = AddressBook.kABBirthdayProperty

    birthday = person.valueForProperty_(birthdayProp)
    if birthday == None: return

    fixedBirthday = fixBirthday(birthday)
    if fixedBirthday != None:
        print "Fixing up %s: %s -> %s" % (
            personName(person),
            formatDate(birthday),
            formatDate(fixedBirthday)
            )        
        person.setValue_forProperty_(fixedBirthday, birthdayProp) 

book = AddressBook.ABAddressBook.sharedAddressBook()

for person in book.people():
    fixPersonBirthday(person)

book.save()

Internationalization of Names

2007-12-28T08:19:00.000-08:00

Names are complicated

What's in a name? The answer turns out to vary quite widely around the world. When an English-language form, either electronic or paper, asks for a person's name, it usually provides separate fields for first and last name, and sometimes middle name or middle initial. Aristotle Pagaltzis linked to a post by Jim Clark on Thai names, demonstrating that this approach, or even the alternative "given name, family name", falls down pretty quickly outside the English-speaking world. Thai names consist of:

A given name, similar to the English first name, except that it must come from a list of government-approved names;
A family name, which is also government-regulated; all people with the same family name are related, and new Thai citizens must select an unused name. Like all non-namespaced identifiers (domain names, instant messenger handles, user names on popular web services), the good short ones are taken; and
A chue len, which is typically translated as nickname, but according to Mr. Clark is more like an informal given name; it's selected by one's parents or close relatives early in life (though not necessarily at birth).

The obvious mapping of Thai name components onto English, (given name, family name, chue len) → (first name, last name, nickname), doesn't work very well. Consider the Thai name Thaksin Shinawatra, chue len Meow, the former prime minister. His (romanized; more on that later) legal name is Thaksin Shinawatra. If addressing him politely, I would refer to him as Khun Thaksin.¹ Note that this is {honorific} {given name}, not {honorific} {family name}; in other words, Mr. Matthew as opposed to Mr. Sachs. His friends and family will call him Meow, not Thaksin or Shinawatra.

A further wrinkle is that when sorting a list of Thai names, the given name, not the family name, should be the sort key. Then there's also the matter that Thaksin Shinawatra, aka Meow isn't really the gentleman's name at all; it's ทักษิณ ชินวัตร, aka แม้ว. There are several standard romanizations for Thai, and whichever one the named individual prefers is considered canonical. There are also other quirks involved in the Thai script form of a name, like the lack of whitespace between the honorific and the given name.

Non-Thai complications

Then there are the whole sets of different requirements for other kinds of names. The comments on Jim Clark's blog entry, and this post by Richard Ishid, who's in charge of i18n issues for the W3C, give some other good examples.

Russian and Icelandic have gender suffixes on the family name (Fuzaylova for a woman, Fuzaylov for a man; Fjalar Jónsson vs. Katrín Jónsdóttir.)
Russian has nicknames (which, like Thai "nicknames", are much more widely used than English nicknames) which are usually (always?) systematically derivable from their given names; Vladimir → Vova.
Scandanavian given names typically include spaces, and convention varies as to how acceptable it is to refer to Hans Christian Andersen as Hans vs. Hans Christian. This isn't unheard of in the southern United States, either -- Billy Jean, &c. In some parts of Europe, these multipart given names are hyphenated, as in the Austrian Hans-Christian or the French Jean-Claude.
In France and Italy, names can have a comma which essentially divides a series of first names from a series of middle names; in France, the middle names are rarely used outside of legal contexts, while in Italy, the middle names aren't used in legal contexts. A Mario, Alberto Giovanni Rossi would have a legal name of Mario Rossi in Italy, whereas a French Jean, Christophe Dupond would be commonly known as Jean Dupond but legally Jean, Christophe Dupond.
Many countries use patronymics instead of stable family names, so a set of related people won't have the same family name.
Many Chinese take arbitrary western nicknames for ease of communicating with westerners.
Chinese names also have generational markers, so a set of siblings will all have the same "middle" name, and names are written {family}{generational}{given} in Chinese script.

So what?

How much of this do we really need to worry about? When I say that Thai names should be sorted by given name, should, of course, is a horribly loaded term. If an American border control agent pulls up a list of people who have entered the country at a particular point, they probably want the sort key to be Thaksin, not Shinawatra. Mapping (given, family) → (first, last) is also probably fine for this application. So when, exactly, does the extra information need to be preserved?

Some reasons that a system might be interested in a name, or parts of a name, are:

Correlating records with other systems
Displaying people's names
Addressing people in writing ("Dear Mr. Sachs,", "Welcome, Matthew!") or on the phone
Identifying people ("To look up your records, enter your name")
Searching for people (on, say, a social networking site)
Sorting a list of people

For most English applications that don't cater to a large international audience, it might be "good enough" to either simply have a flat name field where users can either enter arbitrary names or at least their romanizations.² A flat name field is much more flexible. Since you probably need to support substring searches anyway, it doesn't lose anything as far as searching's concerned.

If you want to sort by last name, or communicate with other systems that take a (first name, last name) tuple, it might be good enough to just split off the last whitespace-separated token and treat that as the last name.³ If that's not good enough, a pair of (first names, last name) or (given names, family name) inputs may be called for, but characters such as spaces and apostophes (O'Flannagan) should be valid. If your application wants to try to automatically derive a secondary form of address from the name entered, maybe it shouldn't. Is the ability to have form letters say Mr. Sachs as opposed to Matthew Sachs really worth the faux pas of Mr. Shinawatra? I guess it depends on how international your audience is; you could always ask for multiple forms of address.⁴

For applications that want to really get localized names right, like a system-wide address book or a global social networking site, a more complex approach is called for. For instance, the Mac OS X address book framework knows about the address formats for various countries; it could extend that functionality to support different name formats. It has some rudimentary support for this, in that an individual address book entry can have a set of name ordering flags associated with it, either first name first or last name first (sic); name fields are fixed at title, first name, middle name, last name, suffix, nickname, maiden name, and phonetic (first, middle, last) name.

Per-country address format support doesn't change which fields exist, but it changes the order they're displayed in. Per-country name format would need to be more complicated. A Name (which a person might have more than one of with different NameFormats) might consist of:

NameFormat, defining the (country, language) associated with the name (e.g. en.US and the set of available NameComponent)
A list of (NameComponent, Value, (optional) PhoneticValue)
int Name.compareWith(Name)
String Name.representation(NAME_REPRESENTATION) where NAME_REPRESENTATION is one of:
- LEGAL_NAME
- FORMAL_NAME
- SHORT_FORMAL_NAME
- INFORMAL_NAME
- VERY_INFORMAL_NAME
Name Name.convertTo(NameFormat) would try to convert to a different name representation using automated rules for things like romanization.

Khun is a generic honorific roughly akin to Mr./Ms./Mrs. There might be a better one to use for a (former) Prime Minister. This list includes ones for teacher, aunt, sister, older person, and younger person, but suggests that khun is always used when addressing someone formally. ↩
In part two of his post Mr. Ishid recommends that applications that expect ASCII input specify it; detecting and erroring on input in unsupported scripts is probably sufficient. ↩
It might be worth having a list of tokens which will also get treated as part of the last name, such as de, with this approach. ↩
"Enter your name and how you'd like to be addressed:" ? ↩

Migrating a wiki from Trac to MediaWiki

2007-12-26T08:02:00.000-08:00

I'd set up a Trac installation for wedding planning, instead of using MediaWiki (the system Wikipedia uses, which I already had a couple of installations of) since we wanted both a wiki (venue data, possible honeymoon destinations, guest lists... shut up, it's useful!) and ticket system (useful for tracking things like thank-you notes and being able to assign specific ones to either Liz or myself).

However, Dreamhost doesn't support mod_python, so pages were taking way too long to load. I decided to switch over to MediaWiki for the wiki part and just use my existing Bugzilla installation for ticket tracking. Hence, a new script over on the code page, trac2mw. Our wiki was fairly tiny, so caveat user. I didn't bother having it migrate tickets tickets or attachments, since we didn't have any data there that was worth preserving. The input format, a MySQL XML dump, probably isn't ideal for a lot of people (since Trac runs on SQLite by default.) It does fix up the wiki page syntax (the parts of it we were using, at least), though.

Less Edward Tufte, More Don Martin

2007-12-17T15:30:00.000-08:00

A New York Times blog post on holiday tipping linked to a gem from the Times archives, its own ancestor from 1911.

The most striking feature of the article, which appeared on page six of the magazine section, is the large political cartoon-like illustration in the center (drawn by Reginald Russom, who evidently went on to help found what later became the Australian Cartoonists' Association.) From what I've noticed, while the Times Magazine still employs plenty of illustrations, they're mostly charts and graphs; when there's a lead image that's not a more or less realist photograph of the article's subject, it tends to be a photo like this one.

I love how one old newspaper article can shed light on:

Other concerns of the period (the legality of a state (or city?)-wide income tax debate was argued before the State Supreme Court)
Typical incomes and wages (a bit over $1M/yr in 2006 dollars is their example income for a "well-bred" New Yorker)
Types of service-sector employees one might utilize (such as elevator boy, charwoman, furnaceman, telephone operator, milkman, and stenographer, in addition to less remarkable professions)
Things that one might fear malfunctioning in an apartment (how little some things change; here we have the electric buzzer, hot water, windows (by the glass being broken, not routine mechanical failure), and mail delivery)

Maybe this is still routine in Manhattan, at least in the more highfalutin co-ops, but I also found it noteworthy that the building's management was expected to send you candidates if you wanted to sublet your apartment (but watch out; if you anger your super by not tipping around Christmas, he might send "several negroes and a Chinaman" your way!)

When I first got Times archives access (by subscribing to TimesSelect back in the day), I trawled the archives, there's a lot of good stuff there. If anyone else has a favorite, I'd love to hear about it in the comments.

Diagnosis of Inferior Social Proclivity Disorder in Young Adult Patients: A Case Study

2006-03-14T07:16:00.002-08:00

Rodgers N. Hart, F. Sinatra, and E. Fitzgerald, Lorenz Institute for the Advancement of Clinical Psychology

Note: This paper has also been accepted for publication in the Annals of reformat_songs.

Introduction

Inferior social proclivity disorder, or “trampiness”, is commonly mistaken for adjustment disorder not otherwise specified.¹ However, this condition is surprisingly common in early post-adolescent patients, especially females.² We examine the diagnosis and treatment of one patient, who we shall refer to as Lady. Lady, when she began treatment, was a 24-year-old who referred herself to our private practice. She had become increasingly concerned over her difficulty in forming social relationships at her place of employment, a finishing school.

Initial Work

We spent several sessions simply becoming familiar with the patient³ and allowing the therapeutic relationship to coalesce, and listening to the cognitive-behavioral paradigms⁴ which the patient used to self-describe the internalities⁵ of her situation. Lady seemed to view herself through a neo-behavioralist⁶ lens, and attempted to leverage this paradigm to assert control over her situation. She would often attempt to defer meals until excessively late hours, although these control attempts were never successfully realized due to her inability to stave off her hunger. Peculiarly, she was unusually consistent in her failures; she routinely ate dinner at exactly 7:55 in the evening. This led us to suspect a possible anorexia nervosa (restricting type) in conjunction with obsessive-compulsive personality disorder.⁷ Her consistent timeliness at cultural events — she was a regular patron of the theatre — reinforced this notion.⁸ However, our experiences with disorders of these spectra suggested that it would be premature to form anything more than a tentative diagnosis at this point.⁹ Using a hybrid talk therapy approach,¹⁰ we probed further.

Contraindications for Obsessive-Compulsive Personality Disorder

Further work with Lady led to the discovery that she exhibited several behaviors which contraindicated OCPD. First and foremost amongst these was a strong revulsion to gambling and excessive personal grooming.¹¹ Two contexts in which her coworkers often socialized were informal gambling nights with members of the local political establishment and outings to nightclubs with rigorous formal dress codes. Lady claimed that she felt excluded from these events due to her aversion to these activities. In addition to serving as social bonding rituals, her coworkers used these occasions to undertake the exchange of critical back-channel social collateral, or “gossip”.¹²

Contraindications for Anorexia Nervosa

We also found evidence that she did not have anorexia nervosa, or any other eating disorder. Eating disorders are typically characterized by a need by the patient for control over his or her environment, actualized by control over the frequency and manner of dietary events.¹³ It is expected, in cases of these disorders, to find, upon a closer examination, a pattern of control mechanisms. However, Lady did not seem to have any extra-dietary retentiveness behaviors. She was almost alarmingly nonchalant about upcoming major life events and her financial situation. She hoped to leave California (her state of residence) at some point, stating a preference for a warmer, more arid climate, but neither had nor desired strategies for attaining this goal. On a smaller scale, she would often arrive for appointments with her hair in a state of disarray, claiming (when prompted) that it had been disturbed by the wind on the drive over, but making no attempt to correct it.

Diagnosis of Inferior Social Proclivity Disorder

We concluded that Lady was probably not suffering from OCPD or anorexia nervosa. We considered a diagnosis of general social anxiety disorder, but she genuinely did seem to desire to connect with her coworkers, and she was quite active in other social circles. Then, in one session, Lady revealed a key piece of information. She said that her avoidance of the contexts in which her coworkers preferred to socialize was probably a good thing, because her financial situation did not permit her to engage in the expense of attending such nights on the town. She felt that her non-luxury automobile and other secondary socioeconomic characteristics placed her in a position of inferiority, and that she would be taken advantage of by the sophisticated and (in her view) unsavory characters who would often accompany her coworkers on these social outings. She wished to pursue a deeper connection with her coworkers, but she characterized their other associates as “sharpies” and “frauds.”

We then asked how her coworkers could maintain such extravagant lifestyles while she, in a similar job at the same place of employment, could not. Her response to this was the final piece of the puzzle. This reinforces the critical importance of a close reading of responses to even innocuous questions in talk therapy.¹⁴ She said that she had been offered many increases in salary, but had repeatedly turned them down because she “didn’t want the hassle.” This was a clear-cut case of ISPD. The patient was intentionally holding herself to an “inferior” social position, had difficulty functioning because of it, and did not perceive of her assumed position as problematic.¹⁵

Motivating Factor Analysis

At this point we had diagnosed Lady, but this only really told us the “how” of her “trampiness”. Although it is often difficult or impossible to do so successfully,¹⁶ we elected to explore the motivating factors behind her disorder (the “why” of her “trampiness.”) Such analysis often reveals additional disorders, or at least provides information which may prove invaluable in treatment. This analysis is still ongoing, and we do not have any results yet.

Treatment Plan

Treatment of Lady is currently ongoing. We are continuing talk therapy, both for its own merits, and as a component of the aforementioned motivating factor analysis. We are also attempting to use a combination of cognitive behavioral therapy and desensitization to address some of her avoidance issues.¹⁷ We have had some preliminary success in exposing her to fast food sprayed with a solution which will cause it to induce greater than normal levels of nausea when consumed, and we have instructed her to bring gradually larger amounts of cash with her on her visits to our office. We hope to discuss the efficacy of these techniques in a future publication.

A. Hasapemapetalan, B. F. Goodwrench; Misdiagnosis of Social Proclivity Disorders; Annals of the Bowling University Watercooler; 1973. ↩
D. Sedaris, T. Mobile; Covariant Statistical Analysis via Modified Stochastic ANOVA of ISPD Demographics; Quarterly Christian Statistical Review; 2001. ↩
F. Vuzayloya, R. Nachlin; Look Who’s Talking: Techniques for Patient-Therapist Acclimation; Proceedings of the Windsor University Conference on Clinical Techniques; 1999. ↩
J. Evans, B. Wilson; Quantum Entanglement and the Cognitive-Behavioral Paradigm; Psychological Humourism; 1273. ↩
B. Allen, M. Davis, L. Fracalossi, M. Sue; Internalities: A New Paradigm for Patient Perception Analysis, and its Applications for the Treatment of Inferior Fictive Disorder; Psychology Fortnight; 1999. ↩
K. Reeves, A. Wachowski, L. Wachowski; A New Kind of Behavioralism; Zion Review of Psychology; 2235. ↩
M. Tee, S. L. Jackson; Foolish Diagnoses: A Case Study of an Aviaphobic-Ophidiophobic Complex; Scientific Moldovan; 2004. ↩
I. Asimov; The Endochronic Properties of Resublimated Thiotimoline; Astounding Science Fiction; 1948. ↩
D. Savage, D. Iskowitz; It Happens To All Therapists: On The Avoidance And The Acceptance Of Premature Diagnosis; Journal of the Association for Computing Machinery; 2003. ↩
P. Hanks, J. Pusteyevski, R. Jakenduf; Semantic Metrics for Evaluation of Talk Therapy Approaches; Psychological Linguistics; 2006. ↩
U. Ulrich, D. Davidson, L. Richards, L. Rudolfo, B. Abrams; Coded Contraindications; Proceedings of the 30th Annual Hashimoto University Conference on Psychological Methodologies; 1986. ↩
D. Wikiberg, S. Bunan; Byzantine Generals In Space: Network Theory, Social Dynamics, and Back-Channel Communications; RISKS Digest; 1997. ↩
M. Powers; Controlling Massively Parallel High-Resolution Event Timers in Low-Memory Environments; Nature; 2000. ↩
H. P. Grice; Logic and Conversation; Syntax and Semactics, Vol. iii; 1975. ↩
T. Geisel (ed.); The Delightful Diagnostic Dictionary; Scholastic Books; 1960. ↩
S. Hill, P. Graves, et. al.; Administrative Disavowment in High-Stress Environments; Organizational Psychology; 1966. ↩
C. Thulhu, Y. Sothoth, S. Niggurath; Inspiring Fear in Humankind; Applied Noneuclidianism Review; 1986. ↩

Introduction to Unit Testing

2006-03-08T09:36:00.003-08:00

Notes for a lecture given to Brandeis University’s COSI 22a.

What Is Unit Testing, and Why Should I Care?

Unit testing is the process of writing tests for individual bits of your program, in isolation. A “bit” is a small piece of functionality. We’ll discuss how small later. How can you know whether or not your program works if you don’t test it? If you’ve ever lost points on a programming assignment because something didn’t work right, you could’ve saved yourself from that by testing your program.

If you go on to take COSI 31a, you will do better on the programming assignments if you write tests! More importantly, it’s a good habit to get into as a programmer. Having tests for your code turns programming from an art — “gee, it looks right and seems to work, I think I’m done” — to a science —; “this is the evidence I have to support the claim that my program is behaving correctly.”

Unit testing is one of the easier ways to get into all the nooks and crannies of your code and make sure it’s doing the right thing. The act of writing tests often helps reveal areas where it isn’t clear what it means to do “the right thing.”

What to Test

To figure out what to test, start by thinking about what it means for your program to work. If you have a formal specification, that’s a great place to start. For your homework assignments, you’ve had such a specification, the Java API reference for whichever class you were supposed to be implementing.

You should also think about what all the different parts of the task are. You want at least one test for every public method in every public class. One way to measure the quality of unit tests is a metric called coverage. Coverage measures how much of your code is hit when you run your tests. Consider the following code for the function isNegative:

if(n > 0)
    return false;
else
    return true;

If you wrote one test for this function, which tested n = -5, you would only have 50% coverage, because two of the four lines are hit by that test (the first two are never executed.) To achieve complete coverage, you also need a test for a positive n, say n = 5. Conceptually, you’re not fully testing the function if you only test that it returns true for negative numbers, you also need to test that it returns false for positive numbers; otherwise, it could be replaced by a function that always returned true and your test suite (the collection of all of your tests) would have no idea! This is a common error I saw in the homeworks. A lot of people were doing things like only testing isEmpty() on an empty list.

There’s one trap I should mention here. If you’re writing your test suite and thinking about how to achieve maximum coverage, one way to do it is to look at the source for your class while you’re writing the test suite and go through every method and branch. The problem with this is that it ties your test suite to implementation details of your code. It’s important to think about the logical cases of the underlying problem you’re solving. Consider the isNegative example. What does it return for n = 0? According to a mechanical coverage check, you don’t need to add a test for that, since you’ve already test both cases in the code. The zero case is something that it’s easy to get wrong, though. It’s the boundary between negative and positive. A good rule of thumb is to always write specific tests for boundary conditions. The isNegative above does the wrong thing, and it’s very easy to miss unless you explicitly check isNegative(0). The way to figure out where the boundary cases, the corner cases, the weird inputs which will give you problems are is to have a detailed mental picture of what a particular method is supposed to. If you understand what it really means to test whether a number is negative, it should occur to you that 0 is an interesting case to check. Think about ways to implement the functionality, and ways to implement it incorrectly. When comparing the size of two lists, you should probably test not only cases like {1, 2, 3} == {1, 2, 3, 4}, but also {1, 2, 3, 4} == {1, 2, 3}, because catching one but not the other is a common mistake to make. Figuring out what the easy mistakes are is hard. Of course, figuring out the hard mistakes is harder.

Also make sure to test the side effects and error conditions. If a method is supposed to throw particular exceptions on particular invalid inputs, does it? If LinkedList.addAll(Collection) is supposed to return true to indicate that the list was modified, does it return false when the collection is empty? A well-written spec makes this job a lot easier. Look at the documentation for the method and make sure you’re testing that it does everything that the documentation specifies, and exactly what the documentation specifies.

Another source of tests is bugs. When you find a bug, it indicates something that you forgot to test. When this happens, write a test case for it. You should do this before fixing the bug to verify that the test case fails when the bug is present. Then fix the bug and make sure that the test case starts passing. Things that you got wrong once are things that you’re liable to get wrong again as things change. These sorts of tests are called regression tests, because they’re testing that your quality is always moving forward and never regressing.

How to Test It

Take a look at the included PizzaTest class and Pizza documentation. I’ve written a package, Pizza, for determining a set of toppings that will make a group of people happy when they’re trying to order a pizza. Full source code for Pizza is on the web, see below for the URL.

The test suite is structured into groups of tests which test units of functionality. The simple classes, Topping and ToppingConstraint, have one group for each class. Pizza has a few different groups. I isolated each group so that it doesn’t depend on anything done in any of the other groups. Each group that needs to construct a Pizza initializes its own topping list. This way the test groups aren’t dependent on each other and a failure in one small area of the test suite won’t randomly break a bunch of tests that should work. In order for a test suite to be useful, you want it to help you figure out exactly what is failing. There are trade-offs, though. I use Topping.equals(Object), even in tests for completely unrelated things. These tests will break if Topping.equals is broken. It would be a lot of extra busywork to avoid using Topping.equals, and it couldn’t be done without tying myself to the internal makeup of Topping. I shouldn’t need to rewrite the entire test suite if another attribute is added to toppings! One solution to this would be to indicate in some fashion that some of the other groups of tests, such as the applyConstraints() tests, depend on the toppings() tests, and we shouldn’t even bother running the applyConstraints() tests if the toppings() tests fail. There are frameworks to help you write unit tests, such as JUnit, which allow you to express this.

The first test group, mustSetToppings(), is testing that an error condition is generated under circumstances that it should be and not generated under other circumstances. It’s also a good example of how to test whether or not an exception is thrown.

The second test group, toppings(), tests the Topping class. It’s a fairly trivial class, but we test it anyway. It’s nice to not have to worry about whether or not it’s working. The test suite can get things wrong, of course, so don’t get overconfident. Note that the way equality of toppings is defined, they must have both the same name and the same type, so the tests for Topping.equals(Object) test cases where they have the same name but different types and vice versa, not just a case where they’re completely different and a case where they’re completely identical. We also test the case of them being completely different. This way, if, say, the name equality test is broken, we will know exactly what went wrong, because the “same names, different types” test will fail if the name test is broken to return false negatives, and the “different names, same types” test will fail if the name test is broken to return false positives.

applyConstraints() is the most complicated test group. This makes sense, it’s testing the really hard bit of Pizza. The individual tests are straightforward, the tricky part was figuring out which tests there should be. To come up with those test cases, I spent a lot of time thinking about the different ways in which this could go wrong. I intentionally picked a loosely-specified problem to make this job more interesting. The problem that Pizza is attempting to solve, how it’s supposed to work, what sorts of results it should return… these are all somewhat open to interpretation. That’s often what you have to do when you’re programming. A lot of times, you’ll get a vague problem, and you have to figure out how to solve it. Sometimes these are “business requirements” handed to you by your boss, sometimes it’s you thinking that it would be cool to do “foo”. The homeworks and labs have been based on fairly detailed specifications, and there have still been ambiguities! It took me around four hours to write all of this code, Pizza and PizzaTest and PizzaMain, and at least an hour, maybe two hours, was writing the applyConstraints() tests. Most of that time was figuring out what tests I needed to write!

Further Resources

Pizza source code
Some other types of testing: integration testing, stress testing, fuzz testing, performance testing
JUnit is a framework for doing unit testing in Java. It handles a lot of the grunt work for you. There are some additional packages for it that will automatically measure the coverage of your test suite.

These lecture notes and all associated source code are in the public domain.

The Design of Laptop "fn" Keys

2006-02-13T08:59:00.002-08:00

On every PC laptop made in the past 5+ (10+ ?) years, many of the “F1” (F2, F3, …) keys, and sometimes some of the other keys (the arrow keys in particular) serve two purposes. When pressed normally, they act as their respective key — F1 acts as F1, etc. However, when pressed in conjunction with the “Fn” key, they perform a special function indicated by an icon on the key. Usually both the icon and the label on the Fn key will be blue (whereas the other key labels are white.) For instance:

Today, one of my professors tried to hook up his laptop to the projector and was befuddled when it didn’t work. As soon as I saw him struggling, I knew that the problem was that he had to turn on the external video out. PC laptops typically have three display output modes: internal LCD only, external (VGA, or sometimes DVI these days) connector only, or both internal and external simultaneously. In order to change the mode, one typically has to either use the Fn function of one of the F keys (typically F5, F6, or F7.) Sometimes it can also be done through some buried option in the Display control panel.

The reason I knew that this was a problem is because almost every single professor who I’ve seen hook a laptop up to a projector has had to do this and had no idea what they had to do or how they were supposed to do it. The notion of hitting Fn in conjunction with some other key didn’t even seem to occur to them. Here’s something that’s a common thing to need to do, and laptop designers have tried to come up with a design that affords doing it (Fn is always next to Ctrl, so it should be natural to interpret it as a modifier key, and the color labels reinforce hitting Fn in conjunction with specific keys), but their design has failed, even after it’s been around for so many years and people have had a chance to get accustomed to it. Why doesn’t their design work? (And why do they keep using it?)

One problem with the “Fn+blue” design is that the labels on the keys are almost always terrible. Okay, the volume labels — often the Fn+arrow keys will control the volume or generate page up/page down/home/end — are pretty recognizable, but it’s easy enough to control that from within Windows, and a dark blue label on a black background doesn’t stand out (although some keyboards are much worse about this than others, I’ve seen ones where you need to really squint to see that there’s anything there at all), so that one doesn’t tend to get noticed or remembered. Also, many professors, the group of users who I’ve had the most opportunities to observe trying to hook up a laptop to a projector, don’t use sound on their laptops, so they’ve never needed to control the volume. The label for toggling the display mode is either a cryptic image (), a slightly less cryptic image (), or a confusing text label ( — even those few users who know what a CRT is are unlikely to associate it with trying to use a projector, since a projector is not a CRT).

Another problem is that users don’t expect to have to turn on the VGA output. It doesn’t match any of their experiences with plugging things in (most of them haven’t plugged in digital audio cables to their sound cards or receivers), and it doesn’t even match their experiences in plugging in monitors to desktops. It also isn’t very consistent. Sometimes it does just work, because they happen to have been in “internal+external” mode, and then for no apparent reason, they’ll have gotten switched to internal-only the next day.

Finally, I don’t think that users conceptualize Ctrl, Alt, etc. as modifier keys; that is, keys which, when pressed in conjunction with another key, change the behavior of that key in a predictable way. I think they get conceptualized as chording keys, or parts of a two-key combination. Ctrl-S doesn’t get conceptualized as “like pressing S, but I’m also pressing Ctrl so it will behave differently than the way I normally press S.” Instead, Ctrl-S turns into “pressing these two keys together to act as a different key entirely.” Users are right. The change in behavior produced by modifier keys is so rarely systematic (what does it mean to “Ctrl” something? To “Alt” it?), and the behavior produced by the combination bears so little resemblance to the normal behavior of the base key (the act of saving has nothing whatsoever to do with the act of producing the letter ‘s’), that there’s no reason to expect people to think of Ctrl/Alt/Fn as modifiers. It’s even hard to say what “Ctrl” means as an independent concept. Ctrl (in Windows) means “do something”, which is pretty meaningless. Alt (again, in Windows) seems to mean “shortcut to menus”, but most users don’t know about that either. The consequence of this is that users don’t feel that the behavior of chording keys is something they can predict. If Foo-S has nothing to do with either Foo or S, but does something completely novel, why should one be able to intuit what Bar-S might do? Fn actually does have a meaning — manipulate system hardware functionality in the way stated by the blue labels on the key caps — but meaning is not something that users expect to find.

The way Apple does things is different in a revealing way. First, Apple doesn’t use color to differentiate between the Fn function of its keys and the standalone behavior, everything is the same shade of gray. They use position (unmodified behavior on the left side of the key cap, modified behavior on the right.) Second, the modified and unmodified behavior are the inverse of PCs. Color would probably be better. F4 by itself decreases the volume, and Fn+F4 produces F4. (The exception is the arrow keys, which are arrow keys unmodified and page up/page down/home/end in conjunction with Fn.) The only time I’ve ever had to use one of the F keys on a Mac is F5 to bring up autocomplete in Xcode, and F12 for Dashboard, so the fact that needing to press Fn+F4 to get F4 doesn’t leap out at a naiive user isn’t all that critical, the F keys get used a lot more in Windows (application menu items are often bound to F keys by default.) Third, it’s much easier to find the software display controls in Mac OS X — they’re obvious in the Displays system preferences panel, and if it’s something you do often, you can check a box in Displays to have it right in your menu bar. Finally, it’s much more likely to just work — automatically detect that you’ve plugged something in and start displaying to it — on a PowerBook than on any PC laptop I’ve seen. This is borne out by my experiences observing professors, PowerBook users are much less likely to need to do anything and much more likely to be able to figure out what to do when they do.

Interestingly, many PC desktop keyboards are emulating the Mac model these days. The current generation of PC keyboards with “media” keys, e.g. a dedicated key to change to the next track in WMP or open Internet Explorer, typically make the F keys serve double-duty as the extended keys, and have a “Fn lock” which defaults to on and must be turned off to get the standard F behavior. Oddly, most of those keyboards don’t also have an Fn key, which makes them a pain in the butt.

Enhancing Machine Translation via Frame-Semantic Data

2004-12-20T08:47:00.002-08:00

I’ve just finished my final assignment for the semester, a paper for LING 190. Click the title for the full text of the paper, read the abstract below, and see the cut at the bottom of this entry for a layperson’s explanation of the technical bits.

Enhancing Machine Translation via Frame-Semantic Data

Abstract

Frame semantics is an approach to examining meaning in natural language by considering clusters of related concepts. For instance, in the “commercial transaction” frame, there is a buyer, a seller, goods, and money; different predicates in this frame will place these agents in different syntactic roles, so, in English, the buyer will be the subject of buy while the seller will be the subject of sell.

Frame semantics presents a powerful aide to machine translation. Frame-semantic knowledge of an input phrase facilitates more precise word-sense disambiguation and allows greater flexibility in deciding which of multiple valid word orderings to emit in the target language. I have demonstrated this by creating a rudimentary system for translating from Spanish to English which can optionally take advantage of frame-annotated input, and then testing this system on a small corpus of phrases in the commercial transaction frame.

Glossary, in order of appearance in paper:

predicate: verb
word-sense disambiguation: For a word that has multiple meanings, figuring out which meaning a particular occurrence of that word is referring to.
corpus, corpora: A bunch of arbitrary text gathered from real-world sources. “Corpora” is the plural.
parsing: Taking natural language input and arranging it into phrases, subphrases, clauses, etc.
lemma: The root form of a word. For verbs, the lemma would be the infinitive form.
tokenization: Splitting a stream of natural language into a series of tokens, which are basically the same thing as words and punctuation. So, splitting “Hello, world!” into the following series of tokens: H e l l o , w o r l d !
tagging: Annotating each token with its part of speech, so in the sentence “he ran”, ‘ran’ would be marked as (amongst other things) a past-tense third-person singular verb.
lexicon: Dictionary.
gendered/neuter pronouns: “He” and “she” are gendered pronouns in English. “It” is a neuter pronoun in English. syntactic distribution of frame roles: This is referring to the way that particular frame roles are assigned to particular grammatical components in a particular frame-predicate (a particular predicate in a particular frame.) For instance, in English, “BUY” has the buyer as the subject, the seller in an optional “from” clause, the goods as the direct object, and money in an optional “for” clause.
area for further research: I’m too lazy to look into it / give me more grant money.
syntactically motivated: It’s a result of syntax.
collocations: Two words are considered collocates when they appear near each other more often than you’d expect given their respective frequencies.
anaphora resolution: When you have a pronoun, figuring out which noun it refers to.

Why Subnets are Good: The Party

2003-11-03T08:44:00.002-08:00

(This is an attempt to explain, to a non-technical audience, why a large network, such as the Brandeis network, should be divided into subnets. It comes from this bboard thread.)

The reasons why one wouldn’t want to have the entire campus be one huge subnet are technical, but I’ll try to explain them. Think of the Brandeis network as a huge party. It’s a tremendous party, with every student, faculty-member, and staff-member in one huge room. The university is sponsoring it to celebrate the grand opening of the Carl and Ruth Shapiro Supremely Massive Empty Room.

You think your friend Alice is at the party, and you want to talk to her. So, you have to find her. You don’t know where she is in the massive room, so you have to either shout really loudly, or spend a lot of time looking for her. Everyone else is also trying to find people, so everybody’s shouting and running around.

Furthermore, people are trying to get to different places in the room. Once people have found their friends, they need to walk over to them, plus people need to get to the exits, the bathrooms, the bar, the DJ, and so forth. Some people are even trying to dance. So, everybody is walking everywhere, and dancing, and shouting. This might be fun, if you’re into that kind of thing, but it’d be really hard to find Alice, walk over to her without bumping into people and without taking some crazy zig-zagging route, and have a conversation with her.

Then it gets worse. Bob, that insufferable boor, has too much to drink, and gets in a fight with Charlie. Their shouting and scuffling drown out the rest of the activity in the room, and makes it impossible for anyone to have a good time until Public Safety comes and escorts them out. It takes a long time for Public Safety to locate Bob and Charlie in the room, despite their loudness, because the crowd is so thick, and even when Bob and Charlie are found, it takes many minutes for the officers to push through the crowd and get the drunks removed.

Well, the party is a disaster. Carl and Ruth get separated by people running between them trying to get to Alice, and they can’t find each other until 3AM. Jehuda spends the whole night trying to dance the Electric Slide, but people are shouting over the music and he can’t keep the beat.

To make things up to the Shapiros, Brandeis decides to throw another party. This time, they decide to get Brandeis’s most renowned social event planners to help them out with it, and so they go to the ITS department, whose infamous VoIP Rollout Bash is still whispered about in revered tones.

ITS throws the party in a large house, with many different rooms. Each room has between fifty and a couple of hundred people in it. Furthermore, people are assigned to rooms alphabetically, so you know which room everyone is in. Each room has a robot in it which can relay sounds, 3D images, smells, and even the sensation of someone dancing with you, from one room to another. This robot is so powerful, that all you have to do is say, at a normal conversational volume, “Hi, Alice?”, and the robot will zip into the corridor, fly down to some out-of-the-way closet, and give the message to a page, which will zip it over to the robot in Alice’s room. The robot is almost never too busy, the layout of the corridors is so efficient that messages can get between rooms instantaneously.

People can find each other and talk to each other, and are having a good time. You and Alice dance the night away. Bob and Charlie get drunk and start fighting again, but only the people in their room are affected, since the robot doesn’t pass on their alchohol-fueled shouting; also, Public Safety knows what room they’re in, and because the room is much smaller they can get them escorted out much more quickly. The party is a smashing success. Jehuda wins the dance competition, and Carl and Ruth have a wonderful time.

When Daemons Attack: Debugging Linux Applications

2003-05-01T09:14:00.002-07:00

Notes from a talk I gave to the Brandeis University Computer Operators Group.

Call tracing
- System call tracing — strace (Linux), truss (BSD), strace for NT
- Library call tracing — ltrace (Linux/BSD)
Trace example (ltrace -S)
- First, the program is linked and loaded…

 
 SYS_uname(0xbffff600)                            = 0
 SYS_brk(NULL)                                    = 0x0804c000
 SYS_open("/etc/ld.so.preload", 0, 010000210574)  = -2
 SYS_open("/etc/ld.so.cache", 0, 00)              = 3
 SYS_fstat64(3, 0xbfffeda0, 0x400114ac, 0, 0x400115e4) = 0
 SYS_mmap(0xbfffed70, 0, 0x400114ac, 3, 0x40011594) = 0x40012000
 SYS_close(3)                                     = 0
 SYS_open("/lib/libc.so.6", 0, 027777767210)      = 3
 SYS_read(3, "\177ELF\001\001\001", 1024)         = 1024
 SYS_fstat64(3, 0xbfffedf0, 0x400114ac, 0, 0x400115e4) = 0
 SYS_mmap(0xbfffecd0, 0x40011d30, 0x400114ac, 2, 0xbfffecf0) = 0x40029000
 SYS_mprotect(0x40131000, 32292, 0, 2, 0xbfffecf0) = 0
 SYS_mmap(0xbfffecd0, 0x40011d30, 0x400114ac, 0x40011d30, 0xbfffed08) = 0x40131000
 SYS_mmap(0xbfffecd0, 0, 0x400114ac, 0x40137000, 0xbfffed08) = 0x40137000
 SYS_close(3)                                     = 0
 SYS_mmap(0xbffff360, 8, 0x400114ac, 4096, 112)   = 0x40139000
 SYS_munmap(0x40012000, 92718)                    = 0

We’ve finally reached the program’s main statement.

 
 __libc_start_main(0x08048b28, 2, 0xbffffa94, 0x08048760, 0x08049b40  
 setlocale(6, "")                                 = "C"
 bindtextdomain("coreutils", "/usr/share/locale"  
 SYS_brk(NULL)                                    = 0x0804c000
 SYS_brk(0x0804d000)                              = 0x0804d000
 SYS_brk(NULL)                                    = 0x0804d000
 <... bindtextdomain resumed> )                   = "/usr/share/locale"
 textdomain("coreutils")                          = "coreutils"
 __cxa_atexit(0x08048f34, 0, 0, 13, 0x08049edd)   = 0
 getenv("POSIXLY_CORRECT")                        = NULL
 getopt_long(2, 0xbffffa94, "+", 0x0804a080, NULL) = -1
 fputs_unlocked(0xbffffbe2, 0x40132c40, 0x080489f8, 20304, 0xbffffa50

strace shows this next line as fstat64(1, {st_mode=S_IFCHR|0620, st_rdev=makedev(136, 4), ...}) = 0 — that’s why you should try strace before going to ltrace, it’s prettier.

 
 SYS_fstat64(1, 0xbffff8d0, 0x40135f60, 0x40136560, 0) = 0

This is libc’s fputs allocating a buffer.

 
 SYS_mmap(0xbffff8b0, 0xbffff8d0, 0x40135f60, 0x40132c40, 4096) = 0x40012000
 <... fputs_unlocked resumed> )                   = 1
 exit(0  
 __fpending(0x40132c40, 0x400114ac, 0x400116d8, 0x08048728, 0x40135f60) = 13
 fclose(0x40132c40  
 SYS_write(1, "Hello, world\n", 13Hello, world
 )               = 13

echo explicitly closes stdout for some reason.

 
 SYS_close(1)                                     = 0
 SYS_munmap(0x40012000, 4096)                     = 0
 <... fclose resumed> )                           = 0
 SYS_exit_group(0  
 +++ exited (status 0) +++

Example: Finding errors with strace
- foo: No such file or directory
- Negative return values usually indicate errors, so try grepping for them
- If the program is printing an error message, start at where that error message is printed and work backwards

 
 open("/usr/lib/locale/locale-archive", O_RDONLY|O_LARGEFILE) = 3
 fstat64(3, {st_mode=S_IFREG|0644, st_size=1384816, ...}) = 0
 mmap2(NULL, 1384816, PROT_READ, MAP_PRIVATE, 3, 0) = 0x4019a000
 close(3)                                = 0
 ioctl(1, SNDCTL_TMR_TIMEBASE, {B38400 opost isig icanon echo ...}) = 0
 ioctl(1, TIOCGWINSZ, {ws_row=57, ws_col=158, ws_xpixel=1584, ws_ypixel=1144}) = 0
 brk(0)                                  = 0x805a000
 brk(0x805d000)                          = 0x805d000
 stat64("foo", 0x8059a5c)                = -1 ENOENT (No such file or directory)
 lstat64("foo", 0x8059a5c)               = -1 ENOENT (No such file or directory) 
 write(2, "ls: ", 4ls: )                     = 4
 write(2, "foo", 3foo)                      = 3
 open("/usr/share/locale/locale.alias", O_RDONLY) = 3
 fstat64(3, {st_mode=S_IFREG|0644, st_size=2627, ...}) = 0
 old_mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x40012000
 read(3, "# Locale name alias data base.\n#"..., 4096) = 2627
 brk(0)                                  = 0x805d000
 brk(0x805e000)                          = 0x805e000
 read(3, "", 4096)                       = 0
 close(3)                                = 0
 munmap(0x40012000, 4096)                = 0
 open("/usr/share/locale/en_US.ISO-8859-1/LC_MESSAGES/libc.mo", O_RDONLY) = -1 ENOENT (No such file or directory)
 open("/usr/share/locale/en_US.iso88591/LC_MESSAGES/libc.mo", O_RDONLY) = -1 ENOENT (No such file or directory)
 open("/usr/share/locale/en_US/LC_MESSAGES/libc.mo", O_RDONLY) = -1 ENOENT (No such file or directory)
 open("/usr/share/locale/en.ISO-8859-1/LC_MESSAGES/libc.mo", O_RDONLY) = -1 ENOENT (No such file or directory)
 open("/usr/share/locale/en.iso88591/LC_MESSAGES/libc.mo", O_RDONLY) = -1 ENOENT (No such file or directory)
 open("/usr/share/locale/en/LC_MESSAGES/libc.mo", O_RDONLY) = -1 ENOENT (No such file or directory)
 write(2, ": No such file or directory", 27: No such file or directory) = 27 
 write(2, "\n", 1
 )                       = 1
 exit_group(1)

Source-level Debugging (This is why having source code is good…) Useful gdb commands:
- bt (backtrace) — Shows the stack of function calls
- b (breakpoint) — Sets a breakpoint
- print expr
Example: Using gdb

 
 /* Silly C program to print numbers */ 
 
 #include <stdio.h> 
 #include <stdlib.h> 
 
 void fillarray(int *array, int size) { 
     int i;
 
     for(i = 0; i &lt; size; i++) array[i] = i;
 } 
 
 int main(int argc, char *argv[]) { 
     int *numarray, i;
 
     numarray = malloc(atoi(argv[1]));
     fillarray(numarray, sizeof(numarray));
     for(i = 0; i &lt; sizeof(numarray); i++) printf("%d\n", numarray[i]);
 
     return 0;
 }

 
 bash$ printnums
 Segmentation fault
 bash$ gcc -ggdb -o printnums printnums.c
 bash$ gdb printnums
 (gdb) r
 Starting program: /home/matthewg/printnums 
 
 Program received signal SIGSEGV, Segmentation fault.
 0x40052a6e in __strtol_internal () from /lib/libc.so.6
 (gdb) bt
 #0  0x40052a6e in __strtol_internal () from /lib/libc.so.6
 #1  0x40050849 in atoi () from /lib/libc.so.6
 #2  0x080483e3 in main (argc=1, argv=0xbffffa04) at printnums.c:15
 (gdb) up
 #1  0x40050849 in atoi () from /lib/libc.so.6
 (gdb) up
 #2  0x080483e3 in main (argc=1, argv=0xbffffa04) at printnums.c:15
 15        numarray = malloc(atoi(argv[1]));
 (gdb) print argv[1]
 (gdb) print argc
 $4 = 1
 (gdb) quit
 A debugging session is active.
 Do you still want to close the debugger?(y or n) y

The program’s expecting a command-line argument, we forgot to give it one or check for that condition.

 
 Continuing.
 0
 1
 2
 3
 
 Program exited normally.
 (gdb) b 20
 Breakpoint 2 at 0x8048451: file printnums.c, line 20.
 (gdb) r
 Starting program: /home/matthewg/printnums 1
 
 Breakpoint 2, main (argc=2, argv=0xbffff9f4) at printnums.c:20
 20        numarray = malloc(atoi(argv[1]));
 (gdb) s
 21        fillarray(numarray, sizeof(numarray));
 (gdb) print sizeof(numarray)
 $7 = 4
 (gdb) print argv[1]
 $8 = 0xbffffb64 "1"
 (gdb) c
 Continuing.
 
 Breakpoint 1, fillarray (array=0x804a008, size=4) at printnums.c:9
 9        for(i = 0; i < size; i++) array[i] = i;
 (gdb) clear
 Deleted breakpoint 1 
 (gdb) c
 Continuing.
 0
 1
 2
 3
 
 Program exited normally.
 (gdb) quit

sizeof(foo) doesn’t do what the programmer thought it did
- Replacing Library Functions fputs.c source:

 
 #define _GNU_SOURCE 
 #include <stdio.h> 
 #include <dlfcn.h> 
 
 int fputs_unlocked(const char *s, FILE *stream) { 
         int (*orig_fputs)(const char *, FILE *);
         int retval;
 
         orig_fputs = dlsym(RTLD_NEXT, "fputs_unlocked");
 
         printf("Doing fputs...\n");
         retval = orig_fputs(s, stream);
         printf("fputs returning %d.\n", retval);
 
         return retval;
 }

 
 bash$ gcc -shared -ldl -o fputs.so fputs.c
 bash$ LD_PRELOAD=./fputs.so /bin/echo "Hello, world"
 Doing fputs...
 Hello, worldfputs returning 1.
 
 bash$

Why is there a line break after “fputs returning 1.” and none between that and “Hello, world” ?