[Cdn-DMCA] Proposal: collaborative comments on the Commons debates

Kristofer Coward kris at melon.org
Tue Apr 30 20:08:56 EDT 2002


The proposed system seems rather similar to a slash site - it might
just be easier to make the appropriate modification to slash than to
code this from the ground up.

On Tue, Apr 30, 2002 at 07:13:19PM -0400, mskala at ansuz.sooke.bc.ca wrote:
> 
> OK, I still don't know how to make a game out of this, but here's what I
> imagine in the way of a database-driven comment system:
> 
> * Have a script that parses each day's Hansard, capturing all the names
>   and headings and capturing them into a database.  The Government
>   postings already contain link targets that could be stored in the
>   database.
> 
> * Store comments in the database too.  Each comment would be associated
>   with a specific point in some issue of Hansard.
> 
> * A "main" screen showing the N most recently added comments in the
>   database, with columns for who wrote the comment, which member was
>   speaking, and that member's party.  Also shown for each comment would be
>   the comment itself, and a link that would take the user to that point in
>   the annotated Hansard (see below).
> 
> * By clicking on any of the column values on the main screen, users could
>   limit the display to comments having that value.  For instance, if
>   there's a comment from me, they could click on my name and see a
>   similar display of only my comments.  Or they could click on the party
>   affiliation of the member speaking, and see commonts only on that
>   party's members.  These would be cumulative, if you wanted to see only
>   my comments on a particular party or whatever.
> 
> * Every comment would have a positive or negative point value and these
>   pages would show the total for comments matching the filter.
> 
> * Each page would also be available in RSS format (trivial to do, just use
>   the same code to spit out the results in XML instead of HTML) so people
>   could keep an eye on it with an appropriate client.
> 
> * A script to provide an "annotated" Hansard, which I imagine as being a
>   cached copy of the one from the Government's site, with extra icons
>   added.  Anywhere users could add a comment, there would be an icon to
>   click to add a comment at that point; for any points where there were
>   already comments, there would be a little icon and note inserted (N
>   comments, click to read them).  This would preferably be available, just
>   like the files on www.parl.gc.ca, as both a complete transcript of the
>   day's events, and in 5-minute segments.
> 
> * Other scripts could be added to do things like the top N Members by
>   score, the form-letter thing I mentioned, and so on.  Also necessary
>   would be the add-a-comment script, and administrative stuff for adding
>   users, deleting users and comments, etc.
> 
> What do you think?  Is this something we'd like to have on our Web site?
> (For some value of that; I know this list reaches the people in charge of
> several different sites.)  Am I missing anything critical?  Would we be
> able to get people to use it?
> 
> Some technical issues:
> 
> * My preferred platform for developing stuff like this is PHP4, with
>   either mySQL or PostgreSQL.  I have both database packages on my system
>   at home, but it isn't network-accessible.  I can post things for
>   debugging purposes on a friend's machine, www.edifyingfellowship.org - 
>   a "production" site wouldn't be welcome there for traffic reasons, but
>   testing/debugging would be fine.  That system supports only PostgreSQL,
>   not mySQL, so if we wanted to port from there to a mySQL-based system
>   elsewhere, then some rewriting and conversion would be necessary.
> 
> * If this were going to be hosted on a system that already has a user
>   account database (for phpSlash or similar) then it might be desirable to
>   write code to connect with those user accounts instead of having a
>   separate account base for the parliamentary-comment system; then we'd be
>   spared of having to deal with account management ourselves.
> 
> * Disk space requirements: an issue of Hansard, in English, is about
>   600K.  Double that if we include French as well.  Double again if we 
>   store both the "complete" file and the "in 5-minute segments" files; divide
>   by two, maybe, to account for compression.  We could probably fake one 
>   of {complete,segmented} by splitting or joining the other, although 
>   if space is cheap it would be nicer not to have to, because storing 
>   both would allow better synch with the Government site.  My guess is
>   that if we didn't cache the actual text, but only stored the "heading"
>   information, that would take about half as much space, counting database
>   overhead.  If we *did* cache the text we'd probably still want to store
>   the headings in a database; my bottom line rough estimate is that we'd
>   have about 2M of data to store per day of Hansard, plus whatever
>   comments people add.  That's not a huge amount of disk space but is
>   enough to be worth thinking about; I wouldn't want to have to store it
>   on ansuz.sooke.bc.ca with my 100M space limit.
> 
> * We could have lots of "fun" figuring out how to represent what happens
>   in the database when Members change parties, change portfolios, resign
>   mid-term and get replaced, etc.  Other people speak in Parliament
>   besides regular Members - for instance, the Speaker and his deputies.
>   Also, I anticipate parsing problems with situations like the Committee of
>   the Whole, and the editorial notes that occasionally get inserted when, 
>   for instance, Members speak in languages other than English and French.
> 
> Some non-technical issues:
> 
> * Such a system would need people to be "operators", to keep an eye on it
>   and make sure everything was going smoothly.  I could forsee
>   vandalism/trolling problems; a "lack of critical mass" problem if we
>   ever got into a situation where there were no recent comments in the
>   system; and all kinds of fun when (as always happens eventually, with
>   systems designed to automatically parse other people's
>   for-human-consumption postings) the Government Web people changed the
>   format of the Parliamentary site.
> 
> * What's the copyright on Hansard, and would this violate it in any way?
> 
> * To what extent should or could such a project be bilingual?
> 
> * Area of coverage: I am most interested in the House of Commons debates,
>   but interesting things happen in the Senate and the Provincial
>   legislatures too, and many if not all Parliamentary debates in Canada
>   are online and could be fodder for such a system.
> 
> * Programming: I think I can write a parser and basic query script, but I
>   don't have time and energy to do all of the development for a nice
>   idiotproof system with all the features I've talked about.  Do we have,
>   or can we recruit, other people who would participate in building it?
>   I'm actually more concerned about recruiting "operators", because
>   that's an activity I hate doing, whereas building new stuff is an
>   activity I enjoy and will do as much as I have time for.
> 
> If a system like that described above seems too large or complicated,
> there may be lighter-weight things we could do instead that would still be
> valuable and serve many of the same purposes.  What I've described sounds
> a lot like what a "Wiki" does; perhaps we could simply build a parser that
> imports Hansard into the database of an existing Wiki package.  Then we'd
> get all the collaboration features for free.  Less work to build, but my
> guess is that it would also be less nice to use.  A good thing about that,
> though, is that if we wanted a Wiki for other reasons (for instance, the
> "open dictionary" that has been discussed), they could seamlessly merge.
> -- 
> Matthew Skala
> mskala at ansuz.sooke.bc.ca                    Embrace and defend.
> http://ansuz.sooke.bc.ca/
> 
> --
> For (un)subscription information, posting guidelines and
> links to other related sites please see http://www.flora.org/dmca/

-- 
Kristofer Coward				http://unripe.melon.org/
GPG Fingerprint: 2BF3 957D 310A FEEC 4733  830E 21A4 05C7 1FEB 12B3
--
For (un)subscription information, posting guidelines and
links to other related sites please see http://www.flora.org/dmca/



More information about the Discuss mailing list