Pete Carroll History

Here’s a GREAT summary of Pete Carroll’s career within football.

http://usctrojans.collegesports.com/sports/m-footbl/mtt/carroll_pete00.html

Make sure to check out the chart at the bottom to see a good breakdown of his history. This is a guy who’s been much maligned as being a failure in the NFL, but if you take a closer look at his profile, you can see that he spent a long career building up experience as an effective defensive coordinator. It’s no wonder his tenure with Norm Chow at the offense has led to such great seasons as of late.

What I find amazing is that Steve Sample et. al somehow had the foresight to take the head coach of the New York Jets from a 6-10 season and hand him the keys to a football team of Time’s University of the Year in 2000. USC was up-and-coming, but our football team was an embarrasment, losing to UCLA for 8 years in a row. The reputation of the university was booming, but the football needed a serious revitalization.

I still remember being served daily humiliation from all walks of Bruin life! Then, getting last-minute tickets to the rivalry game meant seeing a matchup that we were only likely to win because USC wasn’t as bad as UCLA in late 1999. When we finally broke the losing streak, all of a sudden, the Bruins I knew were no longer interested in the rivalry!

All of this leads up to why tomorrow’s game is going to be such a fun display of absolute crushing defeat for the Bruins. USC is on the peak of its game, and the only team that comes close in the PAC-10 are the Bears, who came fairly close to repeating an excuse for the BCS not to place the Trojans in 1st place where they belong. Not this year, however, as the Trojans are sure to be #1 in whatever poll or calculator you pick this season.

Fight on!!!

Media Mammon, a play stock market of memes

This is a fun idea:

Media Mammon is a meme-based stock market. You can create accounts for free and buy or sell shares of search terms that rise or fall in popularity based upon how much the search term appears.

Myself, I just listed “Media Mammon” for $25 play dollars and then bought 215 shares.

Talk about unethical journalism. Heh.

It doesn’t use typekey to authorize accounts, so it would be pretty easy to just flood the market with a ton of automatically created user accounts.

(via)

The Black Cards

I’d heard rumors of superyuppies carrying around exclusive ‘black’ credit cards. At first, I thought these cards would carry a permanent 2% apr in the hope of capturing very expensive impulse purchase habits, but apparently, I was wrong.

http://money.msn.co.uk/Bank_Plan/Cards/SpecFeat/TakeTheCredit/Update1/default.asp The Amex Centurion epinions review The Amex Centurion Urban Legends Page (What’s the opposite of debunk?) The Beyond Black from Quintessentially.

I can see why heavy travelers can make use of free traveler’s upgrades, but honestly, is it a race to see who can have the credit card with the highest fees? Admittedly, it might be fun to pay for a Bentley on a piece of plastic… but to me, these are clearly not the realm of the self-made millionaire.

Did Clinton Gut the Military?

I’m kind of sick of hearing this from various sources, so I went to do some research into government-supplied budgetary data. I threw it into some Excel and ran some charts. Have a look for yourself. I plan on adding some more research there to investigate the various claims I hear from time to time.

One thing I did learn from this process is that budgeting and policy is pretty complex stuff. You might think that our leaders might have to back up their claims with better data than a simple dollar figure spent on National Defense. You might also think that a single number wouldn’t fully disclose changing natures of defense and military structure.

It happens to be that you don’t even need a single number to make a good sound bite. You can get away with a generalization, and as long as you repeat it long enough, those who hear it will believe.

Summarizations of Applying Access Control to Search

One of the issues with providing search functionality to corporate knowledge is that access control is in full effect; you can’t simply do full indexing of everything available, because each user has a different set of data available to them. It also seems like everyone writes their own access control system as well (including yours truly), complicating matters. I’ll overview some of the interesting stuff going on and then go on to list some ideas for implementation in open source.

My Summary:

The critical issue is whether to integrate your access control system into your indexing process, or to modularize it into its own component and provide an interface to the search application. If you modularize, you end up doing a roundtrip check to the security module while iterating over result sets interminably. If you integrate, then you have to inflate your indexes considerably, which probably doesn’t scale too well. Not to mention that you now probably have an mini (or full) access control system in parallel to your main access control system, which you now must maintain and replicate successfully.

Either way, it seems like a tricky problem. Here are my thoughts about practical implementation:

  • If you really want to keep the modularization, it might be possible to create some sort of batch access control check. Instead of iterating one check at a time, bundle up a chunk of your result set, send it over the wire to the access control system, and get back a matrix of results. Might work a little better and would probably incur less network overhead, even though it’s still the primitive solution.
  • If your set of credentials and content is manageable (and if you don’t mind being a jackass), you can try an unscalable solution of performing an exhaustive indexing operation at scheduled intervals for each credential over the entire set of content, cached at the search application level. This is also a primitive solution but probably would result in fast queries.
  • If your access control system does caching, that will help, but the first time is still going to be quite a nasty hit, and why would you search twice on the same terms in the same session?
  • This is kind of a stupid idea, but what if you could decouple the search application from your standard web idea of a search application, and treat it more like a P2P network search? In P2P applications, you enter terms, hit search, and alt-tab or go away and come back when it’s had more time to look around. This probably isn’t acceptable in web application UX unless the user understands that secure web applications with private content requires special handling. Good luck on that one. If they’re in their browser, they probably expect google.
  • For a more metadata-oriented access control solution, it might be possible to run and maintain multiple indexes, partitioned by metadata property, that basically consider themselves static content sets. Then, when you search over a user credential, you can leverage parallel checks to multiple indexes for each metadata property the user has access to. Then, you’ve probably got some set mathematics to perform on the parallel result sets that are returned, based upon the relationships between the metadata. This is some limited integration with the access control system, and is a pretty heinous idea, but it might work better in heavy or complex data sets if the processing power is available.

Research References:

There are very few providers or researchers that i’ve found doing work in this area. It even seems like nobody’s coined a proper term for such functionality in search, so I list some terms you can google for at the bottom.

Netegrity / Inktomi – SiteMinder

Netegrity collaborating with Inktomi have apparently abstracted out RBAC into Netegrity’s SiteMinder software. It connects to LDAP on the user management side, and integrates with Inktomi’s Enterprise Search Security Module to basically do a last-step check on each search result returned. It’s the primitive solution, and has a host of performance issues involved with abstracting the permissions system out of the search component. This is apparently the only commercial solution to the problem that I could find. They even say they’re the only vendor inside their PDF! If it’s in the PDF, it must be true.

  • If your search wants 100 results, do you just use that as a parameter for the initial grab of results? Or do you use that as a goal, and continue checking results until it all adds up to 100?

http://www.netegrity.com/partners/related/InktomiDatasheet.pdf

XenIntranet

There’s a reference in the changelogs for XenIntranet to adding access control to search. It looks like they use a custom ACL solution, and probably integrated it directly. See comments far below.

http://www.xenintranet.com/changelogs.php

Stanford Peers

This paper from Stanford people Mayank Bawa, Roberto Bayardo Jr., and Rakesh Agrawal describes a Privacy-Preserving Index. It also complains about the lack of Private information search technology, but the solution it posits seems to be more about preventing reverse engineering of data availability through special algorithms for building distributed indexes. The powerpoint below has animations describing the techniques.

*** Update – Mayank Bawa was kind enough to write me and point me to the original powerpoint slides for the presentation, so I changed the link below and removed my snarky comment about Stanford (full disclosure: I went to USC). Thanks, Mayank!

http://www-db.stanford.edu/~bawa/Pub/ppi.ppt

They do list a couple of interesting links.

The Stanford Peers P2P homepage. That’s interesting, that resource discovery over P2P networks may have a lot to do with access-controlled search. This page lists a lot of resources for reading on P2P network topics, but it’s a little stuffy in there.

IBM’s YouServ, a distributed personal webserver at use within IBM for web hosting / file sharing.

Chris Weider

This early paper (’96!) from Chris Weider seems to touch briefly upon some of these issues in the second from last paragraph. It seems to be more concerned with exposing for-pay content to public users via normal search tools. Some solutions it describes are indexing proxies, which might index and expose for-pay content via search tools only. Think a9′s searching through book content for keywords.

http://www.isoc.org/isoc/whatis/conferences/inet/96/proceedings/a2/a2_1.htm

MIT Computation Structure Group

This bunch of people from MIT seem to be barking up the right tree. However, they like to use a bunch of big words. I think that when you’re dealing with a subject as complex as access control mapped onto search, you need to give your reader a bit of a break when it comes to academic huffing and puffing. Anywho, to summarize, they also complain about the performance implications of Netegrity-Inktomi-adopted approach of completely modularizing access control, and are working on integrating ACL’s into the Intenational Naming System.

http://www.csg.lcs.mit.edu/pubs/reports/search3.pdf, referenced in the MIT Computation Structure Group’s Search Project

If you want to do more research

Here are some of the terms I searched with that turned up goodies:

“permission-based search” “access-controlled search”

Possible outlets for implementation

  • Integrating a Lucene port with a standardized access control system – I might end up doing this with a customized access control system.

Anyway, if you’re researching or doing development on anything like these things, I would love to hear from you. gluk AT padtie dot com.