knowledge-database (beta)

Current group: comp.arch

[OT?] FBI Virtual Case File is even possible?

[OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
Del Cecchi
 Re: [OT?] FBI Virtual Case File is even possible?  
Bernd Paysan
 Re: [OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
Bernd Paysan
 Re: [OT?] FBI Virtual Case File is even possible?  
Casper H.S. Dik
 Re: [OT?] FBI Virtual Case File is even possible?  
del cecchi
 Re: [OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
Bernd Paysan
 Re: [OT?] FBI Virtual Case File is even possible?  
jgrove24 at hotmail.com
 Re: [OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
mike
 Re: [OT?] FBI Virtual Case File is even possible?  
Stephen Fuld
 Re: [OT?] FBI Virtual Case File is even possible?  
mike
 Re: [OT?] FBI Virtual Case File is even possible?  
Martin Rodgers
 Re: [OT?] FBI Virtual Case File is even possible?  
Stephen Fuld
 Re: [OT?] FBI Virtual Case File is even possible?  
Bernd Paysan
 Re: [OT?] FBI Virtual Case File is even possible?  
Stephen Fuld
 Re: [OT?] FBI Virtual Case File is even possible?  
Andrew Reilly
 Re: [OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
krasicki
 Re: [OT?] FBI Virtual Case File is even possible?  
krasicki
 Re: [OT?] FBI Virtual Case File is even possible?  
jgrove24 at hotmail.com
 Re: [OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
Stephen Fuld
 Re: [OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
Stephen Fuld
 Re: [OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
Some Guy Who Doesn't Want Spam
 Re: [OT?] FBI Virtual Case File is even possible?  
krasicki
 Re: [OT?] FBI Virtual Case File is even possible?  
Stephen Sprunk
 Re: [OT?] FBI Virtual Case File is even possible?  
Jan_Vorbrüggen
 Re: [OT?] FBI Virtual Case File is even possible?  
Stephen Sprunk
 Re: [OT?] FBI Virtual Case File is even possible?  
Eugene Miya
 Re: [OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
Eugene Miya
 Re: [OT?] FBI Virtual Case File is even possible?  
del cecchi
 Re: [OT?] FBI Virtual Case File is even possible?  
Terje Mathisen
 Re: [OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
mike
 Re: [OT?] FBI Virtual Case File is even possible?  
Rick Jones
 Re: [OT?] FBI Virtual Case File is even possible?  
Del Cecchi
 Re: [OT?] FBI Virtual Case File is even possible?  
Terje Mathisen
 Re: [OT?] FBI Virtual Case File is even possible?  
Del Cecchi
 Re: [OT?] FBI Virtual Case File is even possible?  
Rick Jones
 Re: [OT?] FBI Virtual Case File is even possible?  
del cecchi
 Re: [OT?] FBI Virtual Case File is even possible?  
Anne & Lynn Wheeler
 Re: [OT?] FBI Virtual Case File is even possible?  
Edward Wolfgram
 Re: [OT?] FBI Virtual Case File is even possible?  
jgrove24 at hotmail.com
 Re: [OT?] FBI Virtual Case File is even possible?  
Anne & Lynn Wheeler
 Re: [OT?] FBI Virtual Case File is even possible?  
Stefan Monnier
 Re: [OT?] FBI Virtual Case File is even possible?  
Stefan Monnier
 Re: [OT?] FBI Virtual Case File is even possible?  
del cecchi
 Re: [OT?] FBI Virtual Case File is even possible?  
Nick Maclaren
 Re: [OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
del cecchi
 Re: [OT?] FBI Virtual Case File is even possible?  
Robert Myers
 Re: [OT?] FBI Virtual Case File is even possible?  
Eric P.
 Re: [OT?] FBI Virtual Case File is even possible?  
Nick Maclaren
 Re: [OT?] FBI Virtual Case File is even possible?  
krasicki at consultant.com
 Re: [OT?] FBI Virtual Case File is even possible?  
rmyers1400 at comcast.net
 Re: [OT?] FBI Virtual Case File is even possible?  
lynn at garlic.com
 Re: [OT?] FBI Virtual Case File is even possible?  
mike
From:Robert Myers
Subject:[OT?] FBI Virtual Case File is even possible?
Date:Fri, 14 Jan 2005 11:50:57 -0500
Greetings,

News articles say FBI is probably going to scrap its $170 Million
Virtual Case File system and start over from scratch.

The problem is characterized as a software issue. Not being an
enterprise architect, I would have thought the problem was an enterprise
architecture issue: hardware, software, and mostly organizational wetware.

I'm posting here because of the number of posters and lurkers with
significant architectural experience in mainframe and enterprise systems.

One article says the FBI wants to do prognostic data mining. Who
doesn't? For $170 Million, the FBI is going to invent a new field (for
which there surely must be players on Wall Street who would pay ten
times as much or more for the same capability), replace hardware, create
a nationwide secure network that can handle the most sensitive national
security information _and_ be accessible to garden-variety FBI agents,
_and_ that said agents will be willing to use. Even if they didn't have
to upgrade a system currently in use (which is what Sen. Patrick Leahy
has said is the problem), it sounds like somebody slipped a decimal
point (at least) in time and money.

This is the future of computing as visualized from when spinning tape
drives were the very symbol of the awesome capacity of computers to
acquire and to sort through information: a computer that gobbles up
everything, sifts through it all, and tells the hapless human what to do
next. Still science fiction.

Projects that come to mind that have to work and that historically have
worked are all mainframe-type systems, some of which I think are still
using System 360-vintage software. You don't invent the future all at
once, you do it piece by piece, and you probably hire a mainframe house
to do it for you over a period of decades. SAIC couldn't do it for $170
Million in a few years? Gimme a break.

If space-type programs didn't have such a lousy track record (the
Manhattan and Apollo projects worked, the Space Shuttle, the War on
Cancer, and Japan's Fifth Generation AI Project didn't), I'd say that a
space-type program was what would really be required, with that kind of
visibility and commitment. Won't happen, whether it would be a good
idea or not, because time and the tides have moved past computing for
that sort of attention.

Maybe they should hire Dell and an offshore software house--the
combination that has set the expectation for the price tag and for what
constitutes a working computer architecture.

RM
From:Del Cecchi
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Fri, 14 Jan 2005 10:58:06 -0600
Robert Myers wrote:
> Greetings,
>
> News articles say FBI is probably going to scrap its $170 Million
> Virtual Case File system and start over from scratch.
>
> The problem is characterized as a software issue. Not being an
> enterprise architect, I would have thought the problem was an enterprise
> architecture issue: hardware, software, and mostly organizational wetware.
>
> I'm posting here because of the number of posters and lurkers with
> significant architectural experience in mainframe and enterprise systems.
>
> One article says the FBI wants to do prognostic data mining. Who
> doesn't? For $170 Million, the FBI is going to invent a new field (for
> which there surely must be players on Wall Street who would pay ten
> times as much or more for the same capability), replace hardware, create
> a nationwide secure network that can handle the most sensitive national
> security information _and_ be accessible to garden-variety FBI agents,
> _and_ that said agents will be willing to use. Even if they didn't have
> to upgrade a system currently in use (which is what Sen. Patrick Leahy
> has said is the problem), it sounds like somebody slipped a decimal
> point (at least) in time and money.
>
> This is the future of computing as visualized from when spinning tape
> drives were the very symbol of the awesome capacity of computers to
> acquire and to sort through information: a computer that gobbles up
> everything, sifts through it all, and tells the hapless human what to do
> next. Still science fiction.
>
> Projects that come to mind that have to work and that historically have
> worked are all mainframe-type systems, some of which I think are still
> using System 360-vintage software. You don't invent the future all at
> once, you do it piece by piece, and you probably hire a mainframe house
> to do it for you over a period of decades. SAIC couldn't do it for $170
> Million in a few years? Gimme a break.
>
> If space-type programs didn't have such a lousy track record (the
> Manhattan and Apollo projects worked, the Space Shuttle, the War on
> Cancer, and Japan's Fifth Generation AI Project didn't), I'd say that a
> space-type program was what would really be required, with that kind of
> visibility and commitment. Won't happen, whether it would be a good
> idea or not, because time and the tides have moved past computing for
> that sort of attention.
>
> Maybe they should hire Dell and an offshore software house--the
> combination that has set the expectation for the price tag and for what
> constitutes a working computer architecture.
>
> RM
>
How about a nice Blue Gene based system like the Mayo Clinic is building
for genomic medicine and mining patient records? :-)

Maybe they shouldn't have hired the low bidder. And the government gets
what they ask for in the contract. Maybe they asked for the wrong
thing, or maybe they asked for the undoable.

Really wants one want to have them take over health care, eh?

del cecchi
From:Bernd Paysan
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Fri, 14 Jan 2005 23:38:56 +0100
Del Cecchi wrote:
> Maybe they shouldn't have hired the low bidder. And the government gets
> what they ask for in the contract. Maybe they asked for the wrong
> thing, or maybe they asked for the undoable.

IMHO, if you are asking for new technology, you should not make a bit, but a
contest. New ideas mean that many people go and head off into the wrong
direction. There's nothing wrong with that, you can stop them pretty soon
(maybe even before they get into the Death Valley of the project).
"Contest" means that you allow all participants to deliver a prototype -
but the price you'll pay is fixed, and unevenly distributed among the
participants (the higher the rank, the more).

I think all the basic components for such a system are already available.
All they need for a start is a hypertext (wiki) system where officers can
enter data, add links and google around (not all free-form, but a
significant part of the case file is like that). A thesaurus should help
officers to enter precise worded entries, but I think that is common
practice for real files, too (the thesaurus should be built into the search
matrix too, e.g. if you search for "body", you should find "corpse", too -
Google doesn't). Biometric data can be automatically put into categories,
too, e.g. run fingerprint and face recognition software to identify
potentially overlapping cases.

Perhaps the FBI aimed much too high - that is the most common problem with
government contracts. They rarely take the approach "start low, with what's
available, and improve gradually", they always want the big thing, and are
willing to wait ten years and hundred millions, and then scrap the whole
thing.

The harder thing is to prevent abuse of such a system. The easy part is to
expire and clear files, though probably nothing will ever get deleted (at
least searching should require a sort of "way back machine", i.e. not in
the normal context). However, criminal officers can use such a virtual file
and the search functions to hide. Therefore, each file should also keep
track of who viewed it - if an officer not in charge views a particular
file quite often, there's a suspect. Version control is also obvious. That
all this should happen within a VPN, too.

The web is a huge public general purpose information system, making a
specialized in-house variant should really not take that much. The search
engine companies sell their technology, they'll probably be happy with
ideas how to make things better (a real-world Google could benefit from a
thesaurus, too).

--
Bernd Paysan
"If you want it done right, you have to do it yourself"
http://www.jwdt.com/~paysan/
From:Robert Myers
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Fri, 14 Jan 2005 18:13:34 -0500
On Fri, 14 Jan 2005 23:38:56 +0100, Bernd Paysan
wrote:

>Del Cecchi wrote:
>> Maybe they shouldn't have hired the low bidder. And the government gets
>> what they ask for in the contract. Maybe they asked for the wrong
>> thing, or maybe they asked for the undoable.
>
>IMHO, if you are asking for new technology, you should not make a bit, but a
>contest. New ideas mean that many people go and head off into the wrong
>direction. There's nothing wrong with that, you can stop them pretty soon
>(maybe even before they get into the Death Valley of the project).
>"Contest" means that you allow all participants to deliver a prototype -
>but the price you'll pay is fixed, and unevenly distributed among the
>participants (the higher the rank, the more).
>

Hasn't gotten DARPA an autonomous land vehicle, so far.

>I think all the basic components for such a system are already available.
>All they need for a start is a hypertext (wiki) system where officers can
>enter data, add links and google around (not all free-form, but a
>significant part of the case file is like that). A thesaurus should help
>officers to enter precise worded entries, but I think that is common
>practice for real files, too (the thesaurus should be built into the search
>matrix too, e.g. if you search for "body", you should find "corpse", too -
>Google doesn't). Biometric data can be automatically put into categories,
>too, e.g. run fingerprint and face recognition software to identify
>potentially overlapping cases.
>
>Perhaps the FBI aimed much too high - that is the most common problem with
>government contracts. They rarely take the approach "start low, with what's
>available, and improve gradually", they always want the big thing, and are
>willing to wait ten years and hundred millions, and then scrap the whole
>thing.
>
>The harder thing is to prevent abuse of such a system. The easy part is to
>expire and clear files, though probably nothing will ever get deleted (at
>least searching should require a sort of "way back machine", i.e. not in
>the normal context). However, criminal officers can use such a virtual file
>and the search functions to hide. Therefore, each file should also keep
>track of who viewed it - if an officer not in charge views a particular
>file quite often, there's a suspect. Version control is also obvious. That
>all this should happen within a VPN, too.
>
>The web is a huge public general purpose information system, making a
>specialized in-house variant should really not take that much. The search
>engine companies sell their technology, they'll probably be happy with
>ideas how to make things better (a real-world Google could benefit from a
>thesaurus, too).

You're concerned about civil liberties, which is understandable.
That's a much bigger problem than the FBI's proposed
system--computerized collections of personal data or allegations about
indivduals (which the internet already is) and the ability to search
them (which google already is) are inimical to privacy.

The problem for the customer of record here is not civil liberties,
but security. The very same mechanisms that have made the internet so
stunningly powerful (flat resource space, amazing indexes, universal
access) are unacceptable for secure systems.

The security regimes with which I have been acquainted lean heavily on
strict compartmentalization by need to know. That means that access
paths all have to be laid out rigidly and with significant
foreknowledge about what metadata need to be visible--and it means the
metadata have to be tediously created by hand. I don't know how the
FBI has proposed to deal with compartmentalization, but you can't just
dump everything out there in a flat resource space, index it, and let
people do their own searches.

If you didn't have to worry about security, your proposal would be
magic. Digitize everything (although agents have apparently been
resistant to the step of putting things into a scanner), put it where
it can be indexed, index it, and let agents learn to think the way
that people who use and contribute to open source do.

The FBI didn't think this part through clearly, either, and figured
that, if you can correlate the entire planet with a quarter million
PC's (as in google), you should probably be able to cope with just the
FBI's case files for $170 Million and no special magic?

RM
From:Bernd Paysan
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Sat, 15 Jan 2005 15:46:48 +0100
Robert Myers wrote:
> The security regimes with which I have been acquainted lean heavily on
> strict compartmentalization by need to know. That means that access
> paths all have to be laid out rigidly and with significant
> foreknowledge about what metadata need to be visible--and it means the
> metadata have to be tediously created by hand. I don't know how the
> FBI has proposed to deal with compartmentalization, but you can't just
> dump everything out there in a flat resource space, index it, and let
> people do their own searches.

As far as I know the FBI (and even more the secret services) have troubles
with "left hand doesn't know what right hand does". This is a consequence
of security regimes like the "need to know" principle. Since the FBI is a
federal police, the "need to know" regime should happen at the border to
the FBI, not within. Crimes that are no concern to the FBI don't go into
the FBI database, and crimes that do should allow to be linked relatively
freely. Organized crime needs correlation between seemingly unrelated
cases. Organized crimes also requires surveillance of the officers in
charge, as the combination of "need to know" and corruption is an easy way
for organized crime to continue their work even though the FBI has been set
on them.

The "need to know" principle does not help civil liberties. The Gestapo was
clearly organized that way, too. It's a two-sided sword - it helps to hide
crimes by those organizations as well as it helps to hide well enough
organized criminals.

Systems like that need boundaries. Some material needs to be classified
internally, too. However, it's dangerous to allow material to be locked in
(like the notebook from the suspected 9/11 highjacker which was caught a
month before). I'm quite sure that the inner boundaries, the virtual
cubicle walls inside the FBI are there for the same reasons many
programmers in classical corporate environments don't want other people to
look into their sources: They are there as job protection mean, to hide
errors, not to help nation security or civil liberties.

Sure, the database should not be searched that easily. At least a "four
eyes" approach or something like that is needed. Internal communication
within the team may be concealed from non-members (the same thing is even
possible on Sourceforge). For things like that (mailing lists), you don't
need special infrastructure, not even for a very tough security regime on
those mails (OpenPGP or S/MIME is clearly sufficient).

> If you didn't have to worry about security, your proposal would be
> magic. Digitize everything (although agents have apparently been
> resistant to the step of putting things into a scanner), put it where
> it can be indexed, index it, and let agents learn to think the way
> that people who use and contribute to open source do.

Hm, requires at least three silver bullets ;-). FBI agents are far away from
open source contributors.

--
Bernd Paysan
"If you want it done right, you have to do it yourself"
http://www.jwdt.com/~paysan/
From:Casper H.S. Dik
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:15 Jan 2005 15:13:54 GMT
Bernd Paysan writes:

>Sure, the database should not be searched that easily. At least a "four
>eyes" approach or something like that is needed. Internal communication
>within the team may be concealed from non-members (the same thing is even
>possible on Sourceforge). For things like that (mailing lists), you don't
>need special infrastructure, not even for a very tough security regime on
>those mails (OpenPGP or S/MIME is clearly sufficient).

Data would need to be labeled so only sufficient cleared personel
and personel with a stated interest in certain data can see it;
that, of course, makes searching/fishing a task which is not something
everybody is allowed to do, at least not in the whole database.

Making sure that some part of the system knows who accessed what and
when is also very important: it serves both as a deterrent from abuse
and may allow tracing leaked documents.

>> If you didn't have to worry about security, your proposal would be
>> magic. Digitize everything (although agents have apparently been
>> resistant to the step of putting things into a scanner), put it where
>> it can be indexed, index it, and let agents learn to think the way
>> that people who use and contribute to open source do.

>Hm, requires at least three silver bullets ;-). FBI agents are far away from
>open source contributors.

A lot of the data the FBI gathers is not computerized; to go from
hand written notes to a searchable index is quite a step.

Casper
From:del cecchi
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Sat, 15 Jan 2005 20:05:51 -0600

"Bernd Paysan" wrote in message
news:ovmnb2-rqa.ln1@vimes.paysan.nom...
> Robert Myers wrote:
> > The security regimes with which I have been acquainted lean heavily
on
> > strict compartmentalization by need to know. That means that access
> > paths all have to be laid out rigidly and with significant
> > foreknowledge about what metadata need to be visible--and it means
the
> > metadata have to be tediously created by hand. I don't know how the
> > FBI has proposed to deal with compartmentalization, but you can't
just
> > dump everything out there in a flat resource space, index it, and
let
> > people do their own searches.
>
> As far as I know the FBI (and even more the secret services) have
troubles
> with "left hand doesn't know what right hand does". This is a
consequence
> of security regimes like the "need to know" principle. Since the FBI
is a
> federal police, the "need to know" regime should happen at the border
to
> the FBI, not within. Crimes that are no concern to the FBI don't go
into
> the FBI database, and crimes that do should allow to be linked
relatively
> freely. Organized crime needs correlation between seemingly unrelated
> cases. Organized crimes also requires surveillance of the officers in
> charge, as the combination of "need to know" and corruption is an easy
way
> for organized crime to continue their work even though the FBI has
been set
> on them.

Sorry Charlie. You must not have gotten the memo about the wall between
criminal and national security work at the FBI. Sent by Garfolo(Not
sure about the spelling, worked for Reno)
>
> The "need to know" principle does not help civil liberties. The
Gestapo was
> clearly organized that way, too. It's a two-sided sword - it helps to
hide
> crimes by those organizations as well as it helps to hide well enough
> organized criminals.

You also don't know about the fruit of the poisoned tree and all that.
American Blind justice is almost too bizarre to think about.
>
> Systems like that need boundaries. Some material needs to be
classified
> internally, too. However, it's dangerous to allow material to be
locked in
> (like the notebook from the suspected 9/11 highjacker which was caught
a
> month before). I'm quite sure that the inner boundaries, the virtual
> cubicle walls inside the FBI are there for the same reasons many
> programmers in classical corporate environments don't want other
people to
> look into their sources: They are there as job protection mean, to
hide
> errors, not to help nation security or civil liberties.

Bzzt, thanks for playing. Read the accounts of the arrest and trial of
Z Massoui ( the purported 20th hijacker). His hard drive couldn't be
searched. Would violate his rights.

>
> Sure, the database should not be searched that easily. At least a
"four
> eyes" approach or something like that is needed. Internal
communication
> within the team may be concealed from non-members (the same thing is
even
> possible on Sourceforge). For things like that (mailing lists), you
don't
> need special infrastructure, not even for a very tough security regime
on
> those mails (OpenPGP or S/MIME is clearly sufficient).
>
I don't think you should talk about stuff you don't know about. They
already have a database. Hell, they probably have 17 databases. All
living on servers somewhere that are updated overnight.

snip
From:Robert Myers
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Fri, 14 Jan 2005 13:16:28 -0500
Del Cecchi wrote:

>>
> How about a nice Blue Gene based system like the Mayo Clinic is building
> for genomic medicine and mining patient records? :-)
>

Blue Gene gets more amazing all the time. It now does virtualization? ;-).

A z-series mainframe, I would have thought, with virtualization on
virtualization to monitor and to trap suspicious behavior aimed at such
a high-value target, but what do I know?

If vmware can virtualize garden-variety x86, I'm sure that IBM can
virtualize the Power-based Blue Gene. What's one more ambitious goal,
more or less?

RM
From:Bernd Paysan
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Sat, 15 Jan 2005 17:51:51 +0100
Del Cecchi wrote:
> Maybe they shouldn't have hired the low bidder. And the government gets
> what they ask for in the contract. Maybe they asked for the wrong
> thing, or maybe they asked for the undoable.

According to Heise online, one of the problems the FBI has is that the first
deliverable doesn't work perfect. Duh. This is to be expected, software
simply doesn't do that. An iteration cycle is necessary. Government
contracts are especially difficult to deal with, since the customer both
doesn't know what he wants *and* wants a perfect solution on the first
shot. Commercial development is easier, since while the customer there
doesn't know what he wants either, a perfect solution is not necessary.
Economic pressure makes sure that the customer will live with the 90%
solution, and allows for gradual improvement.

According to recent news on Heise, FBI also doesn't use "Carnivore", but
some unnamed commercial systems. They don't seem to have much luck with the
software development they initiate.

> Really wants one want to have them take over health care, eh?

Hm, the US system, as it is now, is nothing to be proud of, either. And you
can't say that the state is not already involved deeply - the FDA does such
weird things on pharmacy prices and availability that even MCs go to Canada
to buy cheaper drugs (or drugs unavailable in the USA).

So: given how much the state now already manages to mess up health care, no,
you wouldn't want more of that.

--
Bernd Paysan
"If you want it done right, you have to do it yourself"
http://www.jwdt.com/~paysan/
From:jgrove24 at hotmail.com
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:15 Jan 2005 13:59:47 -0800

jgrove24@hotmail.com wrote:
> Robert Myers wrote:
> > Greetings,
> >
> > News articles say FBI is probably going to scrap its $170 Million
> > Virtual Case File system and start over from scratch.
> >
> > The problem is characterized as a software issue. Not being an
> > enterprise architect, I would have thought the problem was an
> enterprise
> > architecture issue: hardware, software, and mostly organizational
> wetware.
> >
> > I'm posting here because of the number of posters and lurkers with
> > significant architectural experience in mainframe and enterprise
> systems.
> >
>
> The features the FBI looked for are easy using UNIX, and a tool like
> Documentum. They were looking to manage CASE FILES, you know like
paper
> documents. And the ability to search or GREP in UNIX. A Grand Jury
> examination of this folly is in order.
>
> JG

Actually, I'm read usenet in GOOGLE, what the FBI wanted is basically
GOOGLE features.

JG
From:Robert Myers
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Sat, 15 Jan 2005 17:40:08 -0500
On 15 Jan 2005 13:59:47 -0800, jgrove24@hotmail.com wrote:

>
>Actually, I'm read usenet in GOOGLE, what the FBI wanted is basically
>GOOGLE features.
>

The problem is that the part of the problem you can see--the part the
looks like the internet and google--is sufficiently non-trivial that
it is successfully masquerading as the whole problem to you.

The FBI wants data relevant to terrorism to be seen instantaneously
(as opposed to the currently claimed 24-hour cycle, which at that I
don't believe)--google can't do that.

There's lots of stuff on the web, the so-called deep web, that you
can't see with google, and the Virtual Case File would have to look
like a deep web so that it didn't pose unacceptable security risks.

Google can't manage document access or audit changes and versions.
The tools open source contributors use can do many of those things
nicely, but if FBI agents could function at that level of competence,
they could probably build the system themselves on the fly.

I wonder if something very similar to the false intuition you have
about this problem doesn't explain the whole wasted $170 million. It
looks easy, so it must be easy. Put it out on an RFP, take a
plausible bid, and wait for your own ready-made personal internet and
google to appear. Piece of cake.

RM
From:mike
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Mon, 17 Jan 2005 05:30:37 GMT
There are many organizational and cultural hurdles for governmental
organizations to overcome when developing large systems. However this
posting implies one problem that does not exist.

With the Internet there is no way for GOOGLE to know immediately when a web
page is created or updated. Therefore, GOOGLE must scan the whole net, or
as much as it can access, every few days or weeks to keep its indexes fairly
current. Google can assign a high priority to some sites based on user
access frequency and site volatility. These sites can be scanned every day,
hour or even every few minutes. On the other hand changes to less important
sites will not be reflected in the index for weeks or even months. There is
no possibility for a "real time" index.

In a closed government system like the FBI case database the software that
enters or modifies a case file can send a message directly to the indexing
engine. Further unlike GOOGLE the indexing engine does not have to waste
cycles re-indexing old entries that have not been changed. Therefore a near
real time index, within a few hundred milliseconds, can be maintained.

Mike Sicilian


"Robert Myers" wrote in message
news:c76ju01ic19dv5h8efai6c021cqvol77f1@4ax.com...
> On 15 Jan 2005 13:59:47 -0800, jgrove24@hotmail.com wrote:
>
>>
>>Actually, I'm read usenet in GOOGLE, what the FBI wanted is basically
>>GOOGLE features.
>>
>
> The problem is that the part of the problem you can see--the part the
> looks like the internet and google--is sufficiently non-trivial that
> it is successfully masquerading as the whole problem to you.
>
> The FBI wants data relevant to terrorism to be seen instantaneously
> (as opposed to the currently claimed 24-hour cycle, which at that I
> don't believe)--google can't do that.
>
> There's lots of stuff on the web, the so-called deep web, that you
> can't see with google, and the Virtual Case File would have to look
> like a deep web so that it didn't pose unacceptable security risks.
>
> Google can't manage document access or audit changes and versions.
> The tools open source contributors use can do many of those things
> nicely, but if FBI agents could function at that level of competence,
> they could probably build the system themselves on the fly.
>
> I wonder if something very similar to the false intuition you have
> about this problem doesn't explain the whole wasted $170 million. It
> looks easy, so it must be easy. Put it out on an RFP, take a
> plausible bid, and wait for your own ready-made personal internet and
> google to appear. Piece of cake.
>
> RM
From:Stephen Fuld
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Mon, 17 Jan 2005 06:00:16 GMT

"mike" wrote in message
news:14IGd.10035$Vj3.7515@newssvr17.news.prodigy.com...
> There are many organizational and cultural hurdles for governmental
> organizations to overcome when developing large systems. However this
> posting implies one problem that does not exist.
>
> With the Internet there is no way for GOOGLE to know immediately when a
> web page is created or updated. Therefore, GOOGLE must scan the whole
> net, or as much as it can access, every few days or weeks to keep its
> indexes fairly current. Google can assign a high priority to some sites
> based on user access frequency and site volatility. These sites can be
> scanned every day, hour or even every few minutes. On the other hand
> changes to less important sites will not be reflected in the index for
> weeks or even months. There is no possibility for a "real time" index.
>
> In a closed government system like the FBI case database the software that
> enters or modifies a case file can send a message directly to the indexing
> engine. Further unlike GOOGLE the indexing engine does not have to waste
> cycles re-indexing old entries that have not been changed. Therefore a
> near real time index, within a few hundred milliseconds, can be
> maintained.

Precisely. And the other technical problems seem fairly easily solvable.
For instance, I don't doubt that by sending some money to Google for a
special system for them selves, Google could easily take each query,
optionally look up all the words in a thesaurus and modify the queries to
include either the original word or the synonym. As for getting all of the
existing paper information into the database, Google has just commenced
scanning a huge amount of data, from several libraries into searchable form,
so this is right up their alley.

Some people mentioned the security issues. ISTM that either you allow FBI
agents access to the data or you don't. If you do, then there is no
problem. If you don't, then you can't say that you have "broken the stove
pipes" or "broken down the walls", as they claim to be doing. If there is
some particular need, again, having some code on each document and making
the search or display software aware of the codes seems like it could be
made to work.

While I agree that trying to use the existing Google system exactly as it is
now wouldn't work, but ISTM that it is so close, and the modifications
should cost a tiny fraction of the $157 million, that it is worth trying.

--
- Stephen Fuld
e-mail address disguised to prevent spam
From:mike
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Mon, 17 Jan 2005 06:35:42 GMT
Hold on a second!!

I just said that real time index maintenance was not the issue, I did not
mean to imply that GOOGLE and a few million dollars would be a full function
case system for the FBI. I have not even seen a clear statement of system
requirements.

However, just thinking out loud, you can not just let every FBI agent look
at every file with no controls and no log of access. If Al Queda manages to
suborn just one agent they could find out exactly what the FBI was looking
at and the names of all informants and puff! the whole FBI is useless.

I would expect the FBI needs a system that will look at each case and notify
the agents on that case of other cases and documents that might be relevant.

Further, supervisors should get detailed reports on system usage so they can
verify the appropriateness of data requests as relevant to the assignments
of their staff.

Also, the system should have a "work flow management" component like call
centers and large help desk operations use, so that new data entry
transactions needing investigation or analysis are automatically assigned
and progress tracked. The FBI is a big organization and without such a
facility, too much will fall between the cracks.

One of the biggest problems with government system procurement is that many
of these features can be found in commercial packages. When available, they
should be used as subsystems rather than reinvented.

Mike Sicilian





"Stephen Fuld" wrote in message
news:QvIGd.33750$w62.13950@bgtnsc05-news.ops.worldnet.att.net...
>
> "mike" wrote in message
> news:14IGd.10035$Vj3.7515@newssvr17.news.prodigy.com...
>> There are many organizational and cultural hurdles for governmental
>> organizations to overcome when developing large systems. However this
>> posting implies one problem that does not exist.
>>
>> With the Internet there is no way for GOOGLE to know immediately when a
>> web page is created or updated. Therefore, GOOGLE must scan the whole
>> net, or as much as it can access, every few days or weeks to keep its
>> indexes fairly current. Google can assign a high priority to some sites
>> based on user access frequency and site volatility. These sites can be
>> scanned every day, hour or even every few minutes. On the other hand
>> changes to less important sites will not be reflected in the index for
>> weeks or even months. There is no possibility for a "real time" index.
>>
>> In a closed government system like the FBI case database the software
>> that enters or modifies a case file can send a message directly to the
>> indexing engine. Further unlike GOOGLE the indexing engine does not have
>> to waste cycles re-indexing old entries that have not been changed.
>> Therefore a near real time index, within a few hundred milliseconds, can
>> be maintained.
>
> Precisely. And the other technical problems seem fairly easily solvable.
> For instance, I don't doubt that by sending some money to Google for a
> special system for them selves, Google could easily take each query,
> optionally look up all the words in a thesaurus and modify the queries to
> include either the original word or the synonym. As for getting all of
> the existing paper information into the database, Google has just
> commenced scanning a huge amount of data, from several libraries into
> searchable form, so this is right up their alley.
>
> Some people mentioned the security issues. ISTM that either you allow FBI
> agents access to the data or you don't. If you do, then there is no
> problem. If you don't, then you can't say that you have "broken the stove
> pipes" or "broken down the walls", as they claim to be doing. If there is
> some particular need, again, having some code on each document and making
> the search or display software aware of the codes seems like it could be
> made to work.
>
> While I agree that trying to use the existing Google system exactly as it
> is now wouldn't work, but ISTM that it is so close, and the modifications
> should cost a tiny fraction of the $157 million, that it is worth trying.
>
> --
> - Stephen Fuld
> e-mail address disguised to prevent spam
>
From:Martin Rodgers
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Mon, 17 Jan 2005 08:08:24 +0000
mike wrote:

> Further, supervisors should get detailed reports on system usage so they can
> verify the appropriateness of data requests as relevant to the assignments
> of their staff.

That sounds like profiling of FBI agents. Oh, the irony!
--
http://www.wildcard.demon.co.uk You can never browse enough
Will write code that writes code that writes code for food
From:Stephen Fuld
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Mon, 17 Jan 2005 08:09:07 GMT

"mike" wrote in message
news:21JGd.18525$by5.14061@newssvr19.news.prodigy.com...
> Hold on a second!!
>
> I just said that real time index maintenance was not the issue, I did not
> mean to imply that GOOGLE and a few million dollars would be a full
> function case system for the FBI.

OK.

> I have not even seen a clear statement of system requirements.

Nor have I, but we can do some intelligent speculation.

> However, just thinking out loud, you can not just let every FBI agent
> look at every file with no controls and no log of access.

I mentioned in my post the idea of codes to limit access. That doesn't seem
like a major issue. And today, Google can log each query and does for
statistical purposes.


> If Al Queda manages to suborn just one agent they could find out exactly
> what the FBI was looking at and the names of all informants and puff! the
> whole FBI is useless.

What are the controls today? I remember the one agent complaining that she
couldn't see if other cities had people with arabic names taking flight
lessons and this was not due to some security issues, but because the system
simply couldn't support it. Clearly, some people need access to all the
data and if they are compromised, then we have a Robert Hanson type
situation.

> I would expect the FBI needs a system that will look at each case and
> notify the agents on that case of other cases and documents that might be
> relevant.

I'm not sure how you could code the test for "relevant". But let's assume
you could do that. Then when a new case is created, or when there are
substantial updates, a "relevant information" query could be run. Then the
software could automatically send an e-mail to the agents on the other
cases.

> Further, supervisors should get detailed reports on system usage so they
> can verify the appropriateness of data requests as relevant to the
> assignments of their staff.

Sure. A simple extension of the logging function seems to meet that
requirement.

> Also, the system should have a "work flow management" component like call
> centers and large help desk operations use, so that new data entry
> transactions needing investigation or analysis are automatically assigned
> and progress tracked. The FBI is a big organization and without such a
> facility, too much will fall between the cracks.
>
> One of the biggest problems with government system procurement is that
> many of these features can be found in commercial packages. When
> available, they should be used as subsystems rather than reinvented.

Yes. I absolutely agree. And it is even the case that it may be faster to
slightly modify an existing system than to code from scratch.

--
- Stephen Fuld
e-mail address disguised to prevent spam
From:Bernd Paysan
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Mon, 17 Jan 2005 12:54:50 +0100
Stephen Fuld wrote:
> Precisely. And the other technical problems seem fairly easily solvable.
> For instance, I don't doubt that by sending some money to Google for a
> special system for them selves, Google could easily take each query,
> optionally look up all the words in a thesaurus and modify the queries to
> include either the original word or the synonym.

Not just the query, the index, too. Certainly, the thesaurus must map equal
words when querying, but when they go in as equals in the map (index), the
map is denser and smaller.

> While I agree that trying to use the existing Google system exactly as it
> is now wouldn't work, but ISTM that it is so close, and the modifications
> should cost a tiny fraction of the $157 million, that it is worth trying.

ISTM, too.

--
Bernd Paysan
"If you want it done right, you have to do it yourself"
http://www.jwdt.com/~paysan/
From:Stephen Fuld
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Mon, 17 Jan 2005 18:09:11 GMT

"Bernd Paysan" wrote in message
news:cllsb2-0nl.ln1@miriam.mikron.de...
> Stephen Fuld wrote:
>> Precisely. And the other technical problems seem fairly easily solvable.
>> For instance, I don't doubt that by sending some money to Google for a
>> special system for them selves, Google could easily take each query,
>> optionally look up all the words in a thesaurus and modify the queries to
>> include either the original word or the synonym.
>
> Not just the query, the index, too. Certainly, the thesaurus must map
> equal
> words when querying, but when they go in as equals in the map (index), the
> map is denser and smaller.

That is a good point that I missed. But I am not sure whether it is a good
idea. Is it sometimes necessary to ask for the particular word and
explicitly not one of its synonyms? If so, then you could specify it in the
query, but that wouldn't work if the index had already deleted the
information. I don't know if there is such a requriement though.

--
- Stephen Fuld
e-mail address disguised to prevent spam
From:Andrew Reilly
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Mon, 17 Jan 2005 22:45:28 +1100
On Mon, 17 Jan 2005 06:30:37 +0000, mike wrote:

> In a closed government system like the FBI case database the software that
> enters or modifies a case file can send a message directly to the indexing
> engine.

kqueue, famd, snapshots ?

No, I know that's probably not the whole answer...

--
Andrew
From:Robert Myers
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Mon, 17 Jan 2005 08:59:26 -0500
On Mon, 17 Jan 2005 05:30:37 GMT, "mike" wrote:

>
>In a closed government system like the FBI case database the software that
>enters or modifies a case file can send a message directly to the indexing
>engine. Further unlike GOOGLE the indexing engine does not have to waste
>cycles re-indexing old entries that have not been changed. Therefore a near
>real time index, within a few hundred milliseconds, can be maintained.
>

Well, sure. People know how to keep idices updated. In order to
index something, the data have to be visible to the indexing system in
a secure, consistent way that the indexing system understands. Many a
slip 'twixt the cup and the lip. That's why I don't believe the
current FBI system is only ever 24-hours out of date. When they find
another roomful of boxes, it'll have been however many months or years
out of date that roomful of boxes was hiding.

This from my inbox

--------------------------Advertisement-------------------------------
: Watch
everything work together and see change coming a mile away.
Reduce the number of IT elements in your network. Eliminate
customisation. Automate change. Use standard technologies and
interfaces. Create reusable components. Implement consistent
processes.
Interact with any system, anywhere, always. Build architectures
modularly. Virtualise your systems. Change one element without
impacting the entire network. Build a dynamic link between business
and IT. Connect applications and processes inside and out.

*******

Above all, minimize difficulties, oversell, and watch people go back
to pad and pencil, or at least bypass your nifty enterprise
architecture and keep things squirreled away in private stashes, maybe
even on a computer--a PC or a laptop. And, oh yes, watch as revenue
and jobs disappear from the industry.

Maybe there is a National Academies report that would answer the
question in my OP.

RM
From:krasicki
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:18 Jan 2005 19:46:04 -0800
Stephen Sprunk wrote:
> "krasicki" wrote in message
> news:1106021732.876074.76930@c13g2000cwb.googlegroups.com...
> > The intent [just guessing] is to anticipate what might happen, who
> > might be inviolved, what might prevent the event rather than the
more
> > pedestrian anthropology of what happened, who did it, give me
instant
> > suspects. With all due respect to the latter activity, and
considering
> > the consequences, the former problem will not be solved COTS style.
>
> If that was the intent, then why the heck is it called "Virtual Case
File"?
> The name strongly implies they want an electronic document storage
and
> management system.

There are often massive differences between what something sounds like
and what something is.

As a taxpayer and wearing the hat of an information analyst, I would be
furious if $150+M were being spent to manage legacy FBI paperwork
because that would mean that 'terror' in the eyes of the FBI
constituted the imminent need to file and retrieve paperwork.

Aside from the sheer absurdity that pre-911 case file information
schemas are sufficient to capture the kind of data necessary for
analyzing terrorist threats [instead of, say, political tom-foolery]
there is probably no canonic and extendable information schema that
maps historic case file differences, cross-agency differences, and
intra-agency diffences of even existing data. Up-front questions about
such a project is what kind of documents do you have, what are they
worth, how clean is the quality of the stuff, what would you be
interested in, and so on.

And the reason you would ask these questions is to introduce rigor to
the information schema - when an agent needs a bit of information that
bit of information has to have a well-known, certain meaning.

But even this is presumptive that the agency has interest in mining the
nuances of existing data.

The specification could just as well mean that the paper inventory of
index cards needs to be physically retrievable. Modern libraries are
moving away from Dewey Decimal systems to inventory control systems in
which physical payload is stored and retrieved from an automated
container management system. Material is put into a container, the
contents identified, and the system carries it away until the material
is requested. No more messy file cabinets or bookshelves to browse.


>
> Another project might use VCF as a source of information to decide
who
> "might" be a terrorist, but that doesn't seem like the purpose of VCF
> itself.
>

Sounds like the FBI still has no imagination if that bullet point
didn't make the 'wish list';-)
From:krasicki
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:16 Jan 2005 20:11:10 -0800
Very good points.

Right the first part of the problem is a canonic case file, the second
part is making it virtual, the third part - realtime, the fourth -
realtime rollback when it's wrong, and so on.

Architecturally, we are talking about the largest in-memory database
ever attempted and we're talking about synchronization features that
haven't been dreamed of with the initial investment. As you say, this
is almost certainly vendor driven, government payola driving the train.

One of the most severe problems that we as a nation face in truly
beginning to prepare ourselves against the war on terror (let's face it
- the Bush administration are klowns) is that the RFPs for far too many
systems have overt, vendor driven agendas built into the spec. Often
the vendor insists only their inside consultants can do the work. The
contract award this becomes self-insulating for the vendor or family of
vendors and no fresh air ever gets in to disinfect the project from
poor management, honest review, and so on.
And you are right in observing that Google is not the answer.
From:jgrove24 at hotmail.com
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:17 Jan 2005 13:47:15 -0800

>
> Finally, if they can't understand the requirements of what they want
-
> then managing [people AND technology] will bring any such system to
its
> knees during peak vacation times. $170M is a drop in the bucket for
a
> meta-enterprise system requirement such as this.

Assuming a billing rate of $250K/year, what exactly was produced for
680
person-years of work?

per LA Times:

A preliminary report from Aerospace Corp., a federally funded nonprofit
research firm in El Segundo hired by the FBI to assess its options, has
identified commercially available programs that could meet the FBI's
requirements, sources familiar with the study said. Using such programs
would also enable the FBI to integrate its software with that of other
agencies doing similar work - a far more complicated task if it chose
to stick with a custom product.
......
In 1995, the agency's vast paper files were partly loaded into a
searchable electronic database. That system was soon found to be
inefficient and riddled with software bugs, and some top officials
never adopted it.

Until recently, much of the rest of the agency - including some field
agents - lacked such basic features as e-mail.

After the Sept. 11 attacks, the FBI's hardware, software and
communications networks, built up at the cost of hundreds of millions
of dollars, were revealed as severely outmoded.
.....
The linchpin of the planned computer system was the Virtual Case File
software, designed for use on fast computers and networks to make it
widely accessible across the FBI's scores of offices. The plan was to
include images, video and sounds, crucial components of a modern
investigation.
....and CNN
The current program requires FBI personnel to manually enter, print,
sign and scan their information into the "investigative data
warehouse."
The new software program was supposed to allow agents to pass along
along intelligence and criminal information in real time.
From:Robert Myers
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Mon, 17 Jan 2005 17:30:47 -0500
On 17 Jan 2005 13:47:15 -0800, jgrove24@hotmail.com wrote:

>
>>
>> Finally, if they can't understand the requirements of what they want
>-
>> then managing [people AND technology] will bring any such system to
>its
>> knees during peak vacation times. $170M is a drop in the bucket for
>a
>> meta-enterprise system requirement such as this.
>
>Assuming a billing rate of $250K/year, what exactly was produced for
>680
>person-years of work?
>
>per LA Times:
>
>A preliminary report from Aerospace Corp., a federally funded nonprofit
>research firm in El Segundo hired by the FBI to assess its options, has
>identified commercially available programs that could meet the FBI's
>requirements, sources familiar with the study said. Using such programs
>would also enable the FBI to integrate its software with that of other
>agencies doing similar work - a far more complicated task if it chose
>to stick with a custom product.

Aerospace Corp is at least a disinterested party. Whether it has the
expertise to do this particular job is another question. It certainly
doesn't do enterprise IT or (AFAIK) supervise enterprise IT-type
contracts.

HP blamed a $208 million quarterly loss for its enterprise storage and
server division on a new inventory and logsitics control system last
year. No security and no criminal case file issues. What would it
have cost HP to do it right the first time? Anything less than $208
million would have been a bargain. I wouldn't have wanted to be
selling HP enterprise services when that was going down.

Rather than what do you need all this money for (which is surely the
question being asked), a better place to start would be how expensive
is it to get it wrong. The Federal government has gotten so many of
these systems wrong, like the social security computer system, that
it's tempting to say derisive things about the US government IT. With
the other examples in this thread, maybe we should just say derisive
things about government IT. But then HP can't get its own system
right. One could say derisive things about HP? Already done.

Or maybe the problem is just hard and people are chronically
underestimating the actual difficulty and cost of doing it right. The
"Oh, we'll just do this and this and that, and it'll be done"
handwaving has obviously been done to an expensive fare-thee-well.

RM

From:Stephen Fuld
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Tue, 18 Jan 2005 01:54:55 GMT

"Robert Myers" wrote in message
news:7jdou09inojg2a80mufa7a1sbkpr71dohn@4ax.com...

snip

> Rather than what do you need all this money for (which is surely the
> question being asked), a better place to start would be how expensive
> is it to get it wrong. The Federal government has gotten so many of
> these systems wrong, like the social security computer system, that
> it's tempting to say derisive things about the US government IT.

What is wrong with the Social Security computer system(s)? There were lots
well documented screwups in implementing the SSI system in the 1970s (and I
can talk about that), but I haven't heard much since. The Social Security
system is usually cited as the epitome of government efficiency, costing
taxpayers only 1-2 percent of benefit costs in administrative expense. Full
Disclosure - I used to work for the Social Security Administration in the
1970s.

--
- Stephen Fuld
e-mail address disguised to prevent spam
From:Robert Myers
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Mon, 17 Jan 2005 21:44:48 -0500
On Tue, 18 Jan 2005 01:54:55 GMT, "Stephen Fuld"
wrote:

>
>"Robert Myers" wrote in message
>news:7jdou09inojg2a80mufa7a1sbkpr71dohn@4ax.com...
>
>snip
>
>> Rather than what do you need all this money for (which is surely the
>> question being asked), a better place to start would be how expensive
>> is it to get it wrong. The Federal government has gotten so many of
>> these systems wrong, like the social security computer system, that
>> it's tempting to say derisive things about the US government IT.
>
>What is wrong with the Social Security computer system(s)? There were lots
>well documented screwups in implementing the SSI system in the 1970s (and I
>can talk about that), but I haven't heard much since. The Social Security
>system is usually cited as the epitome of government efficiency, costing
>taxpayers only 1-2 percent of benefit costs in administrative expense. Full
>Disclosure - I used to work for the Social Security Administration in the
>1970s.

August 2003 google groups comp.arch "social security." I may well
have different federal fiascos conflated by now. More than one
problem system was discussed.

RM
From:Stephen Fuld
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Tue, 18 Jan 2005 06:32:16 GMT

"Robert Myers" wrote in message
news:ettou0lqsk6p15kpic17t3jfi349do2enh@4ax.com...
> On Tue, 18 Jan 2005 01:54:55 GMT, "Stephen Fuld"
> wrote:
>
>>
>>"Robert Myers" wrote in message
>>news:7jdou09inojg2a80mufa7a1sbkpr71dohn@4ax.com...
>>
>>snip
>>
>>> Rather than what do you need all this money for (which is surely the
>>> question being asked), a better place to start would be how expensive
>>> is it to get it wrong. The Federal government has gotten so many of
>>> these systems wrong, like the social security computer system, that
>>> it's tempting to say derisive things about the US government IT.
>>
>>What is wrong with the Social Security computer system(s)? There were
>>lots
>>well documented screwups in implementing the SSI system in the 1970s (and
>>I
>>can talk about that), but I haven't heard much since. The Social Security
>>system is usually cited as the epitome of government efficiency, costing
>>taxpayers only 1-2 percent of benefit costs in administrative expense.
>>Full
>>Disclosure - I used to work for the Social Security Administration in the
>>1970s.
>
> August 2003 google groups comp.arch "social security." I may well
> have different federal fiascos conflated by now. More than one
> problem system was discussed.

I checked and there was much confusion of several different projects and
several more situations. The closest to a "fiasco" was the one I described
back then, which was an attempt to redesign the entire Social Security data
processing systems in one fell swoop. It was an in house effort and they
had dozens of analysts working one it, but it never got to the coding stage
before the management came to their senses and realized that it wouldn't
work. So it was not a system that was implemented and "gotten wrong".
Other than that, I can't find anything that matches the criteria you had in
your post.

Not that there haven't been a substantial number of such "fiasco"s in the
federal government. :-(

--
- Stephen Fuld
e-mail address disguised to prevent spam
From:Robert Myers
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Tue, 18 Jan 2005 10:06:02 -0500
On Tue, 18 Jan 2005 06:32:16 GMT, "Stephen Fuld"
wrote:


>
>I checked and there was much confusion of several different projects and
>several more situations. The closest to a "fiasco" was the one I described
>back then, which was an attempt to redesign the entire Social Security data
>processing systems in one fell swoop. It was an in house effort and they
>had dozens of analysts working one it, but it never got to the coding stage
>before the management came to their senses and realized that it wouldn't
>work. So it was not a system that was implemented and "gotten wrong".
>Other than that, I can't find anything that matches the criteria you had in
>your post.
>
>Not that there haven't been a substantial number of such "fiasco"s in the
>federal government. :-(

Not everything is a failure, I'm sure. What I partly had in mind
about social security was its more recent move to decentralized data
processing. I know I had read some complaints, but now that I look
over what's quickly available, it seems to have worked--certainly not
a fiasco.

It's not the Federal Government, but Lockheed is bragging about how
much it has reduced paperwork in its contract submissions by putting
everything (including drawings in editable format) on computers. Come
to think of it, the Aerospace Corporation probably does have some
experience with a system with many of the capabilities the FBI wants,
including the capacity to handle sensitive documents. Wonder how well
it really works?

It's difficult for someone not actually involved even to guess what's
going wrong, how much should be forseeable as being at least very
risky, and how much is just the misery of dealing with complex
systems.

RM
From:Some Guy Who Doesn't Want Spam
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Thu, 20 Jan 2005 16:41:24 -0600
jgrove24@hotmail.com wrote:
> ...and CNN
> The current program requires FBI personnel to manually enter, print,
> sign and scan their information into the "investigative data
> warehouse."
> The new software program was supposed to allow agents to pass along
> along intelligence and criminal information in real time.

It's worse than you think... If you poke around a bit, you discover that
the "Investigative Data Warehouse" was a proof-of-concept piece intended
mostly for A: evaluating search tools, and B: figuring out what analysts
needed. It went online in summer of '04, it worked beautifully,
and the analyst test group using it told other analysts, who promptly
glommed on to the first (and at this point, only) working
multiple-source search capability.

Later in the CNN article, it mentions that there are 6000 operational users
of that system... which is pretty impressive for a system designed for a test
group of about 200. (This may also be why it's so awkward to enter data into
the system... It may not have been intended for user updates of data.)
From:krasicki
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:17 Jan 2005 20:15:32 -0800
Stefan,

It is precisely what I wanted to say and I agree with you that the
current political opportunists will never get it right although some
of the problem is endemic to all governments.

In the United States too little thought [in fact none] goes into
ensuring a harmony between technology and governance [rights and law
inclusive].

What the FBI is asking for is technically feasible, it will never be
cheap until the extreme expense of doing it the first time is spent,
and it will not be like Google, grep, or many of the familiar utilities
being argued here.

It will require non-trivial artificial intelligence components,
robotics, and human exo-skeleton sensory extensions to gather and
process information. Although, Google search queries like, "Find me a
terrorist I can arrest." are quaint wish list items what is really
needed is a push mechanism that triangulates a multiplicity of weighed
behavioral, inferred, and heuristically enhanced, pattern-matching
analysis tools. You can't begin to talk about a Virtual Case File
until an inventory of these techniques is accounted for and
expectations level set.

This problem is not profoundly different from NASA's 'search for life'
techniques and certain gaming algorithms may be useful in unorthodox
ways.

The intent [just guessing] is to anticipate what might happen, who
might be inviolved, what might prevent the event rather than the more
pedestrian anthropology of what happened, who did it, give me instant
suspects. With all due respect to the latter activity, and considering
the consequences, the former problem will not be solved COTS style.
From:Stephen Sprunk
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Tue, 18 Jan 2005 12:44:41 -0600
"krasicki" wrote in message
news:1106021732.876074.76930@c13g2000cwb.googlegroups.com...
> The intent [just guessing] is to anticipate what might happen, who
> might be inviolved, what might prevent the event rather than the more
> pedestrian anthropology of what happened, who did it, give me instant
> suspects. With all due respect to the latter activity, and considering
> the consequences, the former problem will not be solved COTS style.

If that was the intent, then why the heck is it called "Virtual Case File"?
The name strongly implies they want an electronic document storage and
management system.

Another project might use VCF as a source of information to decide who
"might" be a terrorist, but that doesn't seem like the purpose of VCF
itself.

S

--
Stephen Sprunk "Stupid people surround themselves with smart
CCIE #3723 people. Smart people surround themselves with
K5SSS smart people who disagree with them." --Aaron Sorkin
From:Jan_Vorbrüggen
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Wed, 19 Jan 2005 09:23:12 +0100
> Another project might use VCF as a source of information to decide who
> "might" be a terrorist,

Solving that problem is only to a very small degree amenable to spending
more money. It would take years or, more likely, decades of research while
giving the researchers access to real-life data, for starters. And a lot of
others things in addition. I don't see that happening.

Jan
From:Stephen Sprunk
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Thu, 20 Jan 2005 09:18:58 -0600
"Jan Vorbrüggen" wrote in message
news:356jnfF4j7q4kU1@individual.net...
> > Another project might use VCF as a source of information to decide who
> > "might" be a terrorist,
>
> Solving that problem is only to a very small degree amenable to spending
> more money. It would take years or, more likely, decades of research while
> giving the researchers access to real-life data, for starters. And a lot
of
> others things in addition. I don't see that happening.

I didn't mean to imply that such a task is actually possible with today's
knowledge, just that (based on the name) it doesn't sound like that was the
point of VCF.

S

--
Stephen Sprunk "Stupid people surround themselves with smart
CCIE #3723 people. Smart people surround themselves with
K5SSS smart people who disagree with them." --Aaron Sorkin
From:Eugene Miya
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:18 Jan 2005 11:30:18 -0800
Just passing thru.

In article <1106073846.68565940416bbb820ea211806518463d@teranews>,
Stephen Sprunk wrote:
>"krasicki" wrote in message
>news:1106021732.876074.76930@c13g2000cwb.googlegroups.com...
>> The intent [just guessing] is to anticipate what might happen, who
>> might be inviolved, what might prevent the event rather than the more
>> pedestrian anthropology of what happened, who did it, give me instant
>> suspects. With all due respect to the latter activity, and considering
>> the consequences, the former problem will not be solved COTS style.
>
>If that was the intent, then why the heck is it called "Virtual Case File"?
>The name strongly implies they want an electronic document storage and
>management system.

This is largely in fact what it is and is meant to be.
I was asked and did finally visit the Bureau and briefly worked with
their Chief Scientist. The Bureau no longer has that position.
They did approach my agency for assistance but due to the distance
of my/our programmatic offices I recommended they talk to people at
Goddard.

It's called a case file going back to the Hoover days and one of the
things he did to revolutionize the Bureau (it's not called a bureaucracy
for nothing):
Hoover brought in 3 x 5 cards.

The Bureau is highly backlogged with FOIA requests. That should give
you a hint.

>Another project might use VCF as a source of information to decide who
>"might" be a terrorist, but that doesn't seem like the purpose of VCF
>itself.

Maybe later stages. That's a slightly different matter.

There is a talk abstract from a project we funded at UCB given by a
friend when she was at Berkeley but its no longer on line and one would
have to search the Internet Archive (unless it has been removed at DOJ
request).

In a way I feel sorry for the FBI, but in other ways I don't.
It's truly a case by case basis.

Whit Diffie (who is giving a Museum talk in 2 weeks) pegged Bureau
mentality in his book.


Gotta go, just passing thru.


Oh, is it possible? Hmmm. Well, likely you want to substitute other TLAs
and see if you think that.

--
From:Robert Myers
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Tue, 18 Jan 2005 16:57:32 -0500
On 18 Jan 2005 11:30:18 -0800, eugene@cse.ucsc.edu (Eugene Miya)
wrote:


>
>Oh, is it possible? Hmmm. Well, likely you want to substitute other TLAs
>and see if you think that.

I like this article

http://www.floridatoday.com/news/space/stories/2004b/spacestoryN1010ROCKETSIDE.htm



Oct 10, 2004

Most defense space projects over budget

BY JOHN KELLY
FLORIDA TODAY

CAPE CANAVERAL -- The Delta 4 and Atlas 5 rockets aren't the only
space projects to cost more than the government expected.

Most Defense Department space projects are over budget, many by
billions of dollars.

In quarterly reports to Congress, the Pentagon reports at least $20
billion worth of overruns on its active space programs. That does not
count secret programs, such as spy satellites.

Then there's NASA. On average, the space agency's projects come in 45
percent over budget, the Congressional Budget Office recently
reported.

The simple explanation from the Air Force and others in the business:
"Space is hard."



The article goes on to speculation as to what's really happening that
might fit right into this thread (changing the subject from space to
computers, of course).

That fact that nothing seems to get done without going significantly
over budget isn't the same thing as saying that nothing ever gets
done, of course, but some of the ways things go wrong don't seem
likely to be peculiar to the FBI.

RM
From:Eugene Miya
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:20 Jan 2005 10:52:11 -0800
In article <1v0ru0phjdv5571gmolugemaapjc7dt0hr@4ax.com>,
Robert Myers wrote:

I got email requested back for a moment.

>On 18 Jan 2005 11:30:18 -0800, eugene@cse.ucsc.edu (Eugene Miya)
>wrote:
>>Oh, is it possible? Hmmm. Well, likely you want to substitute other TLAs
>>and see if you think that.
>
>I like this article

While DOD is a TLA, it is more than well known that the DOD can throw
money almost wherever it wants. If you want to self select like this
proverbal guy under the street light looking for dropped keys at night,
you aren't going to think about things.

>http://www.floridatoday.com/news/space/stories...
>Most defense space projects over budget
This is a digression from your own question.

Try thinking about the usual (and less usual) conspiracy theory agencies
if you want to consider the realm of possibilities independent of the
Bureau's bureaucracy:

CIA Virtual Case File is even possible?
NSA Virtual Case File is even possible?
G[S]IA Virtual Case File is even possible?
NRO Virtual Case File is even possible?

And then you can consider "even possible?"
Certainly not hardware architecture.

--
From:del cecchi
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Tue, 18 Jan 2005 18:55:27 -0600

"Robert Myers" wrote in message
news:1v0ru0phjdv5571gmolugemaapjc7dt0hr@4ax.com...
> On 18 Jan 2005 11:30:18 -0800, eugene@cse.ucsc.edu (Eugene Miya)
> wrote:
>
>
> >
> >Oh, is it possible? Hmmm. Well, likely you want to substitute other
TLAs
> >and see if you think that.
>
> I like this article
>
>
http://www.floridatoday.com/news/space/stories/2004b/spacestoryN1010ROCKETSIDE.htm
>
>
>
> Oct 10, 2004
>
> Most defense space projects over budget
>
> BY JOHN KELLY
> FLORIDA TODAY
>
> CAPE CANAVERAL -- The Delta 4 and Atlas 5 rockets aren't the only
> space projects to cost more than the government expected.
>
> Most Defense Department space projects are over budget, many by
> billions of dollars.
>
> In quarterly reports to Congress, the Pentagon reports at least $20
> billion worth of overruns on its active space programs. That does not
> count secret programs, such as spy satellites.
>
> Then there's NASA. On average, the space agency's projects come in 45
> percent over budget, the Congressional Budget Office recently
> reported.
>
> The simple explanation from the Air Force and others in the business:
> "Space is hard."
>
>

>
> The article goes on to speculation as to what's really happening that
> might fit right into this thread (changing the subject from space to
> computers, of course).
>
> That fact that nothing seems to get done without going significantly
> over budget isn't the same thing as saying that nothing ever gets
> done, of course, but some of the ways things go wrong don't seem
> likely to be peculiar to the FBI.
>
> RM

Duh. Where have these idiots been? How do you estimate the cost of
doing what has never been done? And is the pressure to estimate high or
low? It wasn't that the FBI program went over budget, but that they
went over budget and it still didn't even come close to working.

del
From:Terje Mathisen
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Wed, 19 Jan 2005 07:46:21 +0100
del cecchi wrote:
> Duh. Where have these idiots been? How do you estimate the cost of
> doing what has never been done?

In the case of NASA, your best/only guess is to look at how you've
budgeted previous projects, vs how much they've really cost. This is the
only way I know to even guess at the required fudge factor. :-(

Of course, some things turn out to be effectively impossible, in which
case you need clear milestone targets, and the guts to kill projects
that aren't going to work out.

I'm running the 'IT Fire Department' in Hydro, which means that my group
get all the 'critical/totally new stuff/overrun/should have been working
last week' projects. We have an incredible success rate, partly because
I insist on splitting complicated development jobs into multiple clearly
delineated projects, with a stop/go decision after each part.

If it isn't going to work at all, I'd much rather find out sooner rather
than later.

(Yes, I do realize that this is hard to do when you have multiple
man-years of effort before you can have even a mock-up working. I stay
away from such projects. :-)

> And is the pressure to estimate high or low?

Here's the crux!

When all competing projects are going to submit a 'best case' guess,
instead of a maximum (90%?) likelyhood/median/error range number, your
own project pretty much have to do the same thing, otherwise you'll
never get of the ground (literally in NASA's case).

Another example from the other end of the spectrum:

I am leading the orienteering group at the corporate sports club, we
submit budgets each fall for the next year's activities. Three years in
a row we submitted exact budgets, i.e. we knew within about 10% what
both income and expense would be. All three years our budget got slashed
by 50%, because all other groups in the club had overbudgetted madly, so
everyone got slashed by the same percentage.

Last year we fixed this, not by doing a similar padding of the budget,
but by writing an enclosing letter stating how we knew this budget to be
exact (as well as _much_ lower than any other group's), pointing to the
very close correlation between the previous three year's original budget
and actual result, and that we would much rather keep on doing the best
budgetting job we knew how, than to become a part of the problem by
doing the same padding as everyone else.

It did work, we got almost exactly what we asked for. :-)

> It wasn't that the FBI program went over budget, but that they
> went over budget and it still didn't even come close to working.

Right.

Terje

--
-
"almost all programming can be viewed as an exercise in caching"
From:Robert Myers
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Wed, 19 Jan 2005 08:48:04 -0500
On Tue, 18 Jan 2005 18:55:27 -0600, "del cecchi"
wrote:

>
>"Robert Myers" wrote in message
>news:1v0ru0phjdv5571gmolugemaapjc7dt0hr@4ax.com...
>> On 18 Jan 2005 11:30:18 -0800, eugene@cse.ucsc.edu (Eugene Miya)
>> wrote:
>>
>>
>> >
>> >Oh, is it possible? Hmmm. Well, likely you want to substitute other
>TLAs
>> >and see if you think that.
>>
>> I like this article
>>
>>
>http://www.floridatoday.com/news/space/stories/2004b/spacestoryN1010ROCKETSIDE.htm
>>



>>
>> The article goes on to speculation as to what's really happening that
>> might fit right into this thread (changing the subject from space to
>> computers, of course).
>>
>> That fact that nothing seems to get done without going significantly
>> over budget isn't the same thing as saying that nothing ever gets
>> done, of course, but some of the ways things go wrong don't seem
>> likely to be peculiar to the FBI.
>>
>
>Duh. Where have these idiots been? How do you estimate the cost of
>doing what has never been done? And is the pressure to estimate high or
>low? It wasn't that the FBI program went over budget, but that they
>went over budget and it still didn't even come close to working.
>

That (big money, no product, not even a working prototype) also
happened with the Sergeant York Gun, for example. In the case of the
Space Shuttle, it only ever worked in the sense that the goal posts
were moved very far from the way the program was sold (it really was
supposed to be a shuttle, like the New York-Boston shuttle, then a
pioneering service offered by Eastern Airlines). I really don't think
the problems are peculiar to the FBI.

As to the realities of goals and budgets for new projects, you
_surely_ must have at least a clue as to how that goes in large,
bureaucratic organization. The short time scale on which the budget
is completely redone for the US government means that promises have to
be sufficiently ambitious to survive the budget axe, and the budget
has to fit into whatever arbitrary constraints have been dictated.
That predicts chronic overpromising and underbudgeting, a prediction
that is borne out by experience.

One way to regularize the discussion of the FBI Case File system would
be to divide it into a part that is relatively routine and a part that
hasn't really been done before. The challenge for the relatively
routine part (a secure document management system) is probably getting
agents actually to use it without disrupting ongoing cases; i.e., it
really shouldn't be a "software problem." The non-routine part,
prognostic data mining, sounds like it's just burueacratic sales talk
at this point.

RM
From:mike
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Wed, 19 Jan 2005 21:52:16 GMT

>The non-routine part,
> prognostic data mining, sounds like it's just burueacratic sales talk
> at this point.


Most of this thread keeps repeating the assumption that the FBI system was a
failure because it tried to do this and it probably is impossible with
current technology. We have not seen a real requirements specification and
this may or may not be in it. I wonder if we are over stating the goals.

Historically, investigative organizations have tried to find all the parts
of a crime and all individuals involved in a conspiracy by looking at every
contact and relationship. That is, they would interview all family members,
look at the phone bills and interview everyone that was called, look at the
credit card bills and examine travel patterns and talk to everyone who sold
something to the suspects, etc.

If I were trying to design a system for the FBI, I would concentrate on
automating data mining from this perspective rather than try to directly
predict criminal intent. If the system contained info on every suspected /
convicted criminal including terrorists and also contained detailed credit
card, phone, e-mail etc. transaction history you might be able to quickly
generate a "related suspect" list for further investigation and or
surveillance. The system would not say "hay these suspects are all taking
flying lessons, they intend to attack the World Trade Center". Rather, it
would say this suspected terrorist has been in contact with these 23 people
and 4 of them have received money transfers from a common account in Libya.

There are cultural and organizational challenges to successful system
development in any large organization and these are exaggerated in
government bureaucracies. However this more modest goal would seem to be
theoretically possible and an appropriate use of technology by the FBI.

Mike Sicilian
From:Rick Jones
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Wed, 19 Jan 2005 18:21:29 GMT
del cecchi wrote:
> Duh. Where have these idiots been? How do you estimate the cost of
> doing what has never been done?

The same way we all estimate the performance of systems that don't
exist?-)

rick jones
--
Process shall set you free from the need for rational thought.
these opinions are mine, all mine; HP might not want them anyway... :)
feel free to post, OR email to raj in cup.hp.com but NOT BOTH...
From:Del Cecchi
Subject:Re: [OT?] FBI Virtual Case File is even possible?
Date:Wed, 19 Jan 2005 13:19:47 -0600
Rick Jones wrote:
> del cecchi wrote:
>
>>Duh. Where have these idiots been? How do you estimate the cost of
>>doing what has never been done?
>
>
> The same way we all estimate the performance of systems that don't
> exist?-)
>
> rick jones

Nope. I know what the non existant system is supposed to do and how it
is supposed to work, so I can simulate its performance. Actually
implementing or designing such a system maybe in fact be impossible or
require invention.

An example "Wouldn't it be nice if we had a system that never had a
cache miss. What would be the performance of such a system?"

Now, build one. :-)
From:Terje Mathisen