Testing With "The Force"

Pablo · July 8, 2009, 12:00am

If you’re a nobody… how do you get attention?

Simple, just go against everything that a well-known person says even if they are completely right. There you go! You got your 15 minutes at last!

Guys let’s take this advice and ignore this guy.
http://www.codinghorror.com/blog/archives/001271.html

CheapW · July 8, 2009, 12:00am

Great article, Jeff. I love Star Wars, and I love the Star Wars/regex tie-in. I’ll be back.

CheapW · July 8, 2009, 12:00am

Whoops, sorry for the double post. Please delete.

JuanZ · July 8, 2009, 12:00am

@Dennis Forbes:
By the way, there are two Juans in the room: I’m Juan Zamudio, the other one is just Juan.
While I agree that most of the recent posts are not that valuable, I keep coming back hoping I can read another great post like in the good old days. But I find it interesting that you come back for more, and that after almost two years you have not found the time to remove codinghorror from iGoogle, given that you dislike this blog. That’s the point I wanted to make.
I also didn’t see the point of the Google-fu that you mention; it brings nothing to the table.

PS: I’m not a Jeff groupie. I found this blog by accident too (searching for something related to the Code Complete book; I’m a McConnell whore, I have to admit).

o_s5 · July 8, 2009, 12:00am

“Some people are so incredibly arrogant it amazes me.
Matt on July 8, 2009 9:18 AM”

Excellent, excellent point, Matt. I couldn’t have said it better myself.

JesseM · July 8, 2009, 12:00am

Another example of why regex is shit.
That regex is pretty much impossible to read unless you carefully split it up into smaller parts.

For this kind of thing you need a proper grammar.

Alex · July 8, 2009, 12:00am

In the grand tradition of homing in on something in a blog post that has nothing to do with the purpose of the post…

and are outdated. Instead we’re all supposed to use and

Alex · July 8, 2009, 12:00am

Sorry, I forgot: HTML doesn’t fly in comments.

What I was saying before: we’re not supposed to use [i] and [b] anymore; instead we’re supposed to use [em] and [strong].

DennisF · July 8, 2009, 12:00am

If you’re a nobody… how do you get attention?

Disagree with Jeff on his own blog! GENIUS! Then you can gain the attention of a bunch of people who through survivorship-bias (in that they continued to read it) are going to likely be fans of Jeff’s!

Somehow I don’t think that strategy is a very good avenue to fame. Gosh, I’m going to have to rethink this.

Simple, just go against everything that a well-known person says even if they are completely right

Sorry, friend, but I trod this ground half a decade ago - http://www.yafla.com/dforbes/The_Fallacy_of_Test_Driven_Development

I disagree with Jeff when I disagree with Jeff (somehow CodingHorror got on my iGoogle page, and I’ve been remiss in removing it, and every now and then I expand one of those nodes…). If this hurts your precious feelings, I would advise that you stop reading the comments.

Guys let’s take this advice and ignore this guy.

This is like those YouTube channels where people put a big notice at the top disclaiming that they don’t care what anyone thinks, which of course means that they desperately care what everyone thinks.

Honestly, I think Jeff should disable comments, because his biggest fans are his worst enemies, and they are the reason he often gets undeserved backlash. It’s like some sort of weird little groupie festival.

Someone1 · July 8, 2009, 12:00am

*text* or _text_ (or double) are not good choices for markup in an environment that is full of bad C code. Go for of some sort and sanitize the database by escaping the old posts, of course. These * and _ will just annoy everyone.

Someone

seth8 · July 8, 2009, 12:00am

Google fight is its own archenemy: compare

http://www.googlefight.com/index.php?lang=en_GB&word1=%22Jeff+Atwood%22&word2=%22Dennis+Forbes%22
to
http://www.googlefight.com/index.php?lang=en_GB&word1=Jeff+Atwood&word2=Dennis+Forbes

While I see the point Dennis is making, I often go with Jeff’s approach for testing
I go with the ‘Dennis’ (smart) approach always (mind the markup). THEN I always go and brute-force test in as many ways as possible. I’m always surprised by at least one edge case I missed. I try to make it a habit to think about why I missed the particular case initially. That way my reasoning and hit rate improve.

Kevin_H · July 8, 2009, 12:00am

This post reminded me of this article: http://blog.dotnetwiki.org/2009/01/16/NamedFormatsPexTestimonium.aspx where he used Pex to automatically generate test cases where the two implementations differ. Perhaps something like that would be of use for you.
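
For anyone who wants to try the "generate inputs and see where two implementations differ" idea without Pex, here is a rough sketch in Python. Everything in it is made up for illustration: `italic_regex`, `italic_scanner`, and `find_disagreements` are hypothetical names, and the toy `*text*` rule is far simpler than real Markdown. It brute-forces every string over a tiny alphabet and reports the inputs on which the two functions disagree.

```python
import itertools
import re


def italic_regex(text):
    """Toy regex version: turn *x* into <i>x</i> (not real Markdown)."""
    return re.sub(r"\*([^*]+)\*", r"<i>\1</i>", text)


def italic_scanner(text):
    """Toy hand-written version that is meant to behave the same way."""
    out = []
    i = 0
    while i < len(text):
        if text[i] == "*":
            j = text.find("*", i + 1)  # position of the next '*', or -1
            if j > i + 1:  # a closing '*' exists with at least one char between
                out.append("<i>" + text[i + 1:j] + "</i>")
                i = j + 1
                continue
        out.append(text[i])
        i += 1
    return "".join(out)


def find_disagreements(f, g, alphabet="*a ", max_len=4):
    """Try every string up to max_len over alphabet; return inputs where f != g."""
    bad = []
    for n in range(max_len + 1):
        for chars in itertools.product(alphabet, repeat=n):
            s = "".join(chars)
            if f(s) != g(s):
                bad.append(s)
    return bad


if __name__ == "__main__":
    print(find_disagreements(italic_regex, italic_scanner))
```

A small alphabet that includes the special characters tends to hit the awkward cases (`**`, `* *`, a trailing `*`) much sooner than hand-picked examples do, which fits the "always surprised by one edge case" experience described above.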

Someone2 · July 8, 2009, 12:00am

ah my tags tag was deleted fun

Tom_Dibble · July 8, 2009, 12:00am

First, the folks debating how to make sure the ‘*’ block is surrounded by either whitespace or the start/end of lines … doesn’t the regexp library being used support ‘word boundary’ matches? “\b” is usually it (http://www.regular-expressions.info/wordboundaries.html).

Second, I echo the concerns about trying to handle this as a regular expression problem, when it’s quite obviously a language grammar parsing problem, more likely to be satisfactorily solved using a BNF or PEG grammar.

Third, and most importantly, why are you eschewing libraries which are out there to do exactly this? I mean, one of the advantages of using a quasi-standard like markdown is that everyone and their mother has made a parser of some sort for it already. Don’t waste time reinventing the wheel!

An example PEG grammar for Markdown: http://github.com/jgm/peg-markdown/blob/master/markdown_parser.leg

You’ll need to use something like ANTLR to generate your C# parser code from that .leg file, but that should be a WHOLE lot easier than even what you’ve already done with regular expressions.

Fourth, I think the use of two different ways to do a very simple thing (‘*’ and ‘_’, and ‘**’ and ‘__’) is Just Plain Wrong. Provide one way to make bold, and one to make italics. Makes it less likely we’ll hit the other case by mistake. IMHO, the ‘*’ is the most used one and least likely to cause problems.

Finally, I agree with other posters that markdown’s choice of ‘*’ for italics and ‘**’ for bold is braindead (sorry, Gruber!). It should have been ‘/’ and ‘*’ instead. But, at this point, markdown is markdown, and you don’t want an exception on your one site.
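
On the word-boundary point, one caveat worth checking: `\b` sits between a word character and a non-word character, and `*` is itself a non-word character, so `\b\*` only matches a `*` that directly follows a letter or digit. That is the opposite of "preceded by whitespace or start of line". A rough sketch of an alternative using lookarounds, written with Python's `re` module rather than the .NET regex library the post uses; the pattern and the `italicize` name are assumptions for illustration, not anyone's actual implementation:

```python
import re

# Hypothetical pattern: *text* becomes <i>text</i> only when the whole block
# is bounded by whitespace or the start/end of the string, and the text
# neither starts nor ends with whitespace or another '*'.
ITALIC = re.compile(r"(?<!\S)\*([^\s*](?:[^*]*[^\s*])?)\*(?!\S)")

# For comparison: a '*' that sits on a word boundary, i.e. right after a
# word character.
BOUNDARY_STAR = re.compile(r"\b\*")


def italicize(text):
    """Apply the hypothetical whitespace-bounded italic rule."""
    return ITALIC.sub(r"<i>\1</i>", text)


if __name__ == "__main__":
    for sample in ["a *b* c", "*hello world*", "2*3*4", "* not italic *"]:
        print(repr(sample), "->", repr(italicize(sample)))
    print(BOUNDARY_STAR.search("a*b"))   # matches: '*' follows a word character
    print(BOUNDARY_STAR.search(" *b"))   # no match: '*' follows a space
```

The `(?<!\S)` and `(?!\S)` lookarounds read as "not preceded by a non-space" and "not followed by a non-space", which covers both whitespace and the start or end of the input in one go.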

jasonmray · July 8, 2009, 12:00am

I’ve never quite understood why simple HTML markup is considered “inhumane”. What, really, is the difference between these:

*italic*
_italic_
<i>italic</i>

Why come up with some complicated regex filter to convert some contrived markup to HTML, when the original HTML was designed to be simple and human readable to begin with?

In almost all cases where I’ve seen this “markdown” style of formatting, there’s some big filter up front that automatically strips out all possible remnants of HTML as part of some cargo-cult security mechanism. Why not just modify the HTML filter to allow basic bold and italic tags through?
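
A minimal sketch of what "let basic bold and italic tags through" could look like, in Python. The `sanitize` function and the `ALLOWED` list are hypothetical, not taken from any real comment system: everything is escaped first, and then only exact, attribute-free tags from the allowlist are restored.

```python
import html
import re

# Hypothetical allowlist of bare formatting tags.
ALLOWED = ("b", "i", "em", "strong")

# Matches an escaped opening or closing tag with no attributes,
# e.g. "&lt;i&gt;" or "&lt;/strong&gt;".
_ESCAPED_TAG = re.compile(r"&lt;(/?)(%s)&gt;" % "|".join(ALLOWED), re.IGNORECASE)


def sanitize(comment):
    """Escape all HTML, then restore only the allowlisted bare tags."""
    escaped = html.escape(comment, quote=True)
    return _ESCAPED_TAG.sub(r"<\1\2>", escaped)


if __name__ == "__main__":
    print(sanitize("<i>fine</i> <script>alert(1)</script>"))
    print(sanitize('<b onclick="evil()">nope</b>'))
```

Escaping first and un-escaping second means anything the pattern does not recognize stays inert, including tags with attributes. This sketch does not check that tags are balanced, so an unclosed `<i>` would still leak into the rest of the page; a real filter would need to handle that.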

JM14 · July 8, 2009, 12:00am

I agree. This all seems way too complicated.

DennisF · July 8, 2009, 12:00am

The basics of Markdown – the parts that Jeff is trying to capture, it seems – do have a certain elegance, paying homage to a less advanced era: when all you had was ASCII, it was generally agreed that you could emphasize certain words, and draw attention to others, with nothing more than appropriately placed characters. For those with such a habit, Markdown semantically draws from what they are used to.

I have seen a lot of sites that allow either Markdown, HTML, or some other bastardization. The back-end process was always Markdown (where used) -> HTML -> correctness checker, so it is a concise set of code.

John_Topley · July 8, 2009, 12:00am

Why aren’t you using the nice semantic element, instead of the old, presentational element?

John_Topley · July 8, 2009, 12:00am

Let’s try again. Why aren’t you using the nice semantic “em” element, instead of the old, presentational “i” element?

StevenL · July 8, 2009, 12:00am

lol - you chose the Dark Side when you decided to use regex to parse markdown in order to solve a problem that didn’t need either regex or markdown

but regression testing is always a good thing