Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×
Microsoft Software Technology

Opera CTO Hits Back at Microsoft's Standards Push 246

Michael writes "Opera CTO Håkon Wium Lie hit back today at Microsoft's push to fast track Office Open XML into an ISO standard, in a blistering article on CNET. He also took a swipe at Open Document Format: 'I'm no fan of either specification. Both are basically memory dumps with angle brackets around them. If forced to choose one, I'd pick the 700-page specification (ODF) over the 6,000-page specification (OOXML). But I think there is a better way.' The better way being the existing universally understood standards of HTML and CSS. Putting this to the test, Håkon has published a book using HTML and CSS."
This discussion has been archived. No new comments can be posted.

Opera CTO Hits Back at Microsoft's Standards Push

Comments Filter:
  • fsck'n ugly (Score:5, Insightful)

    by Anonymous Coward on Saturday February 24, 2007 @02:37AM (#18132152)
    Yeah, but that "book" is fsck'n ugly. It doesn't even compare to a professionally typeset book, or something produced in LaTeX. I hope that isn't the "solution" to this standards "problem". Let's face it, the average Joe is going to use whatever Microsoft pushes at them. Case closed.
  • by Tablizer ( 95088 ) on Saturday February 24, 2007 @02:38AM (#18132154) Journal
    "Both are basically memory dumps with angle brackets around them."
  • CSS for Documents? (Score:5, Insightful)

    by zaydana ( 729943 ) on Saturday February 24, 2007 @02:42AM (#18132174)

    Having a word processor act more like a web browser would be awesome. Ever since I started using word processors (which for me was a long time after I started using web browsers), i've always thought, why doesn't updating this style make all text with that style update? Why do I always have to change the same thing over and over again?

    While turning word processors into web browsers would be stupid, things like CSS would be awesome to have in word processors.

  • by Evardsson ( 959228 ) on Saturday February 24, 2007 @02:44AM (#18132190) Homepage
    While I do agree that the ISO doesn't need more than one standard for printable documents, I don't think that Håkon Wium Lie is on the right track with HTML/CSS for print.

    Sure, it works, with enough tweaking, and CSS3, and a $350 download of a product to turn HTML/CSS3 into a PDF. This is better how? What about LyX, LaTeX, or even OpenOffice if you are just going to convert to PDF?

    The whole HTML/CSS-to-print thing shoots the real argument in the foot.
  • by 8-bitDesigner ( 980672 ) on Saturday February 24, 2007 @02:50AM (#18132216) Homepage
    Hmm... both of these standards suck. I know what, we need another choice!

    Somehow I don't think that's going to fix the problem. Oh, and pointing out that the Microsoft letter doesn't validate. Isn't that a little petty?
  • by the_womble ( 580291 ) on Saturday February 24, 2007 @02:59AM (#18132250) Homepage Journal
    Latex: its not that hard to learn.

    Lyx provides a GUI front end, but you lose a lot of flexibility.

    Texmacs might work for you as well, although I found it very clunky.
  • Re:fsck'n ugly (Score:5, Insightful)

    by AKAImBatman ( 238306 ) * <akaimbatman@g m a i l . c om> on Saturday February 24, 2007 @03:02AM (#18132278) Homepage Journal

    Yeah, but that "book" is fsck'n ugly. It doesn't even compare to a professionally typeset book, or something produced in LaTeX.

    You don't typeset with Microsoft Word, either. Which makes the entire argument specious. Word processors like MS Word and OOo Writer are for creating common documents like letters, memos, and maybe the occasional flyer. Neither one is particularly good at anything even close to professional publishing work. Even the book authors just use Word (or surprisingly, OOo Writer!) to do the text content. That text is then exported to a more sophisticated program, where the actual typesetting and page layouts are done.

    I think this fellow's point is that HTML/CSS formats can store any information that a Word Processor might need to store, with no need to invoke new technologies. To a certain extent, he may be correct. Unfortunately, HTML/CSS may make a good intermediary format, but it is not particularly good from a performance or usability perspective. Then again, XML formats in general are fairly poor choices for the same reason.

    I think if we want to break this conundrum, the industry is going to have to learn how to keep local data stores that are of high performance, while exporting intermediary formats when emailing or uploading to external computers. The only problem is finding a way of doing this so that it's completely transparent to users. The mythical "mom" doesn't want to worry about emailing a document in the right format, or having the right program to read the attachment she received. She just wants it to do what she tells it, with no bloody prompting with questions she has no answers for.
  • by SanityInAnarchy ( 655584 ) <ninja@slaphack.com> on Saturday February 24, 2007 @03:23AM (#18132352) Journal
    Only problem is, the Oasis page itself doesn't validate. However, it seems Wikipedia does...

    But if the Oasis pages did validate, the basic argument goes like this: "How can they claim to care about standards if they can't even bother to support that most universal standard of standards, HTML?" And indeed, I could still make that argument -- just look at the sad, sad state of affairs that is Internet Explorer's CSS [mis]handling.
  • by sweetooth ( 21075 ) on Saturday February 24, 2007 @03:27AM (#18132362) Homepage
    Keep in mind this was published by a bigwig at Opera. The Opera web browser tends to stay way ahead of the other browsers in terms of standards compliance. This includes things like the ability to use the page elements to force page breaking and to help create layouts useful for things like books, reports, etc. Opera is a great engine for rendering HTML & CSS, I personally just can't get past the UI.
  • Can != Should (Score:3, Insightful)

    by gbulmash ( 688770 ) <semi_famous@y a h o o . c om> on Saturday February 24, 2007 @03:58AM (#18132438) Homepage Journal
    Been a long time since I typeset anything, but I used Adobe Pagemaker when I typeset a couple of college magazines in the mid-90s and FrameMaker when I was maintaining courseware in the late '90s for Nortel.

    HTML + CSS vs. Word vs. OO.o seems to me to be an argument related to formatting documents, not a "book". It's not that you couldn't do it, but I'd consider using Quark or InDesign (what seems to be Adobe's successor to PageMaker) or even Tex and its variants (haven't used any Tex-based stuff, but heard wonderful things) for typesetting.

    Arguments about standards aside, proof of concepts aside, I'd think that the real issue when it comes to any job is using the best tool for it. It's not a question of whether you can use these tools to typeset a book, but if you should.

    The point of the proof of concept is to prove that the system is flexible or capable enough to go beyond its original intended use. I get that. But proving a chainsaw can be used to spread butter, doesn't mean it's inherently superior to a coping saw.

    - Greg
  • by mennucc1 ( 568756 ) <d9slash@mennucc1.debian.net> on Saturday February 24, 2007 @03:58AM (#18132440) Homepage Journal
    An extract of H Wium arguments:

    ODF is an XML-based dump of the internal data structures of OpenOffice, while OOXML is an XML-based dump of the internal data structures of Microsoft Office.

    In 2006, a year or so after ODF entered the fray, Microsoft submitted OOXML to the standardization process. Are we seeing a pattern here? Is Microsoft undermining standards by submitting them? Could it be that it wants both ODF and OOXML to fail?
    so Wium proposes to build a new standard from scratch , starting from HTML and CSS ; but, recognizing that they would not cover all "Office" documents, he goes to saying

    Additional semantics (say, formulas in spreadsheets) can be encoded as attributes, as do microformats, and CSS 3 offers advanced features for printing (e.g., footnotes and header and footers).
    My thoughts:
    • Suppose MicroSoft were to listen to Wium (which they wont). Guess what ? Those additional fields containing formulas (and anything else that makes {MS,Open}Office much more useful than HTML) again would be just an XML-based dump of the internal data structures of so and so.
    • I dont like , more in general this article. Wium is saying that MicroSoft is proposing OOXML to kill ODF ; and at the same time he is proposing to kill ODF in favour of a non-existent extension of HTML+CSS. It is like the guy saying : "I dont like the power plugs in my new house, lets tear the house down and rebuild it" , and at the same time saying "why are they taking so much time to build the house?". Suppose MicroSoft would use arguments as those by Wium to convince ISO to reject ODF and then start a new draft based on HTML, drafted in cooperation between MicroSoft and other partners (including OpenOffice). That would really kill any hope of an ISO standard for "office" documents.
  • by zerblat ( 785 ) <jonas.skubic@se> on Saturday February 24, 2007 @04:04AM (#18132470) Homepage
    HTML sucks for books. The reason is simple. HTML was designed for web pages. HTML does a fairly good job of covering the things you need when you create a web page (although, why is there no <menu>, and a bunch of other stuff that need to be fudged by using elements that don't really fit). In HTML there is no <chapter>, no <footnote>, no <toc>, no <index>. Also, with HTML, one file == one document. If you're writing a book, it would be nice to be able to for example have one file per chapter and include them all in a master file (assuming you're writing your HTML by hand, of course). That's not possible with HTML.

    It would be possible to extend HTML to include such features or to create a HTML-like format that is more suitable for books (cf docbook). I agree that "word processors" today are a horrible mess, and we definately need something like a modernised LaTeX, but HTML isn't it.
  • Re:fsck'n ugly (Score:5, Insightful)

    by Anonymous Coward on Saturday February 24, 2007 @04:26AM (#18132554)

    I don't see (x)html + css as being the answer either:
    Only because you can't tell the difference between "XHTML + CSS" and "web pages".

    -too many versions of html (4, and perhaps 5 soon) and xhtml (1.0, 1.1, strict, transitional, etc)
    So? Pick one as your word-processor standard, and rule all the others out. The existence of too many versions of MS Word doesn't seem to have hurt the .doc format.

    -different versions of CSS, browser support for it varies quite a bit (and is pretty much non-existent for CSS3)
    What does browser support have to do with word processing? We're talking about word processors, not web sites.

    -too many rendering engines, css hacks required so the content displays the same in most of them, etc
    And this is different from word processors how? Microsoft's XML format is absolutely crammed full of hacks to duplicate obscure rendering features of obsolete versions of Word, WordPerfect, etc. And it would surprise me very much if the rendering of ODF was pixel-identical between all the products that support it.

    -html/css sucks at MANY things - how about a self-updating TOC? (don't even try to say some javascript parsing the DOM for header tags with certain IDs to generate it dynamically!)
    You're thinking of web pages, not HTML. HTML used for a document could easily have an auto-generated table of contents. Remember that we're talking about using HTML as the file format for a word processor. A word processor can trivially parse the DOM for header tags and update a table of contents without requiring any JavaScript at all. It's kind of what word processors are for.

    Hell, how can you even tell the page numbers in a html "document" anyways?
    By looking at the little "Page N of N" display in your word processor, I would assume.

    -while word/OOo formats aren't real typesetting (like InDesign CS2 would do), at least they have half-way decent typography. Yeah, no fancy glyphs or super precise kerning, but it's still usable. On the web there's only a handful of "just OK" fonts one can use (unless everything is rendered server-side as images).
    What does "on the web" have to do with word processors? We're not talking about the web here. We're talking about word processors, which will have access to all the fonts the user owns, just like any other application.

    -if people use html/css, there would basically be no standards *at all* or anything even resembling it (much like anything we see on the web).
    Why not? We're talking about word processors, not the web. We're talking about computer-generated HTML, not something some 13-year-old hacked together by copying-and-pasting examples into Notepad. It would be trivial to enforce valid XHTML 1.1 + CSS2.1, for example.
  • by Anonymous Coward on Saturday February 24, 2007 @04:40AM (#18132594)
    So say you can absorb 30 pages a day, thats 200 days to read the spec.

    Oh and the spec is defined to fit an existing product, so that product fits the spec and there are unspecified patent hurdles attached to it. Wow which idiot would fall for that one.
  • Too true (Score:4, Insightful)

    by iamacat ( 583406 ) on Saturday February 24, 2007 @04:41AM (#18132598)
    700 pages is not understandable by anyone but authors. "C programming language" book is 1/3 in size, have endured for 20 years and was instrumental in solving many more problems than word processing. Also, creating an ODF document is a minor function in most applications and is not worth the effort to understand such a huge standard. Proponents of both standards should come up with a modular design instead. At the base level, stick with basic HTML - bold and italic tags, fonts and sizes, paragraph breaks. Define many extensions that can be implemented independently or in any combination, in a manner convenient for both computers and, in a pinch, humans. Opera guy is biased as well - while basic HTML is great at its limited function, CSS is not very readable by humans. Nor does it solve pagination, collaborative editing, resolution independence, color profiles for printing...
  • Re:How come? (Score:3, Insightful)

    by _Shad0w_ ( 127912 ) on Saturday February 24, 2007 @05:14AM (#18132672)

    My speculation would be that no-one wants to sit and read a 6,000 page specification. 700 pages is far more palletable.

    It's a crap way of judging the relative merits of specifications, but human nature will out.

  • Re:huh? (Score:3, Insightful)

    by kestasjk ( 933987 ) * on Saturday February 24, 2007 @05:58AM (#18132836) Homepage
    CSS would be a great standard, but it leaves too much to the people who implement it; is this a block type or inline? What should the default for this nonstandard tag be? etc, etc.

    If they spelled everything out without any ambiguity it would make a better standard.. but then it would be another "600 page long" standard with is what he seems to be against in the first place.
  • by panaceaa ( 205396 ) on Saturday February 24, 2007 @06:08AM (#18132876) Homepage Journal
    Why is anyone even talking about the opinion of a CEO? Opera is an HTML company -- they make HTML browsers. Why would the CEO of Opera have anything objective to say about OOXML or OpenXML? He wouldn't, which is why his pushes his own company's core competency: HTML. While Opera doesn't have a huge market share, if the market for HTML viewers grows, his company's likely to take a piece of that pie. But it's completely bunk because HTML's a mess of different standards, with many people using HTML 4.01 Transitional to this day, and the idea of people adopting CSS3 and writing documents using HTML is pretty far fetched. But you would never hear that from the CEO of an HTML browser company.
  • Re:fsck'n ugly (Score:3, Insightful)

    by vtcodger ( 957785 ) on Saturday February 24, 2007 @07:16AM (#18133066)
    ***I think this fellow's point is that HTML/CSS formats can store any information that a Word Processor might need to store, with no need to invoke new technologies. To a certain extent, he may be correct. Unfortunately, HTML/CSS may make a good intermediary format, but it is not particularly good from a performance or usability perspective. Then again, XML formats in general are fairly poor choices for the same reason.***

    The M in HTML stands for MARKUP. And it means it. HTML is NOT a layout language. Never has been, and apparently never will be despite unending attempts to use it for page layout. In fact, HTML documents look different in every browser -- which is not, I think, a characteristic that most users are going to desire for a large subset of documentation. How, for example, can you specify a an OCRable form, if the rendering program is free to move the damn boxes around?

    If someone would like to propose an standards based HTLL that focuses on document layout, they have my support. I don't care if it is XML based. Just that it works, is reasonably concise, everyone uses it, and that it replaces PDF as a vehicle for specifying documents that need to be rendered pretty much exactly as the author specified them.

    While I'm sure that an HTLL specification would be lengthy, I don't think it needs to embody every quirk of every version of Microsoft Word or Open Office.

  • Re:fsck'n ugly (Score:5, Insightful)

    by Lost my low ID nick ( 1035980 ) on Saturday February 24, 2007 @07:24AM (#18133080)
    So, McSmarty, how do I
      - position an image on page 4 of my document?
      - add footnotes?
      - embed fields (date, last editor...)?
      - mark the embedded TOC as TOC so that it gets regenerated on reload?
    etc.

    And on the CSS side, there are quite a lot of shortcomings, too.

    Of course, all of this would work with custom XML tags or special id/class conventions, BUT then you'd have to specify those. And getting this below 700 pages won't be easy.

    So repeat after me:

    HTML is *not* a description language suitable for word processing in its current state, and it is unclear it can be made so without sacrificing device indepence.
  • by 1u3hr ( 530656 ) on Saturday February 24, 2007 @07:36AM (#18133118)
    though at least before 2007 (which I haven't used so can't comment on) they haven't done much to bring attention to the feature

    Word DOS (version 4 at least) had it back almost 20 years ago. And actually it was much easier to use styles back in the DOS version. Current versions try so hard to second guess you in the quest for user-friendliness and layering features on top of features that you can change or create new styles without knowing or intending to. Old-school required you to RTFA, but then you could use styles very efficiently. Now styles are much more sophisticated, but hardly anyone uses them correctly. I get docuements from all kinds of people, including many university lecturers. None, out of hundreds over the last 15 years, has had a clue of how to style their documents. Headings are "Normal" with font commands to make them large; body text is "Heading 1" converted to 12-point Times; bulleted and numbered lists are a minefield, tables are a quagmire of hacks, spaces and tabs, etc...

  • by Kjella ( 173770 ) on Saturday February 24, 2007 @08:59AM (#18133420) Homepage
    Having a word processor act more like a web browser would be awesome. Ever since I started using word processors (which for me was a long time after I started using web browsers), i've always thought, why doesn't updating this style make all text with that style update? Why do I always have to change the same thing over and over again?

    Every word processor I've seen like forever has support for styles. The problem is:

    1) It's impossible to avoid creating a million new styles by accident. Try looking at the styles list and you'll see it's full of junk
    2) It's impossible to clean up a document with such a bunch of styles, for example say you have a document which has been completely fucked up with pseudo-styles. You've set "Normal" to be what the bulk text should be, and "Headings" to what they should be. What happened last time I tried it? Well, it was impossible to easily apply it without killing any bullet lists, bold, italics or any other intended variation of the normal text. Headers and numbering went beserk. Trying to do the same with the bullet list style lead to numbers going completely nutzoid, for some reason it thought everyone in the same style belonged to the same list so later lists would start at some random number.
    3) If you for some reason is stuck copying between different versions of Word (norwegian and english comes to mind) then you'll have double the number of styles, which obviously aren't in synch.

    So to sum it up what I would like:
    1) Don't auto-create styles
    2) This sentence does not contain three styles
    3) Sane "apply style" functions
          - Parituclary directed at fixing a mess
    4) Make styles have an ID, at least for the default ones make them international so header 1 is header 1 in every language
    5) Ability to "style-lock" documents for things like company standards, you can create new styles but not just randomly change around sizes and fonts
    6) More visible styles (OpenOffice does this, MS word doesn't) because people don't see them
  • by AlXtreme ( 223728 ) on Saturday February 24, 2007 @10:20AM (#18133704) Homepage Journal

    Great typesetting, collaborative book editing, screw LaTeX!

    Those who don't understand LaTeX are doomed to reinvent it... poorly.
  • Re:fsck'n ugly (Score:4, Insightful)

    by Mateo_LeFou ( 859634 ) on Saturday February 24, 2007 @10:36AM (#18133786) Homepage
    "The mythical "mom" doesn't want to worry about emailing a document in the right format, or having the right program to read the attachment she received. She just wants it to do what she tells it, with no bloody prompting with questions"

    No offense, but I'm getting sick of this line of reasoning. You're right, mom wants the computer to read her thoughts, know exactly what she really meant when she said X, anticipate every need she might have, and pre-calculate its complexity out of existence.

    In other news, my boss would like this entire website built in one hour ($40), never need support, and scale to 300,000 users.

    At a certain point IT's job goes from "give every user what heshe wants" to "educate users about what is feasible in the current technological situation.
  • by Luke ( 7869 ) on Saturday February 24, 2007 @12:39PM (#18134466)
    Using a word processor to write a book is like using stone tablets and and abacus for spreadsheets. You really ought to look at markup-based typesetters like LaTeX or DocBook or software specifically designed for book production.
  • Re:fsck'n ugly (Score:4, Insightful)

    by TheoMurpse ( 729043 ) on Saturday February 24, 2007 @12:53PM (#18134556) Homepage

    So, McSmarty, how do I
        - position an image on page 4 of my document?
        - add footnotes?
        - embed fields (date, last editor...)?
        - mark the embedded TOC as TOC so that it gets regenerated on reload?
    I'm on your side in this debate, but as a web dev I have knowledge over these things which you apparently do not. To embed a field, how about <meta name="author" content="TheoMurpse">. As for marking the embedded TOC, how about <div id="TOC">? For positioning an image on page 4, well, I don't know if you've ever looked at a DOC or ODT file, but the file itself says nothing about where page 3 ends and page 4 begins. Instead, you see that once the word processor has rendered the file. Thus, I see no difference between HTML and any other format. Hell, I don't even know if you can say "put this on page 4" in a LaTeX document. First of all, you'd never want to put it on page 4. Instead, you'd want to put it in between other elements, which may end up placing it on page 4, but then when you update your text on page 3, it may cause the image to need to be on page 5.

    Footnotes are easy, too: Text Text that needs a footnote.<div class="footnote">This is the footnote</div>. That's the same concept as in LaTeX, the best typesetting software out there.
  • Re:How come? (Score:3, Insightful)

    by jlarocco ( 851450 ) on Saturday February 24, 2007 @02:37PM (#18135294) Homepage

    I'm no programmer but it wouldn't take me a whole lot of time to write a basic parser.

    Well, the basic parser isn't really an issue. I haven't investigated either standard in any detail, but assuming they're actual XML, or even reasonably close, there are a million libraries that can handle the parsing. Expat, Xerces, Arabica, the Qt XML parser, and the Java library XML parser come to mind.

    The majority of the work is interpretting the tags and actually laying out the document in a standardized way. I can already load a Word *.doc file into OpenOffice and have it look relatively close to how it looks in Word. The reason it only looks "close" is that the Word .doc format isn't documented, so developers for competing products have to make some assumptions and ignore some stuff they can't figure out. Having a standard is supposed to rectify that.

    Ideally, with a standardized format, a document would look identical in any word processor that supported all the features used by the document. That's the whole point.

    Off-topic: I absolutely hate when people make statements like "I'm no programmer, but I could write that software in very little time." Contrary to popular belief, programming isn't trivial. Sure, you may be able to write the parser in very little time, but would other people want to maintain your code? Would people reading your code even be able to tell what you're doing? Would it have fewer bugs than code written by actual programmers? Would it be fast enough? Would you know what to do if it wasn't? Sorry, it's just a pet peeve.

Thus spake the master programmer: "After three days without programming, life becomes meaningless." -- Geoffrey James, "The Tao of Programming"

Working...