Office 2007 Fails OOXML Test With 122,000 Errors 430
I Don't Believe in Imaginary Property writes "Groklaw is reporting that some people have decided to compare the OOXML schema to actual Microsoft Office 2007 documents. It won't surprise you to know that Office 2007 failed miserably. If you go by the strict OOXML schema, you get a 17 MiB file containing approximately 122,000 errors, and 'somewhat less' with the transitional OOXML schema. Most of the problems reportedly relate to the serialization/deserialization code. How many other fast-tracked ISO standards have no conforming implementations?"
Does anyone know if Open Office is compliant with (Score:5, Interesting)
HTML (Score:5, Interesting)
I Remember When... (Score:3, Interesting)
What most made me smile was that the IBM EGA card was included in the matrix of results, showing a rating of 100% compatibility with itself.
Validates better against the TRANSITIONAL spec (Score:5, Interesting)
In other words, if you're validating against the TRANSITIONAL spec, the OOX documents aren't horribly far off. And it's wrong in such a way that's easy to compensate for in code (i.e. check for "true|on" for a truth value). That's a markedly different situation than described by the headline's "'somewhat less' with the transitional OOXML schema" claim.
And in case anyone claims that ODF doesn't have the same sort of problem, I refer you to AbiWord bug 11359 [abisource.com]/OpenOffice bug 64237 [openoffice.org]. This one is a show-stopper.
Re:You're missing the point of an ISO standard (Score:5, Interesting)
You need at least one coded reference implementation or else you'll end up with something in the standard which is difficult/impossible to implement. Especially in a 6,000+ page standard.
ISO would be well advised to take the method the IETF uses, which is to have two independent teams implement the standard based on the documentation before an RFC can reach a Draft Standard status. I suspect ODF would have only benefited from this process by cutting down its rough edges, while OOXML would have been so cumbersome that it would be simply dropped.
hmmm... 122k errors (Score:3, Interesting)
Re:Referenced article promotes a bogosity. (Score:3, Interesting)
The original mean went through Paris, but shifted to Greenwich as a result of the aforementioned invention, and the naval political pull the British earned as a result.
Yes, I think so. (Score:3, Interesting)
ODF is the tip of a very big iceberg [theregister.co.uk]. It's an important and public facing tip but it is a small part of both government and business wasting money on the upgrade treadmill and all the intentional waste of M$ Office. It's all downhill from here.
well... (Score:4, Interesting)
C++?
Try out the "export" keyword next time you write any C++.
Re:OOXML is such a Fraud! (Score:3, Interesting)
Re:Does anyone know if Open Office is compliant wi (Score:5, Interesting)
Re:That is an improvement (Score:5, Interesting)
The point of the article is that MS Office isnt conformant to the STRICT version. This shouldnt come as a surprise, as the change from the original OOXML to the strict version happened, but no new versions of MS Office have been released. The best thing anyone could reasonably expect of a company is that they would update it in the next Office 2007 service pack.
Office comes in a 2-4 year release cycle, and the change in ISO from the transitional version to the strict version happened after Office 2007 SP1 was already done.
How could MS have known in advance the changes that would happen to the standard? They cant see into the future.
Dont forget here that the STRICT version is NOT representative of what any version of office produces. We already knew that.
It was an ISO evolution of the submitted version (the transitional one). The vendor would need some time and a release cycle to adapt their products to it.
What _will_ be interesting is how/when/if MS does conform to the strict format.
On the other hand, the MS Word conformance to the transitional format seems reasonable. TFA only noted one problem, where an attribute value was using on/off rather than true/false. This is minor and easily fixed and/or recorded as a known issue.
Re:Stop using MiB (Score:5, Interesting)
So, we had a problem where different tools and formats defined it different ways. For a number of years, QuickTime used K=1024, while Windows Media and RealMedia used K=1000. Unless you were using Sorenson Squeeze, which "corrected" its Windows Media and RealMedia values by 1.024 so they matched the QuickTime files sizes!
Horrible.
Fortunately, the compression world has standardized on power-of-10 numbers, since that's what the MPEG standards and, well, all the professionals use.
So, now we have to do with complainsts about the mismatch between encoding a file that should be "4 GB" but doesn't fill up "4 GB" of drive space...
Sorry, 1024's got to be a KiB. No other feasible solution at this point, unless we decide to stop having computers talk to each other...
Re:Stop using MiB (Score:3, Interesting)
Re:Really? (Score:3, Interesting)
You created this account as a clever variation on westlake [slashdot.org], just like your Mactrope [slashdot.org] troll account was intended to be confused with Macthorpe [slashdot.org].
That makes it six sockpuppet accounts so far. To repeat what I've been asking you [slashdot.org], how long do you figure this can go on?
Re:Stop using MiB (Score:3, Interesting)
But still, if we assume a byte to be one unit we can as well use powers of ten.
Of course you could argue that the tendency of certain things (like RAM chips) to have sizes that are powers of two might imply using a power of two in language usage. But then again, lots of other things don't use power of two (e.g. most storage media and almost everything transmission-related). Who prevails? Do we follow RAM usage and have non-fitting storage and transmission? Do we follow storage/transmission and have non-fitting RAM? Do we follow xkcd and settle on 1012 bytes per kilobyte?
Or, of course, we just use unambiguous prefixes so people know which base we use. If you don't like "kibibyte" you can lobby IEC to instead adopt "computer science (not storage) kilobyte (CSkB)" and "general standard kilobyte (GSkB)".
Re:What's the Problem? (Score:3, Interesting)