|
|
|
|
|
|
Hotel cut-off :
October 27, 2006 |
|
|
|
|
| |
|
|
|
|
|
Venue: |
|
|
|
|
 |
|
|
Hilton Washington
Dulles Airport |
|
| |
|
 |
|
| |
|
|
Program
Wednesday, November 15, 2006
|
09:00-10:30 |
MORNING
TUTORIALS |
|
Presenter:
Asmus Freytag
President
ASMUS, Inc. (09:00-09:45) |
Track 1:
Unicode 5.0 Tutorial: Part 1 -
Characters in Action
Part I of the Unicode 5.0 Tutorial is a uniquely accessible and
entertaining way of visualizing the core concepts of the Unicode
standard. In this part you will find answers to these questions: What
is a Unicode character and how are Unicode characters represented and
used in a modern computing environment? How are Unicode characters
entered into and displayed on a computer? How are Unicode characters
interchanged? What is the interaction between Unicode and rich text
(markup)? How do end-users experience Unicode? Throughout Part I, the
Unicode 5.0 Tutorial gives typical examples of how the Unicode
Standard interacts with the other elements of an internationalized
software architecture. With the help of concrete scenarios for the use
of Unicode characters you will become familiar with the role the
Unicode Standard plays and the benefits of supporting it. Part I of
the tutorial provides a concrete context to which the more systematic
and detailed treatment of the features of the Unicode Standard
presented in Part II and Part III can be related.
|
|
|
Presenter:
Addison Phillips
Internationalization Architect
Yahoo! |
Track 2:
Internationalization:
An Introduction
What is internationalization? What do developers, product managers, or
quality engineers need to know about it? How does a software
development organization incorporate internationalization into the
design, implementation, and delivery of an application? This tutorial
provides an introduction to the topics of internationalization,
localization and globalization. Attendees will understand the overall
concepts and approach necessary to analyze a product for
internationalization issues, develop a design or approach, and deliver
a global-ready solution. The focus is on architectural approaches and
general concepts, but will include specific examples and exercises.
Some of the topics covered will include: character encodings and
Unicode; processing text in different languages; preparing for the
localization (translation) of user interfaces; making applications
"locale-aware", including format and display differences; as
well as approaches to delivering multi-lingual and multi-locale
software or content. |
|
|
Presenter:
Tex Texin
Internationalization Architect
Yahoo!
|
Track 3: Web Internationalization - Standards
and Best Practices
This tutorial is an introduction to
internationalization on the World Wide Web. The audience will learn
about the standards that provide for global interoperability and come
away with an understanding of how to work with multilingual data on
the Web. Character representation and the Unicode-based Reference
Processing Model are described in detail. HTML, XHTML, XML (eXtensible
Markup Language; for general markup), and CSS (Cascading Style Sheets;
for styling information) are given particular emphasis. The tutorial
addresses language identification and selection, character encoding
models and negotiation, text presentation features, and more. The
design and implementation of multilingual Web sites and localization
considerations are also introduced.
|
|
|
|
|
10:30-10:45 |
Morning
Refreshments |
|
|
|
|
10:45 – 12:15 |
MORNING TUTORIALS (Cont'd.) |
|
Presenter:
Richard Ishida
Internationalization Activity Lead
W3C
(09:45-12:15) |
Track 1: An Introduction to Writing Systems &
Unicode
The tutorial will provide you with a good understanding of the many
unique characteristics of non-Latin writing systems, and illustrate
the problems involved in implementing such scripts in products. It
does not provide detailed coding advice, but does provide the
essential background information you need to understand the
fundamental issues related to Unicode deployment, across a wide range
of scripts. It has also proved to be an excellent orientation for
newcomers to the conference, providing the background needed to assist
understanding of the other talks! The tutorial goes beyond encoding
issues to discuss characteristics related to input of ideographs,
combining characters, context-dependent shape variation, text
direction, vowel signs, ligatures, punctuation, wrapping and editing,
font issues, sorting and indexing, keyboards, and more. The concepts
are introduced through the use of examples from Chinese, Japanese,
Korean, Arabic, Hebrew, Thai, Hindi/Tamil, Russian and Greek. While
the tutorial is perfectly accessible to beginners, it has also
attracted very good reviews from people at an intermediate and
advanced level, due to the breadth of scripts discussed. No prior
knowledge is needed. |
|
|
|
|
|
Track 2 - Internationalization: An Introduction (Cont’d) |
|
|
|
|
|
Track 3 -
Web Internationalization - Standards and
Best Practices (Cont’d) |
|
|
|
|
12:15-13:15 |
LUNCH |
|
|
|
|
13:15-15:30 |
AFTERNOON
TUTORIALS |
|
Presenter:
Asmus Freytag
President
ASMUS, Inc. |
Track 1 - Unicode 5.0 Tutorial:
Part 2 - Fundamental Specifications
Part II of the Unicode 5.0 Tutorial builds on the concepts introduced
in Part I and systematically presents the details of fundamental
specifications that are part of the Unicode Standard. Topics include:
organization of the Unicode code space; principles used to allocate
and unify characters; encoding forms including definition of UTF-8,
UTF-16, UTF-32 and when to use each; how to use byte order mark;
combining characters and equivalent code sequences equivalent; format
characters and other special characters and code points; organization
of the Unicode Standard. Part II of the Unicode tutorial is
recommended for anyone interested in a systematic overview of the key
aspects of the standard. Detailed technical or programming experience
is not required. |
|
|
Presenters:
Elsebeth Flarup
Globalization Architect
IBM Corp.
Soeren Bendtsen
Advisory IT Specialist
IBM Corp. |
Track 2 -
Best Practices in Software Localization Process and Technology
Software localization is often treated as a separate process which
only starts towards the end of the development cycle, and in many
companies the localization group does not function as an integral part
of the development organization. This tutorial uses practical examples
to demonstrate how localization that is built into the process,
starting from the design phase, may help lower cost and improve
time-to-market for localized versions.
Topics include: Building international support into the product from
the beginning; Globalization and localization considerations when
selecting programming platforms, tools and third-party applications;
Globalization verification testing and how to use pseudo localization
effectively; Do's and don'ts when creating translatable text;
Translation file formats – recommendations and handling of
non-standard file formats; Translation file check tools and how they
can reduce translation problems, build issues and test duration;
Translation planning systems, workflow and problem reporting;
Translation verification testing; Source control and change freezes;
Terminology management; Computer aided translation tools such as
translation memory based systems. Demos will be used to
illustrate tools and processes.
|
|
|
Presenter:
Bill Hall
MLM Associates, Inc. |
Track 3 -
Internationalization and Localization Features of Microsoft .NET
The internationalization and localization paradigm of Microsoft
.NET is quite different from the older Win32 model, mandating a change
of viewpoint and a new set of skills for programmers, localizers, and
architects. In Windows you depend on a loose collection of API’s and
manifests to take care of globalization issues and managing resources.
Now your tools are globalization and resource classes and related
enumerations and the system is entirely Unicode based. If you
are considering a project in .NET, take the time to learn the basics
about how the globalization features are organized and managed. You
will realize that .NET is quite well designed, and practically any
developer can learn the subject well enough to do the right thing most
of the time. Indeed, thanks to the .NET model, most everything happens
automatically and you can rely on the default behavior most of the
time. But, if needed, you will also learn to recognize when an
override is necessary. You may not emerge as an expert, but you will
know the basics. You will be especially interested in the features of
.NET 2.0 including the ability to modify locales and applying strongly
typed resources. |
|
|
|
|
|
15:30-15:45 |
Afternoon
Refreshments |
|
|
|
|
15:45-18:00 |
AFTERNOON
TUTORIALS |
|
Presenter:
Asmus Freytag
President
ASMUS, Inc.
|
Track 1 - Unicode 5.0 Tutorial: Part 3 - Unicode
Algorithms
The Unicode Standard and related specifications by the Unicode
Consortium specify a number of algorithms. The specification of these
algorithms in the Unicode Standard depends on the Unicode Character
Properties. Part III of the Unicode 5.0 Tutorial surveys the
algorithms specified in the Unicode Standard, and extends the
discussion of Unicode character properties as they relate to each
algorithm. Part III covers many general aspects of Unicode algorithms:
Unicode Algorithm and the difference between an abstract algorithm
from an actual implementation; relation between algorithms and Unicode
Character Properties; techniques to access character properties.
Several algorithms are discussed in more detail for example: Unicode
Normalization and the requirements it addresses, including a
discussion of the Unicode Normalization forms NFC, NFD, NFKC, NFKD,
their interaction with the Web and what programmers need to know in
applying normalization; the Unicode Bidirectional Algorithm, and its
interaction with text layout; text boundary determination and
character foldings and much more. Part III of the Unicode 5.0 Tutorial
is more detailed and will touch on the description of algorithms and
other material that may require some familiarity with technical
concepts. |
|
|
Presenter:
Eva Sanchez
Director of Project Management
Basis
Technology
|
Track 2 -
Managing Internationalization Projects
This tutorial describes the life cycle of an internationalization
development project. It will commence with the Planning phase,
followed by the Execution Phase and finish with the Delivery Phase and
preparation for future releases. The goal of this tutorial is to
prepare the project management team as well as engineering and QA
management teams to anticipate the specific challenges of an
internationalization project and acquire mitigation strategies to
ultimately manage to completion a successful project. The
tutorial will not focus in one specific case study but the presenter
will bring real life examples into the discussion. The intended
audience is project managers, engineering managers, QA managers and
product managers. The level is Beginner/Intermediate
|
|
|
Presenter:
Pierre
Cadieux
President
i18N Inc. |
Track
3 - Making Sense of Oracle Character Sets and
Length Semantics
Everything you need to know to work with Oracle character sets.
A new model of Oracle character sets is presented, involving five
character sets: database, national, client, and more! The model
is mapped to Oracle usage in C/C++/Java/.NET. It is then used to
explain the subtleties and pitfalls of Oracle transcoding.
Numerous transcoding scenarios are illustrated visually with the
model, as are the various parameters controlling SQL literal
transcoding and Oracle's "form-of-use". Length
semantics are then introduced along with the related SQL and PL/SQL
functions. Finally, with all these features understood, the
presentation finishes by discussing the pros and cons of the various
ways of implementing Unicode in Oracle. |
Thursday, November 16, 2006
|
09:00-09:15 |
WELCOME & OPENING REMARKS |
|
|
Mark Davis
- President, Unicode Consortium |
|
|
|
|
09:15-10:00 |
KEYNOTE -
One Laptop per Child – $100 Laptop
Nicholas Negroponte
Chairman, One Laptop per Child and Chairman
Emeritus, MIT Media Laboratory |
|
|
|
|
10:00-20:00 |
EXHIBIT AREA OPEN |
|
|
|
|
10:00-10:30 |
Morning Refreshments in Exhibit Area |
|
|
|
|
10:30-11:20 |
SESSION 1 |
|
|
|
|
Presenter:
Richard Ishida
Internationalization Activity Lead
W3C |
Track 1 - Internationalizing Style on the Web:
What's Planned for CSS3
Work on CSS3 is getting nearer to completion and will
provide a huge leap forward for rendering and styling
non-Western text. Many of the innovations are focused
specifically on the needs of Asia and include such things as
vertical text, grid layout, auto spacing, sophisticated
white-space handling and text wrapping, justification, advanced
list styling, styling for ruby, emphasis, kumimoji, and
more. This presentation will show you examples of a number
of these developments, and discuss how we need help to move the
work forward.
|
|
|
Presenter:
Addison Phillips
Internationalization Architect
Yahoo! |
Track 2 - Undetecting the Encoding: Choosing a
Legacy Encoding Algorithmically
Detecting the character encoding used by a block of text is
an important first step in being able to process or display the
text correctly. Frequently, applications will use the encoding
to convert to Unicode in order to enable multi-lingual
support. But what happens when you want or need to convert
a Unicode text stream to a legacy encoding? Which encoding
should you use? How can you reliably detect the "best"
or most widely supported encoding for a given text buffer?
This presentation covers a method for detecting the best legacy
encoding to use when converting a UTF-8 text buffer. It
describes the encoding choices to be made; the reasons one might
want to do this; the interaction between "superset"
and "subset" encodings; and the data structures used
to get peak performance with the least amount of memory. |
|
|
Presenter:
Gwyneth Marshall
International Program Manager, Office
Microsoft Corporation
[ Top ] |
Track 3 - Microsoft Office 2007 – International
Collaboration and Analysis
Building upon the strong support that Office already gives
for worldwide documentation standards, this new version of
Office offers an even better user experience with deeper
international functionality. In this presentation we will
discuss newly supported locales as well as the depth of locale
support across Office documents. Office 12 offers a
language neutral architecture for improved multilingual support,
searching, and deployment, which will be discussed in the
presentation. Of particular interest will be the new file
formats that support easier data entry, retrieval, and
indexing. In addition, we will give an overview of some of
the new international features in Office 2007, including the
Language Reference ToolTip and the English Writing
Assistant. We continue to expand our work with Language
Interface Packs, and will highlight the new UI languages that
will be supported in Office 2007. |
|
|
|
|
111:30-12:20 |
SESSION 2 |
|
|
|
|
Presenter:
Raghuram Viswanadha
Software Engineer
ICU Team
IBM Corp.
[ Top ] |
Track 1 - StringPrep: Unicode in Network Protocols
Network protocols require consistent comparison of strings.
The StringPrep
framework (RFC 3454) facilitates this function. It provides sets
of rules
that can be applied to strings to prepare them for use in any
protocol or
program. Each system sets up a profile of StringPrep by
selecting a set of
rules. Important profiles such as NamePrep, NFS, ResourcePrep,
NodePrep
are explained. The usage of StringPrep and IDNA frameworks is
illustrated
by implementation in International Components for Unicode (ICU). |
|
|
Presenter:
Richard Gillam
Software Engineer
IBM Corp.
[ Top ] |
Track 2 - Parsing Personal Names
Applications that do things with people's names usually need
to parse them.
For example, lists of names are usually sorted by surname, so
you need to
identify the surname. You may also need to remove or otherwise
handle
extraneous components, such as titles, that often travel with a
name but
aren't technically part of it. You may also need to disentangle
records
that combine more than one name in a single string (such as
"John and Mary
Smith"), or identify names with missing components
("John"). And in a
global environment, you need to be able to do these things
correctly with
names from a wide variety of international cultures, each with
its own
conventions for naming people. You may also be faced with a
database of
names that are already divided into given name and surname, but
inconsistently or incorrectly. This talk will discuss all
the various things that make parsing personal names complicated,
taking an in-depth technical look at NameParser, a component of
IBM's Global Name Management suite, and the techniques it uses
for making sense out of personal names from different cultures.
|
|
|
Presenter:
Michael Kaplan
Technical Lead
Microsoft Corporation
[ Top ] |
Track 3 - Keyboards on Win32: Beyond the Keyboard
Layout Creator
Creating Microsoft Keyboard Layout Creator (MSKLC) was a
great step forward for ease of developing keyboards on Windows.
But there are plenty of features and problems (and bugs!) that
are not covered by this tool, some of which are not really
addressable even in future versions as they are core to Windows
itself. These issues can haunt both users and developers alike
unless they are understood, which is where this presentation
comes in -- a review of additional problems and their solutions
to make the keyboarding experience on Windows better for both
users and developers. |
|
|
|
|
12:30-13:00 |
LUNCH |
| |
|
|
13:30-14:20 |
SESSION 3
|
| |
|
|
Presenter:
Steven R.
Loomis
Globalization Center of Competency, IBM
Corp
[ Top ] |
Track 1 - Collation in
ICU
Collation is the general term for the process of determining
the sorting order of strings of characters for a given language.
It is a key function in computer systems; whenever a list of
strings is presented to users, they are likely to want it in a
sorted order so that they can easily and reliably find
individual strings. It is also crucial for the operation of
databases, not only in sorting records but also in selecting
sets of records with fields within given bounds.
It is quite tricky to get collation to work correctly for many
languages, and even more difficult to do it with the speed
demanded by customers. Luckily, the ICU library provides a
high-performance, full-functioned implementation of
international collation, one that is used in IBM products and
can be freely used in any other product. This presentation will
review the capabilities of ICU collation and illustrate what can
be done with it. |
|
|
Presenter:
John O'Conner
Technical Writer
Sun Microsystems
[ Top ] |
Track 2 - Character Conversions from Browser to
Database: or There and Back Again
This presentation considers the various points of character
set conversions that may occur in a typical web based
application. The session describes the path of character data as
it travels from browser form, through 2nd tier business logic,
and to its brief resting place within a database. Then it
follows the data on a return trip to the browser. All along,
common pitfalls and conversion errors will be described. |
|
|
Presenter:
Han-Yi Shaw
Lead Program Manager Macintosh Business
Unit Microsoft Corporation
Top |
Track 3 - Unicode in Office for Macintosh: Past,
Present, and Beyond
Microsoft Office 2004 for Macintosh represents the first
"Unicode-Throughout" version of Office for Mac with
significantly improved character fidelity and layout
compatibility. This presentation will provide a history on the
evolving customer demands for Unicode support in Office for
Macintosh and how the teams at the Macintosh Business Unit have
endeavored to deliver on the promise of a “Unicode-Throughout”
suite of application with the release of Office 2004 for
Macintosh. Additionally, the presenter would also like to
exchange ideas with leaders from the community, and collect
feedback on our Unicode efforts for the next version of Office
for Mac.
|
|
|
Presenter:
Adil Allawi
Technical Director Diwan Software Limited
Top
|
Track 4 - Unicode in Education
One of the biggest obstacles to a developing country is the
printing and distribution of educational textbooks in its native
language. There are many advantages to using computers in place
of paper book. Yet converting complex language textbooks to
electronic form can be a non-trivial task. My company developed
Islamic studies e-books for the Saudi ministry of education and
educational electronic text books for several Arab countries.
This paper gives a case-study of problems encountered and the
solutions that were applied. While the examples given are for
Arabic, the issues are relevant to many developing countries.
|
|
|
|
|
14:30-15:20 |
SESSION 4 |
|
|
|
|
Presenter:
Marin Millar
Globalization Manager
Microsoft
Corporation
Top |
Track 1 - Building Multilingual Websites with
ASP.NET 2.0
This talk will cover the new designer features and resource
architecture for localization in ASP .NET 2.0. It will show how
to generate resources using the designer. It will also explain
new methods for detecting the browser language, the declarative
localization model, resource extensibility, and multilingual
website deployment. |
|
|
Presenter: Tex Texin
Internationalization Architect
Yahoo!
Top |
Track 2 - How to be a CSI (Encoding Crime Scene
Investigator)
Join the CSI team, Grissom, Willows, Sidle, Stokes, et al.
in the forensic analysis of character encoding crimes.
This presentation will elaborate on the techniques of CSI
forensic analysis and its application to debugging character
encoding problems in software and web applications.
Several example problems will be diagnosed. |
|
|
Presenter:
Andy
Abbar
Director International Strategic
Initiatives Microsoft Europe
Top
|
Track 3 - Local Language Program
The importance of language drives Microsoft’s
global product strategy. The Local Language
Program is a global initiative to provide
localized desktop software and tools to
customers through collaboration with local
governments and language experts to build a
local IT economy. The Local Language Program
has three pillars (language/culture,
technology and community) and serves to
bridge the language and digital divides
between emerging and developed markets,
preserve the language and culture. I will
share the approach, success stories and our
plans for Windows Vista and Office2007. |
|
|
Presenter:
Waris Abdukerim Janbaz
Researcher &
Ph.D. Candidate
University of Paris VIII
|
Track 4 - Web Development Considerations
for Unicode-based Text Processing in Uyghur Language
This presentation focuses on text processing and accessibility
of an agglutinative Turkic language -- Uyghur. It describes the
basic concept of Uyghur Unicode font developing, character
displaying and character inputting methods within Uyghur-support-less
environment. This article also highlights the problems caused by
the absence of two Uyghur characters in the Unicode Standard's
table, and more importantly, the article proposes solutions to
overcome such incompleteness. |
|
|
|
|
15:20-16:00 |
Afternoon
Refreshments in Exhibit Area |
|
|
|
|
16:00-16:50 |
SESSION 5
|
|
|
|
|
Presenter:
Evan
Gerber
User Experience
Consultant Molecular
Top |
Track 1 -
Ich bin eine Website – The Impact of
Language and Culture on Internationalization
& Localization
In this session, Evan Gerber draws off of
practical experience on multiple
internationalization projects to clearly
illustrate the process and pitfalls inherent
in developing for multiple cultures and
languages. Attendees will garner
practical insights on the challenges,
strategies, and lessons learned which are
critical to the success of any
internationalization project. The
discussion moves through all phases of
software development, from team building to
ongoing maintenance. A comprehensive
overview of the impact of language and
culture on user experience and technical
architecture ensures that participants are
aware of the big picture impact.
Exploring specific tools in detail provides
tangible knowledge which can be applied
immediately. |
|
|
Presenter:
Richard Gillam
Software Engineer
IBM Corp.
Top |
Track 2 - A Forest of Tries
Most text-processing applications have to look up
arbitrary-length
character strings in lists, either to test for membership in a
group or to
look up additional information about a particular string. The
trie is a
classic technique for representing collections of strings,
offering a good
combination of memory efficiency and lookup performance. This
talk will
examine the basic concept of the trie and look at various
implementations--
from simple to complicated-- of tries that optimize for
different types of
list and application.
|
|
|
Presenter:
Russ Rolfe
Lead Program Manager
Microsoft
Top
|
Track 3 - Windows Vista, An Ever Expanding View of
Internationalization -- Updated
Windows Vista expands upon the foundation created in Windows
2000 and Windows XP to support users' Internationalization
needs. In this presentation, we will introduce the newly
supported locales, user input options and the extended font
coverage. Then we will show how the OS's localized User
Interface will be wholly supported with the Multilanguage User
Interface (MUI) technology introduced in Windows 2000 and
improved in Windows XP. Plus, we will discuss how Language
Interface Packs (LIPs) will be used to broaden the localized
language coverage. Finally the concept of custom cultures will
be presented. |
|
|
Presenter:
Toshiya Suzuki
Research Assistant Hiroshima University Top |
Track 4 - Investigation of Encodings in Legacy
Khmer TrueType Fonts and Possibility of Auto
Encoding Detection
In spite of the standardization of character encoding
implementation of OpenType based Unicode rendering system, still
legacy character encodings are widely used for South or
South-East Asian scripts to work with systems lacking
intelligent renderer. Although some scripts have de-jure
or de-facto standards, other scripts have no standards at
all. If the document uses such un-standardized fonts, the
document lacks the correct delcaration of encoding. Taking Khmer
script as a typical example, we investigate the encodings in the
freely distributed Khmer TrueType fonts and propose algorithm to
identify which encoding is used in the fonts.
|
|
|
|
|
17:00-17:50 |
SESSION 6
|
|
|
|
|
Presenter:
Christian Donner
Director, Distributed Systems
Aviva Life Insurance Company
Top |
Track 1 - A
Generic Content Localization Taxonomy
If you are planning to go international with your web
presence, or if you are planning to consolidate a number of
local websites into a single international web presence, the
translation of your content is a secondary concern. Before you
can start translating, you must define how content is structured
and where it will live, so that it can be targeted at the right
audiences in the right markets. In this session, Christian
Donner presents a generic approach for establishing a content
localization taxonomy that can be customized for your individual
circumstances. |
|
|
Presenter:
Michael Kaplan
Technical Lead
Microsoft Corporation
Top |
Track 2 - Sorting It All Out: More Words on
Collation
In a properly globalized product, users will have properly
collated data-e.g., in the file system, in a database, in an
e-mail address book. How should implementers go about ensuring
culturally-correct collation in a product? What are the
important linguistic issues of collation, and how do they
manifest themselves in technology? This presentation goes beyond
the basic tenets of collation in language, and really shows how
collation functions are used (using examples from the Win32
API). It will also touch upon best (and worst) practices. |
|
|
Presenter:
Ienup Sung
Staff Engineer
Sun Microsystems, Inc.
Top
|
Track 3 - Unicode Transition Planning for Solaris
Ubiquitous Unicode / ISO/IEC 10646 is one of the most
successful character set standards that is widely accepted and
implemented by numerous modern information technology standards
and products. The Globalization group at Sun believes that
adopting the Unicode as the default character set at Solaris
will benefit our customers and also Sun. This technical
presentation and demonstration outlines the latest Unicode
Transition Planning for Solaris at Sun and asks for your
comment. |
|
|
Presenter:
Thomas Milo
President
DecoType
|
Track 4 - Tasmeem - Ultimate Arabization
DecoType and WinSoft have joined forces to build the first
ever complete implementation of ACE technology, dovetailing it
into the Adobe Creative Suite. The result, dubbed Tasmeem (in
Arabic meaning both Design and "Tenacity in Pursuing a
Project"), realizes everything - and more - that was ever
rumoured about DecoType. ACE technology was created from
scratch to meet both Arabic and Unicode requirements without
compromise. The presentation will consist of a brief
summary of the concepts and a demo of the most striking features
of Tasmeem and their relevance for Unicode Arabization.
|
|
|
|
|
18:00-20:00 |
CONFERENCE RECEPTION (IN EXHIBIT AREA) |
Friday, November 17, 2006
|
09:00-09:45 |
KEYNOTE -
Where Would We Be Without UNICODE? |
|
Moderator: Dr. Richard Soley
OMG
Panelists:
Mark Davis
UNICODE
Richard Ishida
W3C
P. J. Plauger
Dinkumware, Ltd.
Erkki Kolehmainen
CSC-Finnish IT Center for Science |
In this Keynote Panel, leaders of standards organizations will
explain how they rely on the Unicode standard to forge ahead with
their own standards, and how the future of Unicode, in
internationalization, localization and globalization, will support
their own standards programs.
Other industry experts will detail how Unicode is key to enabling
new capabilities and relate the business reality of supporting
additional languages. Finally, the panel will discuss trends in
upcoming standards and new technology that affect
internationalization. |
| 09:45-10:00 |
Morning Refreshments |
| |
|
|
10:00-10:50 |
SESSION 7 |
|
|
|
|
Presenter: Mark Davis
President,
UNICODE Consortium
Top
|
Track 1 - What's New in Software Globalization
Standards
This presentation provides an update on the latest
developments in software globalization from the Unicode
Consortium, summarizing the most important changes in the Unicode
character encoding standard, related globalization standards and
specifications, the locale data repository (CLDR), etc. It also
describes important related developments from the IETF, ICANN, the
W3C, and others. |
|
|
Presenter:
Richard Gillam
Software Engineer
IBM Corp. Top
|
Track 2 -
A Brief History of Character Encoding
Unicode didn't just spring fully-grown from the forehead of
Zeus; it has many historical antecedents and its design is firmly
grounded in existing practice. In fact, to understand
Unicode fully, it's necessary to understand where it came from.
This talk will trace the history of character encoding in
information technology from its roots in the Morse telegraph code
through to Unicode. It will cover the Morse and Baudot telegraphy
codes, IBM's BCD punched-card code, ASCII and EBCDIC, the ISO 2022
encoding scheme, the ISO 8859 family of encodings, and various
other encodings, ending with the early history of Unicode and ISO
10646.
|
|
[10:00-11:50]
Moderator:
Steven R. Loomis
Software Engineer
Globalization Center of Competency
IBM
Corp.Panelists:
Mark Davis
Google
Mark Garrett
ModernGigabyte LLC
Tex Texin
Yahoo!
Ram Viswanadha
IBM Corp.
Daniel Yacob
The Ge'ez Frontier Foundation
|
Track 3 - Panel: Deploying the Common Locale Data
Repository
The Common Locale Data Repository is a project for the
exchange of
culturally sensitive (locale) information used in application and
system development, and to gather, store, and make available such
data. By pooling resources, the time and expense of collecting
good
data can be minimized. As well, minority languages and small
countries have a focal point for submitting data. This session
will
go into detail regarding exactly what types of locale information
are
available in CLDR, how the data is to be used, and how the CLDR
vetting process works to ensure the quality of data. Panelists
will
then discuss how they are making use of CLDR data, the latest
project
status, and issues in the collection and production of data. The
panel will consist of persons from multiple vendors involved in
deploying CLDR in their own products and projects, as well as
those
involved in the data gathering and vetting process. Comments and
questions will be welcomed from the audience.
|
|
|
|
|
11:00-11:50 |
SESSION 8 |
|
|
|
|
Presenter:
Charles Hornig
Globalization Architect IBM Corp.
Top
|
Track
1 - Advanced Java Globalization
This presentation describes how to improve the level of global
support of Java™ code from "works most of the
time" to "works all of the time". The focus is on
some specific areas we have identified in reviews of existing
code. For each topic, examples of how to code to avoid problems
and how to test for them will be provided. The presentation is
intended for anyone responsible for producing Java applications to
be used in a multinational environment. Attendees should be
familiar with the Java programming language, basic globalization
concepts, and basic Unicode concepts. |
|
|
Presenter: Michael
Alford
Senior Architect Travelport, Inc. |
Track 2 - Internationalization of the Travelport
B2C Platform
Travelport’s U.S.-only travel commerce platform that runs
Orbitz.com and CheapTickets.com will be internationalized in 2006
to support the hosting of many brands in different locales and
currencies simultaneously on the same server instances. We
investigated dates, times, time zones, currency, distance, string
comparisons, character sets, geography, white-labeling, and
e-commerce concerns for this Java-based platform. This case
study of the first phase of the project (the deployment of
Ebookers.co.uk) will discuss the key technology choices made
during the project, evaluations of Java and open-source
internationalization components, and enforcement of best practices
within application code.
|
|
|
|
Track 3 - Panel: Deploying the Common Locale Data
Repository
(Cont’d)
|
| |
|
|
12:00-13:00 |
LUNCH |
| |
|
|
13:00-13:50 |
SESSION 9 |
|
|
|
|
Presenter:
Dale Schultz
Globalization Test Architect
IBM Corp.
Top
|
Track 1 - The Why, Who, When and How of Efficient
Globalization Testing
A critical part of software development is the testing of the
product. Testing the globalization aspects of software is
challenging because test staff may not be familiar with the
details of the many functional requirements of globalized
software. This presentation will describe the new approach being
used by IBM to enable test teams to determine what needs to be
tested for adequate coverage and how the testing can be performed
in an efficient manner. It will describe the best practices for
who should do the testing and when. It will also identify some
common techniques that should be avoided. |
|
|
Presenter:
Mark Davis
Google
Top
|
Track 2 - Unicode in Google
Google makes extensive
use of Unicode in all of its products. For example, all web pages
-- no matter what their original encodings -- are mapped to
Unicode for processing. This presentation will discuss some of the
uses of Unicode in various Google products, and some of the
challenges involved in processing Unicode on an extremely large
scale. It will also discuss some of the approaches to
internationalization that have been found to be particularly
effective.
|
|
|
Presenters:
Fred Susi
Senior Product Manager
John Fay
Senior Product Specialist
Zebra Technologies Group
Top
|
Track 3 -
Embedding Unicode Support Directly into a Printer: A Case Study
The Unicode Standard unified how to encode characters in
computer equipment. However, it is left to a rendering device to
determine the size, orientation, and placement of glyphs. The
rendering device may even be required to choose between a series
of glyphs. Furthermore, with many pre-Unicode applications/systems
still in place, an output device (printer) must be able to
interpret and switch between the various Unicode and Legacy
encoding schemes to be commercially successful. This presentation
will discuss the issues and solutions uncovered while developing
one of the first Unicode-enabled printers that can transform
encoded data streams into human readable output. |
|
Moderator:
Deborah Anderson
UC BerkeleyPanelists:
Daniel
Yacob
Director
Ge'ez Frontier Foundation
David Germano Associate
Professor University of Virginia
Leonard Muellner Professor
Brandeis University
Neel Smith
Associate Professor
College of the Holy Cross |
Track 4 - Panel:
Unicode and the "Lesser
Known" Scripts: Historic and Minority Scripts in Unicode, New
Developments and Requirements
As the universal character set, Unicode aims
to cover the historic scripts of the world as well as modern
scripts. The historic scripts are of interest to students and
scholars, and also play a critical role in preserving our world
cultural heritage. Many are used in modern liturgical contexts.
This panel will highlight the new directions Unicode is providing
for currently encoded scripts, addressing such questions as: how
has Unicode influenced teaching and publishing? Has Unicode made
the material more accessible to a wider audience? What outstanding
problems and issues remain? For the over 40 unencoded minority
scripts, two questions will be raised: why is encoding these
scripts important and how the government can assist. |
|
|
|
|
14:00-14:50 |
SESSION 10 |
|
|
|
|
Presenter:
Charles Hornig
Globalization Architect IBM Corp. Top
|
Track 1 - Globalization Testing using Eclipse™
This presentation describes how to use the Eclipse Test and
Performance Tools Platform (TPTP) to build globalization tests.
TPTP is an open-source platform for testing tools. Among many
other capabilities, it provides frameworks for both static and
dynamic code analysis. IBM has used these frameworks to build a
suite of tests to detect common globalization coding errors. This
presentation will describe how this was done and how you can build
tests tailored to your own needs. The presentation is intended for
those developing globalization testing plans and tools. Attendees
should be familiar with basic globalization and Unicode concepts. |
|
|
Presenter:
Jennifer DeCamp
Foreign Language Technology Program
Manager
MITRE
Top
|
Track 2 -
Implementing Multilanguage Databases
While Unicode has significantly simplified the problems of
storing and searching multiple scripts, the use of Unicode is
often not sufficient for many applications, particularly
applications involving extensive legacy data. There are
applications in which it is necessary to know the language as well
as the script, but where language identification software is not
effective. There also are issues with languages for which Unicode
encodings are not yet available, display systems for languages
with complex scripts, Unicode-encoded fonts, and input methods so
that users can see and input data. Jennifer DeCamp will discuss
the issues and present solutions that are being used by the Board
of Geographic Names in their move to providing data in multiple
languages and scripts. |
|
|
Presenter:
John Harvey
Engineer
Apple Computer
Top
|
Track 3 - International Features of Mac OS X
Mac OS X is a modern, robust, Unix-based operating system.
This session covers the international capabilities of Mac OS X
primarily from an end user perspective, with a particular emphasis
on new features in Mac OS X 10.4 Tiger, the latest version. Topics
covered include supported languages, input methods and keyboard
layouts, locales, font technologies, and user customization.
Topics of interest to software developers and language experts
will also be considered. |
|
|
|
Track 4 - Panel: Unicode and Minority/Historic
Scripts; New Directions (Cont’d) |
|
|
|
|
14:50 – 15:10 |
Afternoon
Refreshments |
|
|
|
|
15:10 - 16:00 |
SESSION 11 |
|
|
|
|
Presenter:
Addison Phillips
Internationalization Architect
Yahoo!
Top
|
Track 1 - The Theory and Practice of Pseudo-Translation
Pseudo-translation is the use of a program to alter software
resources "as if" they had been translated. Usually this
takes the form of taking unaccented English letters and replacing
them with non-ASCII characters. Pseudo-translation can be used to
test the "localizability" of a piece of software or the
adaptability of the display to non-English characters, fonts, and
so forth. This session will present the various uses that an
organization might consider for pseudo-translation, along with the
in-depth examination of a pseudo-translation system developed by
the author. Some of the use cases covered will including visual
and functional testing, generating "global" test cases
from English-centric cases, and character encoding test
generation. |
|
|
Presenter:
Nelson Ng
Chief Globalization Architect
eBay Inc.
Top
|
Track 2 -
A Case Study of eBay UTF-8 DB Migration
The eBay Marketplace is a global trading platform – 193
million people in more than 150 countries are registered eBay
members. The eBay Marketplace has a site presence in 33
markets around the world, 24 of which have country specific
marketplaces supported by a single code base that supports eBay
members worldwide. This presentation will cover the
migration of the eBay Marketplace database from ISO Latin 1 to
UTF-8. Nelson Ng, Chief Globalization Architect at eBay,
will cover the inline approach used by eBay to address the
challenges of migrating over 100 databases with 18+ Billion
textual data records while keeping the entire site online. In addition to the preparation and execution of this migration
approach, Nelson will cover alternatives considered and lessons
learned. |
|
|
Presenters:
Naoto Sato
Java Internationalization
Engineer
Sun Microsystems
Craig R. Cummings Principal
Software Engineer
Oracle Corporation Top
|
Track 3 - New Internationalization Features of the
Java Platform - Java SE 6
See what internationalization features are planned for the
next version of the Java Platform -- Java SE 6 (codename Mustang). Highlighted in this session will be features such as
Locale Sensitive Services SPI, Normalizer API, ResourceBundle
enhancements, and new Japanese calendar support. It also gives you
a sneak peek of the latest planned features beyond. |
|
|
Presenter: Kuang-Hui Chiu Associate
Professor
National Taipei University
Chi-Ching Hsu
Standard and
Business Planner
IBM Taiwan Corp. Top |
Track 4 - Chinese Dilemma: How Many Ideographs are
Needed?
A Chinese proverb says that, "great
quantity is equal to beauty". This presentation will apply
this adage and describe what scenario may occur if the proverb is
applied to the information technology industry and the question of
how many ideographs are needed. It will explain the dilemma of
whether to accept the niche market requirement to collect
and display all ideographs or follow the more common users'
requirement to provide a basic set of ideographs only. |
|
|
|
|
16:10 - 17:00 |
SESSION 12 |
|
|
|
|
Presenter:
Pierre
Cadieux
President
i18N Inc Top |
Track 1 - The Localization Process and Applicable
Standards
A high-level architectural model of the
localization process is constructed,
describing the steps required for scalable
and sustainable multilingual
deployment. The model is built
graphically, item by item, starting from
source content creation through change
detection, text extraction, leveraging,
workload, translation, review, etc. to
produce localized content in several
languages. The main characteristics of
each step, and applicable software
components, are discussed. The model
then serves as backdrop to discuss which
localization standards exist (or are under
development) to support the localization
process. You will learn where and how
standards such as JCR, XLIFF, TMX, TWS, GMX/V
and others, interact. |
|
|
Presenter: Charles Hornig
Globalization Architect IBM Corp. Top |
Track 2 - Overhauling a C++ Application to Use
Unicode
This case study describes the process of
incorporating Unicode functionality into an
existing multi-platform C++ application,
from design through application delivery.
The focus is on the technical challenges and
how they were met (or not). The topics
discussed include: compatibility with
existing data and interfaces; divergent
Unicode models on different operating
systems; dealing with relational databases;
dealing with third-party libraries; the
operating system interface; choosing what to
test. The presentation is intended for those
considering globalization improvements to
existing applications. Adding Unicode
functionality can add years to the life of
an application at much less cost than a
complete replacement. |
|
|
Presenter: Michael
B. Toth
Program Manager R.B.Toth Associates
Doug
Emery
Data Manager
Emery IT Top |
Track 3 - From Parchment to the Web: Integrating
| | |