Recently, I was involved in several “Human Rights by Design” discussions with fellow UX designers and product managers. One of the most fascinating topics was Artificial Intelligence (AI) ethics, with a focus on accessibility, universal design and digital inclusion. The conversations were informative, engaging and thought-provoking, so I decided to dig deeper and share my thoughts and findings on how to establish best practices in AI ethics with regard to digital accessibility and usability.
As society enters a new chapter, also known as the Fourth Industrial Revolution, AI is widely seen as rapidly changing the accessibility world. Some believe it makes the web more accessible. Others remain sceptical of AI algorithms and technological change. Whatever your view, AI products are already changing the way we interact, learn and communicate.
In order to explore the ideas of how to create an inclusive digital future, it is vital to understand the technologies under AI’s umbrella and how they are related to accessibility. This becomes the first part of my research.
Part I: Terminologies and How AI Technologies are related to Accessibility
The terms used in AI are interconnected and interdependent, and people often confuse them. Let me provide two examples:
a. People ask whether data visualisation should be taken into consideration when discussing accessibility best practices for AI products.
b. People ask whether a product that is advertised as an AI product, but is technically a machine learning enhancement, still needs to be studied as part of the AI ethical framework.
The answer is yes to both questions.
1.1 Definitions
Here is a brief explanation of the key terms. I have picked only the few that are essential to this article. For in-depth information or case studies, please check out the Elsevier AI Resource Center or other relevant scientific research.
Machine Learning (ML) is a subset of AI that trains a machine on how to learn and what to learn from data. It can support decisions and make predictions through trained models such as neural networks.
A Neural Network is a series of algorithms, inspired by the human brain, that sits behind many machine learning outcomes.
Deep Learning is a subset of machine learning, where neural networks are trained with large amounts of data so computers can perform tasks like speech recognition and image auto-tagging.
Natural Language Processing (NLP) is a subset of AI that uses machine learning to derive meanings from human languages so computers can handle tasks like auto-captioning and auto-translating.
Data Visualisation is used to display machine learning results. This is the quickest way to visually summarise the information so these trends, patterns and relationships can be easily digested and understood.
Artificial Intelligence is a broad science of copying human abilities and a collection of technologies that extract insights and patterns from an astronomical amount of data.
1.2 How AI Technologies can benefit the Accessibility Field
Many of today’s most talked-about AI-powered technologies have been driving digital inclusion. Some are developed specifically with disabled persons in mind. Others are created to improve digital inclusion in general. These include, but are not limited to:
Features
- Voice control of computers and mobile phones for persons with mobility and physical impairments
- Adding automatic translation and caption content for persons with deafness and hearing loss
- Recognising images and adding alt text for persons who need screen reader support
- Shortening and summarising articles for persons with reading difficulties
- Delivering facial recognition to tackle the authentication challenges for persons who find it hard to manage various passwords
Tools
Machine learning can improve automated accessibility testing, help product owners manage their accessibility status better and eventually make the content more accessible. For instance, Deque Systems, Inc. has refined automated accessibility testing by leveraging Machine Learning technology in its axe beta (formerly axe Pro beta).
What’s more, AI can establish a solid foundation for future accessibility improvement. As an example, Microsoft® launched an AI for Accessibility grant program with the aim to encourage developers to create products which can help persons with disabilities using Microsoft’s AI tools.
1.3 The 4th Industrial Revolution and Inclusive Attitude
Apart from these well-known positive uses that come with AI developments, there are boundless possibilities the technologies could bring in the future. The 4th Industrial Revolution contains a number of technologies which can be used in physical, digital and biological worlds, and AI is the key driver of it.
As more and more AI technologies get integrated into digital product offerings, organisations will inevitably encounter debates on AI ethics and accessibility. In particular, this will be the focal point in the companies using machine learning in customer-facing programs or having business interests in the public sector. For instance, parents might ask schools how they can ensure that students with disabilities can access AI-powered virtual learning environments.
Looking back at history, the Industrial Revolutions have brought many benefits to humanity over time, but of course, it wasn’t all smooth sailing. The transitions also brought about a great deal of disruptions — the key to getting the best out of tech is to handle it with an inclusive attitude, caution and consideration for those affected. This leads to my second part of research — what’s the current status of AI ethics discussions and what design standards are most suited to ensure AI ethics in the accessibility area.
Part II: AI Ethics and Human Rights by Design
I understand AI ethics as part of the moral behaviour around the creation and use of emerging technologies. It means putting people first and society first, and holding product owners accountable for the AI products they release. AI ethics are by no means designed to slow down the rate of innovation. Instead, they are raised to make humans more responsible for the development of AI products so that those products transform our lives for the better.
I have been working in the digital content field for over 10 years and have always been passionate about content accessibility and usability. My view on AI-powered digital content is very simple: if the content can be accessed by persons without disabilities, then it should be accessible to persons with disabilities.
2.1 Discussion of Ethical Considerations on AI and Accessibility
There is a range of views about AI ethics and accessibility. Also, there are a large number of principles and high-level statements and position papers out there. Almost all technology giants openly share their thoughts and how they plan to ensure that they are conscious of making smart and ethical decisions. To cite an example, here is the general outlook of Microsoft in relation to how AI benefits persons with disabilities. They have identified the ethical considerations and have placed emphasis on the utmost importance of educating future generations with an inclusive design approach.
“AI and Accessibility: A Discussion of Ethical Considerations”, Microsoft
I support Microsoft’s perspective on design thinking: inclusive design is really the key to ensuring that AI technologies make the digital world more accessible. Human rights can be advanced through the design process. The concept known as “Design for All”, “Inclusive Design”, “Universal Design” or “Human Rights by Design” encourages accessibility for all people, to the greatest extent possible, with human diversity and inclusion in mind.
Having a good understanding of these design principles can help everyone listen, learn and adjust their individual lenses. This in turn will bring a holistic approach to diversity into the product design and development process. It also helps all stakeholders follow the same principles while working with each other.
It is no secret that many organisations are currently developing machine learning algorithms to improve personalised user experiences. This aligns with the ultimate goal of UX designers, whose role is to be the voice of the users and whose mission is to test and incorporate product feedback from everyone, including persons with disabilities, to ensure an optimal user experience for all.
2.2 AI Ethics Guidelines on Accessibility and Universal Design
I believe that accessibility in AI is part of user experience, and all UX designers should be trained and instructed to ensure that the products and services are designed and developed to be accessible to all users.
However, encouraging design for human diversity also requires frequent reviews and input from many sources. It would be unrealistic to expect a UX designer to tackle all accessibility-related issues in the AI field without collaboration and cooperation with all internal and external stakeholders.
For this reason, in my view, it is critical to understand AI ethics guidelines on accessibility, because accessibility in AI is fundamentally an ethical responsibility shared by government bodies, business owners and citizens. There is a need to have a shared framework for everyone to follow.
It is worth pointing out that governments can serve as a channel for “Human Rights by Design”, and they can help define and measure what’s acceptable.
Many governments are putting forward guidelines on trustworthy AI, emphasising that AI products must be lawful (respecting all applicable laws and regulations), ethical (respecting ethical principles) and robust (from a technical perspective while also taking into account the social environment). To cite an example, here is the guideline from the EU.
“Accessibility and universal design. Particularly in business-to-consumer domains, systems should be user-centric and designed in a way that allows all people to use AI products or services, regardless of their age, gender, abilities or characteristics. Accessibility to this technology for persons with disabilities, which are present in all societal groups, is of particular importance. AI systems should not have a one-size-fits-all approach and should consider Universal Design principles addressing the widest possible range of users, following relevant accessibility standards. This will enable equitable access and active participation of all people in existing and emerging computer-mediated human activities and with regard to assistive technologies.” EU Ethics Guidelines for Trustworthy AI
It is important to note that people should be encouraged to share their concerns and provide feedback about AI ethics to government bodies. Many companies that release AI products have internal high-level guidelines and some design principles to follow, but implementing them in concrete and tangible ways will take time. Collaboration between governments, companies and citizens is needed to shape our world for the public good.
2.3 Universal Design, Inclusive Design, Accessible Design: Design for ALL
To pursue opportunities while mitigating the risks, I believe that, as with other digital products, accessibility best practices and the principles of human-centred design must be followed and integrated as part of the AI development process. For us designers, designing a product is not really about expressing ourselves or following our product owners’ direction. Design means that we leverage best practices and envision our products in the context of their users and their environments.
Under the concept of “Human Rights by Design”, Universal Design principles are the most important ones as highlighted in the EU guidelines. Accessibility and usability in the sense of Universal Design refer to the design of inclusive environments, including everything people need to access, such as products, devices, services, digital content, and physical space. In other words, the principles can be used in both a physical world and a virtual world.
When it comes to real-world applications, AI-equipped technology can be used in the customer-facing retail environment to provide different in-store experiences. For instance:
- High-end fashion retailers could install high-tech smart mirror technology, which combines voice and facial recognition when communicating with customers. For a designer, universal design means that the smart mirror needs to be installed properly for all users (for example, with wheelchair access) and that the technologies used take all user types into consideration (for example, a customer who is deaf may not be able to use the voice control feature).
- Supermarkets could have autonomous robots guide customers around the aisles, providing essential inventory information and answering questions. For a designer, universal design means that the robot should be able to communicate with people in various ways. It also needs to be built at a reasonable size so it can navigate around the store.
In relation to the digital world, universal design, inclusive design and accessible design are used interchangeably. They all focus on increasing the accessibility of interactive systems (websites, browsers, tools, and many other digital products) and share a similar design thinking process. But among them, universal design is the term preferred by lawmakers.
The following 7 Principles of Universal Design were conceived by the Center for Universal Design at North Carolina State University, USA, led by the late Ronald Mace.
Equitable Use: The design is useful and marketable to people with diverse abilities.
Flexibility in Use: The design accommodates a wide range of individual preferences and abilities.
Simple and Intuitive Use: Use of the design is easy to understand, regardless of the user’s experience, knowledge, language skills, or current concentration level.
Perceptible Information: The design communicates necessary information effectively to the user, regardless of ambient conditions or the user’s sensory abilities.
Tolerance for Error: The design minimizes hazards and the adverse consequences of accidental or unintended actions.
Low Physical Effort: The design can be used efficiently and comfortably and with a minimum of fatigue.
Size and Space for Approach and Use: Appropriate size and space is provided for approach, reach, manipulation, and use regardless of user’s body size, posture, or mobility.
Keep in mind that, at the micro-level, Universal Design does not urge designers to find one design solution for all. Rather, designers are expected to explore solutions which are more inclusive. For example, an online drag-and-drop exercise that a blind user cannot complete should come with an alternative way to access it. Once the most extreme accessibility issues like this are resolved, designers should be able to step back from individual features, move to a macro-level perspective, and look at the product as a whole.
2.4 AI Ethics and Accessible Future
There are undoubtedly ethics-related questions about AI, such as job changes, bias and privacy. While evaluating these potential adverse impacts, we should always be open-minded about the fact that AI technologies have unlimited possible future uses to advance accessibility. Their responsible implementation is in our hands.
My view on the subject is that innovative technologies can provide new ways to deliver products and services in the future. They can also improve and increase accessibility for people who want equal access. With future 5G connectivity, more and more AI-powered services will become faster, cheaper and more reliable.
This leads to the third part of my research: what sort of AI features or tools are worth further development, how can we leverage these technologies to benefit persons with disabilities, and how can we ensure AI products are perceivable, operable, understandable and robust for all users?
Part III: How to ensure AI-powered Digital Products are accessible to all users
To ensure AI-powered products and their content are accessible to everyone, be it data analytics, a chatbot or an AI-based marketing tool, the first place I would reference is the Web Content Accessibility Guidelines (WCAG) 2.1 international standard. WCAG may not be perfect, address every single disability or cover every single new technology, but it is the most comprehensive and most broadly adopted accessibility standard internationally.
The best practices on the W3C website are a great resource for everyone. They overlap with many best practices on web development such as mobile web design, device independence, multi-modal interaction, usability and design for older users. They also help companies polish their brand image and expand their customer base. I expect that WCAG will cover additional AI products gradually in their future updates. In fact, WCAG 2.2 is coming soon.
Secondly, I would like to share my views on some AI products with regard to best practices and guidelines on universal accessibility. This is not an exhaustive list, but it contains some useful and practical information on the subject. I hope that it can give you some food for thought and support you in your decision-making process.
3.1 The Role of AI in Automated Accessibility Testing and Remediation
There is an unquestionable need to integrate accessibility into the development process. While working on the product, one of the most frequently asked questions from the stakeholders is about automation and process improvement. It has been nearly everyone’s dream to be able to auto test and auto remediate all accessibility defects.
Can we use AI to unlock full accessibility compliance? Let’s discuss.
3.1.1 Machine Learning enhanced Accessibility Scanners
“Easy, fast, cheap and it can detect problems early on” is the general comment people give about automated accessibility testing. Accessibility scanners have been around for a while. Recently, machine learning has drastically improved these tools’ features and made them much more user-friendly, efficient and accurate. The technology can scan through a large number of web pages and documents and identify many defects without manual testing.
It is worth mentioning that automated testing means using an automated tool to execute test cases. It is a technique used to compare the actual outcome with the expected results. Manual testing is performed by a human who sits in front of a machine and carefully executes the test steps. To conduct a comprehensive and rigorous accessibility audit, the most common approach is to combine both methods.
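To make the difference between the two methods concrete, here is a minimal sketch of one automated check, written with only the Python standard library. It is not how any of the commercial scanners work internally; the class name, function name and sample page are made up for illustration. The expected result is "every image has an alt attribute", and the check compares the actual page against it.

```python
from html.parser import HTMLParser


class MissingAltChecker(HTMLParser):
    """Collects <img> elements that have no alt attribute at all."""

    def __init__(self):
        super().__init__()
        self.issues = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if tag == "img" and "alt" not in attributes:
            line, _column = self.getpos()
            self.issues.append({"line": line, "src": attributes.get("src", "")})


def find_images_without_alt(html):
    """Run the check over an HTML string and return the flagged images."""
    checker = MissingAltChecker()
    checker.feed(html)
    checker.close()
    return checker.issues


# Hypothetical sample page: one untagged image, one described, one decorative.
sample_page = """<html lang="en"><body>
<img src="chart.png">
<img src="logo.png" alt="Company logo">
<img src="divider.png" alt="">
</body></html>"""

issues = find_images_without_alt(sample_page)
for issue in issues:
    print(f"line {issue['line']}: image {issue['src']!r} has no alt attribute")
```

The check can tell that `chart.png` has no alt attribute, but it cannot judge whether "Company logo" is a good description or whether `divider.png` is really decorative. That judgement is the part that still calls for manual testing.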
The browser extension is the simplest version of automated testing. All we need to do is install the add-on and click a button. Once the scan is completed, the tool generates a report that flags accessibility issues based on WCAG 2.1 standards. The report explains why the issues have to be resolved and provides guidance on how to fix them. The most well-recognised tools in the accessibility testing area are Microsoft Accessibility Insights, Google Lighthouse, Deque axe, and WebAIM WAVE.
The second type of tool uses APIs from the command line, which requires customisation to an organisation’s development environment. The benefit of implementing an API is that it can cover as many projects as the company needs and it can be integrated with other QA tools, for example Selenium, Cypress, WebdriverIO, Protractor, Tenon and Appium. Organisations can also create their own bespoke automated tools by leveraging APIs or software components. The axe API documentation is a good reference.
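As a rough illustration of what a bespoke integration might look like, the sketch below reads a scan report and decides whether a build should fail. It assumes the report is a JSON object with a `violations` list whose entries carry `id`, `impact` and `nodes`, which is my understanding of the axe results layout; please verify it against the axe documentation before relying on it. The sample data and the choice of blocking impact levels are invented for the example.

```python
import json

# Assumption: only these impact levels should block a release.
BLOCKING_IMPACTS = {"serious", "critical"}


def blocking_violations(report):
    """Return (rule id, affected element count) for violations that block a build."""
    return [
        (violation["id"], len(violation.get("nodes", [])))
        for violation in report.get("violations", [])
        if violation.get("impact") in BLOCKING_IMPACTS
    ]


# Hand-written sample shaped like a scan report (assumed layout, not real output).
sample_report = json.loads("""
{
  "violations": [
    {"id": "image-alt", "impact": "critical", "nodes": [{"target": ["img"]}]},
    {"id": "region", "impact": "moderate",
     "nodes": [{"target": ["div"]}, {"target": ["p"]}]}
  ]
}
""")

failures = blocking_violations(sample_report)
for rule_id, count in failures:
    print(f"{rule_id}: {count} element(s) affected")

# A CI wrapper could pass this value to sys.exit() to fail the pipeline.
exit_code = 1 if failures else 0
```

Keeping the gate separate from the scanner means the same rule can sit behind whichever QA tool produces the report.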
The third type of tool is designed for producing accessible documents. It can be part of the authoring tools, like Microsoft Accessibility Checker or Adobe Accessibility Checker. It automatically checks and remediates issues before users publish or send out their files. The tool is simple to use: just click the Checker from the Tools Panel and follow the instructions.
Another useful tool for creating accessible files is Adobe InDesign, which supports accessible cross-media publications. For instance, to create an end-of-year report with numerous layouts and images, you can use InDesign accessibility features to ensure all elements are tagged before exporting the file to PDF, HTML or ePub3 (eBooks). The tool is not hard to use, but it requires some learning and practice.
Here is a quick demo:
Step 1: Open a blank page with an image in InDesign. Add Alt Text using the Object Export Options panel.
Step 2: Export it in EPUB format, then inspect the element and see if it is successfully tagged.
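If you prefer to script the inspection in Step 2 rather than open the file by hand, the sketch below may help. An EPUB file is a ZIP archive of XHTML documents, so the standard library is enough to list every image and its alt text. The function name and the in-memory demo archive (including the `OEBPS/page-1.xhtml` path) are hypothetical stand-ins; a real InDesign export will have its own file names.

```python
import io
import zipfile
from html.parser import HTMLParser


class ImageAltCollector(HTMLParser):
    """Records the src and alt values of every <img> element it sees."""

    def __init__(self):
        super().__init__()
        self.images = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            attributes = dict(attrs)
            self.images.append((attributes.get("src", ""), attributes.get("alt")))


def list_epub_image_alts(epub_file):
    """Return (document, image src, alt text or None) for each image in an EPUB."""
    results = []
    with zipfile.ZipFile(epub_file) as archive:
        for name in sorted(archive.namelist()):
            if name.lower().endswith((".xhtml", ".html", ".htm")):
                collector = ImageAltCollector()
                collector.feed(archive.read(name).decode("utf-8"))
                collector.close()
                results.extend((name, src, alt) for src, alt in collector.images)
    return results


# Build a tiny stand-in archive in memory so the sketch runs without a real export.
demo = io.BytesIO()
with zipfile.ZipFile(demo, "w") as archive:
    archive.writestr(
        "OEBPS/page-1.xhtml",
        '<html><body><img src="image/cat.jpg" alt="A cat in a garden"/></body></html>',
    )
demo.seek(0)

for document, src, alt in list_epub_image_alts(demo):
    status = f"alt={alt!r}" if alt else "NOT TAGGED"
    print(f"{document}: {src} -> {status}")
```

With a real export you would pass the path to the `.epub` file instead of the in-memory archive.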
In addition to manual remediation, you can ask a third party to help if there is a significant number of documents. For instance, SensusAccess offers a machine-learning-enhanced auto-conversion tool for documents. Upload files of your choice and the tool will convert them into formats like accessible MP3 audio, DAISY full text and audio, Braille, and E-book. The company works closely with various universities.
3.1.2 AI-powered Automated Accessibility Remediation Tools
This type of technology is about creating custom JavaScript overlays or tool-based overlays without having to touch the underlying source code of the website. The most discussed products in the market are Amaze, accessiBe, Mk-sense and EqualWeb. The technology can add missing alternative text for an image, add missing HTML attributes and fix issues with icons, buttons, roles and landmarks. It automatically remediates issues on a website to comply with WCAG success criteria.
These are great tools for product owners and developers to understand the impact of code changes to accessibility and how the products can be improved by enhancing accessibility and empowering all users. In addition to that, the technology provides a short-term solution to organisations which are under pressure to fix accessibility issues overnight but have no development teams immediately available.
The reason why I believe the solution is, at this time, for the short term is that it won’t remediate anything complex. For example, meeting the WCAG requirement “Guideline 2.2 Enough Time: Provide users enough time to read and use content” would still depend on applying code changes to the app itself. A tool-based overlay also requires screen reader users to learn a new approach: they have to navigate a toolbar or get a plugin while JAWS or NVDA is assisting them to read the instructions.
3.1.3 Image Auto-tagging using Machine Learning
This technology is for persons who need screen reader support. It leverages machine learning methods to recognise visual elements within an image, then identifies the patterns and looks them up in a large database to make sense of the image, pinpoint its category and decide what the image is about. Once the machine has auto-tagged the image, a screen reader will be able to read the “alt” text to users who are visually impaired.
It is worth noting, with regard to businesses that possess a huge visual database, this innovative technology has improved businesses’ overall ability to execute, pivot and scale. For instance, it has helped content owners who have never tagged their images in the past to remediate the issue fast. Also, it has provided users with enhanced search performance, as search is normally based on keywords tagged in the content.
One caveat for this method is that for images like medical graphics, scientific illustrations, legal terminologies, and artworks, the content owners still have to do due diligence and take responsibility for the image descriptions, and they should not yet rely solely on machines to fix all their “alt” text problems. This is because tagging medical images with incorrect descriptions, or under insufficient data integrity control, is deemed recklessly risky. For this reason, using this technology wisely, safely and responsibly seems to be the preferred approach among content editors and accessibility experts.
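One way to put "wisely, safely and responsibly" into practice is to route machine suggestions through a simple review rule. The sketch below is only an illustration: the confidence threshold, the category names and the `TagSuggestion` structure are all assumptions of mine, and a real auto-tagging service will expose its results differently.

```python
from dataclasses import dataclass

REVIEW_THRESHOLD = 0.90  # hypothetical cut-off; tune it per content type
SENSITIVE_CATEGORIES = {"medical", "scientific", "legal", "artwork"}


@dataclass
class TagSuggestion:
    image: str
    category: str
    description: str
    confidence: float  # 0.0 to 1.0, as reported by the (hypothetical) model


def needs_human_review(suggestion):
    """Sensitive or low-confidence descriptions go to an editor before publishing."""
    return (
        suggestion.category in SENSITIVE_CATEGORIES
        or suggestion.confidence < REVIEW_THRESHOLD
    )


# Invented sample suggestions, standing in for the output of an auto-tagger.
suggestions = [
    TagSuggestion("garden.jpg", "lifestyle", "A cat playing with two kids in a garden", 0.96),
    TagSuggestion("xray-04.png", "medical", "Chest X-ray", 0.97),
    TagSuggestion("booth.jpg", "lifestyle", "People at a table", 0.62),
]

auto_publish = [s.image for s in suggestions if not needs_human_review(s)]
review_queue = [s.image for s in suggestions if needs_human_review(s)]
print("auto-publish:", auto_publish)
print("review queue:", review_queue)
```

Note that the medical image is queued for review even though the model is confident, which reflects the point that the content owner stays responsible for descriptions in high-risk categories.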
With the data accuracy in mind, while acknowledging the discrepancy between a human and a machine’s ability to appropriately tag may lead to an unintended consequence, the argument of human review cost will never go away. For content owners who have millions of images, this can be prohibitively expensive and impossible to embrace retrospective remediation. Fortunately, with all fairness, both lawmakers and customers understand it, if it is deemed as a disproportionate burden, owners are not forced to retrofit content which is mission impossible to fix, with the condition that alternative access must be provided to users.
考虑到数据的准确性,虽然承认人与机器正确标记的能力之间的差异可能会导致意想不到的后果,但人为审核成本的争论永远不会消失。 对于拥有数百万张图像的内容所有者而言,这可能会非常昂贵,并且无法进行追溯性补救。 幸运的是,出于公平的考虑,立法者和客户都理解这一点,如果认为这是不成比例的负担 ,则所有者不必被迫改造无法修复的内容,而必须向用户提供其他访问权限。
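To make the trade-off between machine tagging and human review more concrete, here is a minimal sketch of how a content owner might route machine-generated descriptions. Everything in it is an assumption for illustration: the `AutoTag` record, the category labels, the confidence score and the 0.9 threshold are hypothetical and do not come from any particular tagging product.

```python
from dataclasses import dataclass

# Hypothetical category labels that mirror the high-risk image types named above.
HIGH_RISK_CATEGORIES = {"medical", "scientific", "legal", "artwork"}


@dataclass
class AutoTag:
    image_id: str
    category: str      # assumed to come from the tagging model or the content owner
    description: str   # machine-generated "alt" text
    confidence: float  # assumed model score between 0.0 and 1.0


def needs_human_review(tag: AutoTag, min_confidence: float = 0.9) -> bool:
    """Return True when a person should check the description before publishing."""
    if tag.category in HIGH_RISK_CATEGORIES:
        return True  # never publish these on machine output alone
    return tag.confidence < min_confidence


def triage(tags: list[AutoTag]) -> tuple[list[AutoTag], list[AutoTag]]:
    """Split tags into (auto_publish, review_queue)."""
    auto_publish = [t for t in tags if not needs_human_review(t)]
    review_queue = [t for t in tags if needs_human_review(t)]
    return auto_publish, review_queue
```

With a rule like this, the human review budget is spent on the images where a wrong description does the most harm, while low-risk images with confident descriptions can be published automatically.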
3.1.4 Key Takeaways on Automation and Process Improvement
The idea of using software to scan and check a considerable number of web pages or documents is well received by accessibility analysts, testers and developers, and I can see such automated testing tools and platforms advancing further. One important caveat is that if our intent is to provide a completely accessible product, automated tools that discover critical issues are not enough: we also need a deeper understanding of the WCAG criteria and have to think about real use cases.
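To show what an automated check can and cannot do, here is a minimal sketch that scans a page for images without an `alt` attribute, using only Python’s standard `html.parser` module. It is not how any commercial testing tool works; the class and function names are my own.

```python
from html.parser import HTMLParser


class MissingAltChecker(HTMLParser):
    """Collect <img> tags that have no alt attribute at all."""

    def __init__(self) -> None:
        super().__init__()
        self.missing: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attributes = dict(attrs)
        # alt="" is valid for decorative images, so only a missing attribute is flagged.
        if "alt" not in attributes:
            self.missing.append(attributes.get("src") or "(no src)")


def find_images_without_alt(html: str) -> list[str]:
    """Return the src of every image in the markup that lacks an alt attribute."""
    checker = MissingAltChecker()
    checker.feed(html)
    checker.close()
    return checker.missing
```

A check like this can only tell that a description exists, not whether it is meaningful, which is why manual review against the WCAG criteria remains necessary.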
In my opinion, automated remediation tools are fantastic and the idea of auto-fixing is fascinating. I will definitely keep an eye on the technology and see where it leads. At this moment, though, I generally recommend using it selectively and developing a full remediation programme alongside it. I believe that the only way to build a quality website is to follow good web development practices. People might be able to use AI to obtain a compliance certificate, but the products still need to be built for real users.
After discussing the pros and cons of image auto-tagging, I think this area shows great potential and I am positive about its future. The current limitations do not mean that auto-tagging will not move forward and deliver further productivity gains. Once the technology matures and becomes more reliable, the accuracy of image descriptions will improve as a result. Imagine uploading an image and having the system automatically add a nice description for you: “A cat is playing with two kids in the garden. The sky is blue, and the grass is green.”
3.2 Automated Decision-Making AI Systems and Accessibility
There is no doubt that data analytics is becoming more powerful thanks to AI. Nearly all industries can benefit from AI-powered predictive systems. For example, car manufacturers can leverage AI to predict maintenance needs, doctors can leverage AI to better coordinate care plans, and students can leverage AI to improve their learning.
The products these people are using are based on AI algorithms. Some of the systems can come to a decision automatically without any human involvement. These decisions can be based on customer information or data collected previously from other sources.
While the process of gathering data may differ from product to product, how can we, as UX designers, make sure that the end products are accessible to all users?
3.2.1 AI-driven Website Builders
This type of product is particularly popular with small businesses and with people who are validating a market opportunity. Business owners can skip the process of hiring designers and simply let the system handle the design and development. As for product quality, most of these builders include some accessibility features, such as an enhanced colour palette, keyboard navigation, screen reader support and mobile-responsive layouts. At this point, the most discussed AI-driven website builders are Wix ADI, Bookmark, Jimdo Dolphin and Firedrop.
How does an AI web builder work? It first asks the user some questions about what they want to achieve. Based on that input, the system uses algorithms to search its database of layouts, styles, content, navigation options, colour palettes and so on. Then, based on the user’s preferences, it creates a unique website, with options to customise.
To make the website as accessible as possible, the user or owner still needs to follow best practices when adding content and other digital assets: for example, adding a meaningful image description for each photo, entering a description in the alternate text field for any equation elements, editing table properties and entering caption text, and making sure that uploaded photos have enough contrast.
On top of that, before setting up the site, the product owner ought to do some research and select the best tool for accessibility purposes. There isn’t a specific standard for AI-powered website builders, but the owner can use the Authoring Tool Accessibility Guidelines (ATAG) to help with the evaluation. ATAG 2.0 provides guidelines for designing web content authoring tools that are accessible to authors with disabilities and that promote the production of more accessible web content by all authors.
3.2.2 AI-powered Data Analytics and Data Visualisation
As I mentioned earlier, data visualisation is the best way to summarise information visually so that trends, patterns and relationships can be easily digested and understood. However, data visualisation (bars, charts, diagrams, maps and interactive infographics) is also one of the main accessibility challenges when presenting AI-powered data analysis outcomes or explaining how a decision-making system works.
Universal Design means that everyone can access the same product without the need for adaptation or specialised design. Put simply, making data visualisation accessible to everyone goes well beyond colour contrast and colour blindness. It also means ensuring that screen reader users can read the graphics.
For a simple image, providing a succinct and informative text description is usually sufficient. For complex graphics, this is not enough. Products must provide the information that the visualisation conveys, such as the values presented, the categories of data shown, and the relationships between the categories.
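As an illustration of conveying values, categories and relationships without relying on sight, here is a minimal sketch that turns the data behind a simple chart into an HTML data table plus a short text summary. The function names and the summary wording are assumptions of mine; a real product would tailor both to the chart type.

```python
from html import escape


def chart_to_table(title: str, category_label: str, value_label: str,
                   data: dict[str, float]) -> str:
    """Render chart data as an HTML table that a screen reader can walk through."""
    rows = "\n".join(
        f'    <tr><th scope="row">{escape(name)}</th><td>{value}</td></tr>'
        for name, value in data.items()
    )
    return (
        "<table>\n"
        f"  <caption>{escape(title)}</caption>\n"
        "  <thead>\n"
        f'    <tr><th scope="col">{escape(category_label)}</th>'
        f'<th scope="col">{escape(value_label)}</th></tr>\n'
        "  </thead>\n"
        "  <tbody>\n"
        f"{rows}\n"
        "  </tbody>\n"
        "</table>"
    )


def describe_chart(title: str, value_label: str, data: dict[str, float]) -> str:
    """Build a short text summary naming the categories and the extreme values."""
    highest = max(data, key=data.get)
    lowest = min(data, key=data.get)
    return (
        f"{title}. {value_label} for {len(data)} categories: {', '.join(data)}. "
        f"Highest: {highest} ({data[highest]}). Lowest: {lowest} ({data[lowest]})."
    )
```

The caption and the `scope` attributes let a screen reader announce which category and which measure each cell belongs to, and the summary gives the headline relationship before the user explores the table.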
Making data visualisation readable by screen reader technology is time-consuming even for the most experienced developers, but no one should skip the work and exclude users who cannot see the graphics. If no development resources are available, or if there is a strict deadline for releasing the product, a practical alternative is to bring in a third-party supplier.
An exemplary company that makes accessible graphics is Highcharts. The product is free to use for personal projects, school websites and non-profit organisations. The company has published best practices for making accessible charts on its blog, including how to use SVG pictures, add text descriptions, duplicate data from charts to tables, apply shades of the same colour, and use the Highcharts Accessibility API.
Every Highcharts licence includes the Accessibility module, which contains many exciting features. Once the exporting and export-data modules are also included, users can tab through a chart with the keyboard, have it read by a screen reader, view the chart as a data table, interact with the chart controls using voice commands, and download an SVG version of the chart to turn into a tactile graphic with an embossing printer.
3.2.3 Challenges and Benefits of Decision-Making AI Systems
Ensuring that decision-intelligence products are accessible to persons with disabilities requires thorough planning and dedicated resources. There must be a meaningful conversation between all stakeholders. An experienced UX designer should be able to take all factors into consideration, bridge the gaps and merge the knowledge from all sides to define the best solution.
Apart from implementing the products carefully and adhering to best practices, it is worth mentioning, for awareness purposes, that automated decision-making based on AI could potentially discriminate against persons with disabilities. Here is some useful information from the European Disability Forum.
“If an algorithm making a decision on the price of insurance policy discriminates against persons with disabilities, they may end up paying more for insurance or be denied cover. There are similar potential risks of discrimination in a wide range of areas: automated screening for recruitment, financial services and so on. While this may be unintentional, AI and other emerging technologies systems are likely to reinforce already pervasive exclusion of persons with disabilities, encouraging misrepresentation of persons with disabilities or other characteristics such as race, age, gender, sexual orientation, religion and so on.”
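One simple way to look for the kind of unintended discrimination described in the quote is to compare how often an automated system returns a favourable decision for different groups. The sketch below is only an illustration under assumed inputs: the `(group, approved)` record format and the group labels are hypothetical, and a gap in rates is a prompt for investigation, not proof of discrimination.

```python
from collections import defaultdict


def approval_rates(decisions: list[tuple[str, bool]]) -> dict[str, float]:
    """Share of positive outcomes per group, from (group, approved) records."""
    totals: dict[str, int] = defaultdict(int)
    approved: dict[str, int] = defaultdict(int)
    for group, outcome in decisions:
        totals[group] += 1
        if outcome:
            approved[group] += 1
    return {group: approved[group] / totals[group] for group in totals}


def rate_ratio(rates: dict[str, float], group: str, reference: str) -> float:
    """Ratio of one group's approval rate to a reference group's rate.

    Assumes the reference group has at least one approval (a non-zero rate).
    """
    return rates[group] / rates[reference]
```

A ratio well below 1.0 for a group would be a reason for the stakeholders to examine which inputs drive the decisions for that group.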
On the other hand, every debate has two sides, and it is important to know that these systems also provide great benefits to all users regardless of their personal circumstances. Here is an interesting perspective on automated decision-making from the Information Commissioner’s Office.
“Profiling and automated decision making can be very useful for organisations and also benefit individuals in many sectors, including healthcare, education, financial services and marketing. They can lead to quicker and more consistent decisions, particularly in cases where a very large volume of data needs to be analysed and decisions made very quickly.”
3.3 Natural Language Processing AI and Its Impact on Communication
Computers have long been able to process and manipulate all sorts of data, but language comprehension and word recognition are a very different world. For a machine to learn, understand and process real-world languages, it has to be trained, and the training process is complicated. It is not a matter of simply recording a huge vocabulary. Instead, the machine has to understand the syntax, semantics, pragmatics and discourse, and the meaning behind the words (the cognitive aspect of language). Training involves collecting unstructured data and using algorithms to find patterns.
To understand how this might affect accessibility, two important terms need to be defined: Natural Language Processing (NLP) and Natural Language Understanding (NLU). The latter is vital to the success of NLP. It gets the machine to comprehend what a piece of text really means and is viewed as the first step towards NLP. To classify unstructured data, NLU identifies the intended meaning from the multiple possible meanings and labels it accordingly. It is also widely used for tasks such as syntax analysis of grammatically correct typed sentences.
As I mentioned earlier, NLP is a subset of AI that uses machine learning to derive meaning from human languages so that computers can automatically handle tasks involving natural language, such as speech and text. NLP works closely with speech recognition and text recognition engines: speech or text recognition is used to enter information, and NLP is used to understand the data and act on it. Chatbots, auto-captioning (voice recognition), Optical Character Recognition (text recognition), auto-translating (machine translation) and writing enhancements are the most discussed applications in relation to accessibility.
3.3.1 Chatbots
A chatbot is interactive software, or a conversational agent, that communicates with a user through a chat window. It imitates human conversation through voice chat, text chat or both. AI chatbots are currently revolutionising the customer service industry. They are available 24/7 and can intelligently perform tasks and solve problems without human intervention. Since more and more customers are using chatbots to get the support they need, making them accessible is imperative, not optional.
To ensure that chatbots are accessible to all users and comply with accessibility guidelines, seven key criteria should be taken into consideration: keyboard navigation, skip to main content, chatbot landmarks, browsing with a screen magnifier, orientation, reflow, and optimal visual design standards (font size, line spacing, word spacing, colour contrast and colour blindness). Chatbots can be accessible if we humans want them to be. For example, Astute announced that its self-service chatbot is digitally accessible for consumers with disabilities.
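Of the criteria above, colour contrast is the easiest to check programmatically. The following sketch computes the WCAG 2.x contrast ratio between two sRGB colours and compares it with the AA thresholds (4.5:1 for normal text, 3:1 for large text). The function names are my own, and the colours in the usage example are hypothetical chat-bubble colours.

```python
def _linearise(channel: int) -> float:
    """Convert one 8-bit sRGB channel to its linear value (WCAG 2.x definition)."""
    c = channel / 255
    return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4


def relative_luminance(rgb: tuple[int, int, int]) -> float:
    """Relative luminance of an sRGB colour, from 0.0 (black) to 1.0 (white)."""
    r, g, b = (_linearise(v) for v in rgb)
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def contrast_ratio(foreground: tuple[int, int, int],
                   background: tuple[int, int, int]) -> float:
    """Contrast ratio between two colours, from 1.0 up to 21.0."""
    lighter, darker = sorted(
        (relative_luminance(foreground), relative_luminance(background)),
        reverse=True,
    )
    return (lighter + 0.05) / (darker + 0.05)


def passes_aa(foreground: tuple[int, int, int],
              background: tuple[int, int, int],
              large_text: bool = False) -> bool:
    """WCAG AA asks for at least 4.5:1 for normal text and 3:1 for large text."""
    return contrast_ratio(foreground, background) >= (3.0 if large_text else 4.5)
```

For example, `passes_aa((0, 0, 0), (255, 255, 255))` is true for black text on a white bubble, while light grey text such as `(170, 170, 170)` on white fails the check.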
3.3.2 Auto-captioning
Automatic captioning technology analyses video and audio content and then automatically transcribes the audio through natural language processing AI. A feature like this helps users who are not native speakers to understand the content better and process the information faster. It lets users on a commuter train watch content privately without headphones. It also helps users who are hard of hearing or deaf to communicate better. In short, it is convenient for people with and without disabilities alike.
For digital products with video and audio content, this is a feature that everyone wants and that product owners need to implement. In fact, it is part of the WCAG requirements (Guideline 1.2: provide alternatives for time-based media). The good news is that most online tools have the feature built in. For example, Microsoft Translator, an AI-powered communication technology, uses an advanced form of automatic speech recognition to convert spoken language; YouTube’s voice recognition technology automates sound effect captions with AI; the conferencing company Zoom has rolled out AI-powered transcripts and note-taking features; Google AI has announced on-device captioning with Live Caption; and Microsoft Teams also offers live captions during meetings.
To make sure the feature is actually used, it is important to remember to turn it on.
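For products that host their own media rather than relying on a platform’s built-in captions, the transcript usually has to be delivered as a caption file. Here is a minimal sketch that writes timed transcript segments as WebVTT, the caption format used by the HTML `<track>` element. The `(start, end, text)` segment format is an assumption about what a speech recognition engine might return, not the output of any specific product.

```python
def format_timestamp(seconds: float) -> str:
    """Format seconds as a WebVTT timestamp (hh:mm:ss.ttt)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{millis:03d}"


def to_webvtt(segments: list[tuple[float, float, str]]) -> str:
    """Turn (start, end, text) transcript segments into a WebVTT caption file."""
    lines = ["WEBVTT", ""]
    for start, end, text in segments:
        lines.append(f"{format_timestamp(start)} --> {format_timestamp(end)}")
        lines.append(text.strip())
        lines.append("")  # a blank line ends each cue
    return "\n".join(lines)
```

Machine-generated segments should still be proofread before publishing, since names, accents and technical terms are where automatic transcripts most often go wrong.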
3.3.3 Screen Readers and Optical Character Recognition (OCR)
WCAG 2.1 guidelines do not only apply to web content; they also cover PDF documents. PDF content can contain essential information, so it needs to be accessible to people who rely on assistive technology. Apart from using Adobe Acrobat and other remediation tools to tag each element and make the files accessible, another interesting technology for reading content is OCR. It draws on text recognition, AI and computer vision, and converts a scanned document or a photo of a document into machine-encoded text.
Optical Character Recognition was first introduced in the screen reader Job Access With Speech (JAWS) 13 in 2011. It allows the screen reader to access any on-screen images that include text and to recognise all of the text in a PDF document. Initially, the technology struggled to recognise text when the quality of the document was not optimal. However, JAWS 2018 drastically improved this functionality, and the recognition can now handle a variety of text styles and sizes. As a best-practice note, technicians should master this technology and make good use of it to serve all customers with an improved digital experience.
Here is how to use OCR with an image.
Step 1: Locate the image
Step 2: Press the JAWS Key + Space, then press O (for OCR) and F (for File)
Step 3: JAWS will display the OCR result and read the text back to the user
3.3.4 Auto-translating
Auto-translation has brought a number of great benefits to linguistic accessibility. It helps remove barriers to intercultural communication, facilitates international growth, and increases the potential of the media to share information in various languages. Human translation is reliable, accurate and of high quality, but it is often out of reach for most people because of the high cost and the difficulty of finding a translator. Over the last 20 years, machine translation has gradually been accepted as part of the online experience.
As our society becomes more and more reliant on AI to advance user experience, machine translation is no exception. Many tech companies are leveraging AI to improve their translation accuracy. The most famous AI-powered neural machine translation system is Google’s. It has collected a large data set and built a system that is sufficiently fast and relatively accurate to provide good-quality translations for users. Compared to what we had 20 years ago, the current system represents a significant milestone.
Six months ago, Microsoft announced a live presentation feature in PowerPoint with live subtitles in more than 60 languages. I was fortunate to preview the feature at a conference. During the call, the presenter asked the audience to scan a QR code, which led us to a web link. The subtitles then appeared on our mobile screens while he was speaking, and a language menu was available for instant translation.
3.3.5 Writing Enhancements
With respect to NLP, another personal favourite of mine is Grammarly. The feature is not subject to WCAG conformance requirements, but it is very helpful to differently-abled people. The system is trained on naturally written text and guides people with and without learning difficulties through the writing process. It can help people improve spelling, punctuation, grammatical structure and logical order. It is similar to the spelling and grammar checkers in Microsoft Word and Microsoft Editor, but with more control and suggestions.
Grammarly’s AI learns from several databases containing large collections of sentences that have been sorted, grouped and labelled. The machine then decides what to suggest and what to correct. It draws conclusions from common mistakes, language patterns and certain language constructions. In other words, the tool provides writing enhancements by leveraging both machine learning and natural language understanding. The software works as an extension for Microsoft Edge, Chrome and Firefox, and as an add-in for Microsoft Word and Outlook.
3.3.6 NLP AI, Today and in the Near Future (3.3.6 NLP AI, Today and in the Near Future)
Today, NLP is everywhere. Its algorithms teach a machine to use a language just like how a human does it. Almost any features that involve languages would be based on NLP, namely machine translation, chatbots, conversational search, predictive typing, answering a question from the web and spell-checking. As all applications continue to grow and expand, humans are increasingly feeling comfortable to interact with machines.
Today, NLP is everywhere. Its algorithms teach a machine to use a language just like how a human does it. Almost any features that involve languages would be based on NLP, namely machine translation, chatbots, conversational search, predictive typing, answering a question from the web and spell-checking. As all applications continue to grow and expand, humans are increasingly feeling comfortable to interact with machines.
On the other hand, despite the success of NLP applications, one pivotal caveat is that NLP AI is not yet 100% accurate. Other factors still need to be taken into account, for instance different accents, senses of humour, regional dialects and language contractions. Also, one crucial reason AI systems struggle with deep and genuine conversation is their limited ability to take context into account.
Looking to the future, I strongly believe that the technologies won’t stop here, since we’ve only scratched the surface at this point. The future of NLP is open-ended. As technology progresses, NLP will continue to revolutionise communication between humans and machines. This area (speech recognition, natural language understanding, natural language processing and deep learning) will improve further through iterative and incremental research and experimentation.
Part IV: Conclusion
Broadly speaking, the current consensus is that we do not yet fully understand how emerging technologies are impacting disability rights, because the technologies are still under development and could take years to become fully mature.
But here is what I think should be taken into consideration:
4.1 Positive Impacts of AI on Persons with Disabilities
AI technologies and their advancements have huge potential to make our products more accessible. For persons with disabilities, AI products can increase inclusion, independence and equal access. The features and tools I mentioned in this post are valuable to users with disabilities and make it easier for companies to achieve compliance. The technologies might not be perfect, but they are worth further investment and development. And I would like to reiterate that AI ethics is not designed to restrict competition in business or slow down the rate of innovation; it is there to encourage us to make the technologies work better for us.
4.2 Constraints and Challenges with Disability-focused AI Solutions
For persons with disabilities, AI-related products have yet to be perfected for four key reasons. First and foremost is affordability: even though many tools are widely available, cost can be a major barrier to obtaining such technologies. Secondly, most AI solutions require internet access, and some areas simply do not have adequate internet infrastructure. Thirdly, persons with disabilities often lack the opportunity to learn what works best for them and to update their digital skills. Lastly, access to tech support can be a challenge in itself for persons with visual, hearing or motor impairments, because many companies provide only one communication method for contacting them.
4.3 Individual, Corporate, and Regulatory Responsibility for AI
To reduce the potential negative impacts of AI, such as people with disabilities being unable to access AI products or services, working with government bodies is vital. They create ethics frameworks, regulatory oversight and legal safeguards. Those frameworks could help companies better understand the social and ethical implications, as well as protect the public from unlawful and unethical products. Some might question whether over-regulation would stymie growth. The fact is that new AI regulations are coming, and companies that do not understand AI ethics will need to come up with creative solutions to reassure the public about how they will run a profitable business without succumbing to the temptation of using AI in an unethical manner. The same applies to individual citizens.
4.4 Accessible Future is a Human Determination
Keeping up with AI advancements and AI ethics can be overwhelming. Integrating ethics into the product development cycle can feel abstract for some. Having said that, things won’t go terribly wrong if we always adhere to best practices and use a human-centred design approach. After all, making any given piece of content, technology, or user interface accessible is ultimately a human determination. AI can help improve things, but it is humans who decide whether the interaction is accessible, i.e. compatible with assistive tech, usable, and user-friendly. If we all follow the 7 Universal Design principles, we are at less risk of making anyone feel excluded from new and emerging technologies.
4.5 Principles of Inclusion, Diversity and Equality
AI products should meet the needs and preferences of differently-abled people. From the outset, the technologies should include everyone regardless of their race, age, gender, religion and social background. People with and without disabilities should be able to enjoy the products “together”. If various forms of diversity are part of the AI development process, all stakeholders will be empowered to release responsible products. The future of AI will be decided by human actions. Together, we can build trust through our engagements and advance technologies for the benefit of mankind.
Disclaimer
This article is based on my own research, interest and passion for the topic, and does not necessarily represent LexisNexis’s positions, strategies, or opinions.
Acknowledgements
Many thanks to David Lovell, Ted Gies and Emili Budell-Rhodes for providing excellent feedback, Harris Osiana and David Goco for sharing their experience on automated testing, and Aaron Capua for creating the lovely illustrations.
References and Further Reading
1. 8 ways to ensure your company’s AI is ethical
2. AI & Language Translation: How to Train Your Machine Translation Engine
3. AI website builders — do they live up to the hype?
4. Augmenting Education: Ethical Considerations for Incorporating Artificial Intelligence in Education
5. Could smart mirrors change the way we shop?
6. Embrace 7 Principles of Universal Design for Better Website Design
7. Five Ways in Which Artificial Intelligence Changes the Face of Web Accessibility
8. How Artificial Intelligence is Rapidly Changing Web Accessibility
9. Making the Web More Accessible Using Machine Learning
10. Natural language processing, today and in the near future
11. Plug and Pray? — a disability perspective on artificial intelligence, automated decision-making and emerging technologies
12. Smart Mirrors transform retail
13. The Fourth Industrial Revolution, by Klaus Schwab
14. The Past Decade and Future of AI’s Impact on Society
15. What are some of the challenges we face in NLP today?
16. What Is The Impact Of Artificial Intelligence (AI) On Society?
17. What is Universal Design (definition and overview, the 7 principles, case studies)
Translated from: https://medium.com/lexisnexis-design/how-ai-can-influence-accessibility-aadd6a398996