Category: Languages

  • Bringing Javanese and Sundanese into the Digital Voice Ecosystem

    Bringing Javanese and Sundanese into the Digital Voice Ecosystem

    Millions of people speak Javanese and Sundanese across Indonesia, yet these languages remain underrepresented in global voice technologies. As speech-based systems continue to grow, ensuring that diverse languages are included has become an increasingly important challenge.

    To contribute to this effort, SOI Asia recently concluded a three-month collaboration supporting the inclusion of Javanese and Sundanese on Mozilla’s Common Voice platform — an open initiative that collects voice data to help make speech technologies more inclusive and accessible.

    The team of coordinators, fellows and mentors: (from the top left) Achmad Husni Thamrin (SOI Asia),Marcos Sadao Maekawa (APNIC Foundation), Kirana Ajeng Pratiwi Nurdin (ITB), Heidi Schan Andriana (ITB), Abdur Rohman Muhammad (UB), Gilang Ramadhan (UB), Ratno Wahyu Widyanto (UB), Eueung Mulyana(ITB), Achmad ‘Abazh’ Basuki (UB).

    This collaboration was an initiative brought forward by the APNIC Foundation and coordinated through SOI Asia, engaging students and faculty members from partner universities in a shared effort that combined language, technology, and community contribution.

    The main outcomes of the collaboration were the localization of the Common Voice website and the preparation of more than 300 prompts for spontaneous speech recordings, enabling the platform to accept contributions in both Javanese and Sundanese. As a result, both languages were successfully opened for contributions on Common Voice in early February.

    Beyond the technical outcomes, the experience also carried a strong personal dimension. During the wrap-up session, several participants reflected on how the process allowed them to reconnect with their linguistic roots — revisiting everyday expressions, involving family members in discussions, and rediscovering the cultural depth of their own languages.

    Spontaneous Speech interface in Sundanese.

    For SOI Asia, this collaboration also highlights the important role that technology can play in supporting language preservation. By contributing to an open dataset like Common Voice, participants are helping ensure that underrepresented languages are not left behind as speech technologies continue to evolve.

    At the same time, the initiative reflects SOI Asia’s broader approach of creating opportunities for universities and communities to engage with real-world digital ecosystems through collaboration. In this case, the contribution extends beyond technical work, bringing together cultural knowledge, local context, and collective effort.

    While the formal collaboration has concluded, the platform remains open, and continued contributions will be essential to further grow these language datasets. SOI Asia also looks forward to sharing this experience with the wider community in upcoming activities.

  • Short Report: Introduction to Spoken Japanese for SOI Asia Project

    Short Report: Introduction to Spoken Japanese for SOI Asia Project

    “Introduction to Spoken Japanese” course was successfully completed on Jan 16th, 2021. We are also glad to share the short report about the course. please check here : (PDF)

    We plan to offer the course again this academic year. Looking forward to your participation in the near future.
    We deeply appreciate your generous understanding and continued support for this activity.

  • (AI3, SOI Asia Partner Universities’ Students ONLY) Call for: Introduction to Spoken Japanese

    (AI3, SOI Asia Partner Universities’ Students ONLY) Call for: Introduction to Spoken Japanese

    We are pleased to announce to offer a Japanese language course, “Introduction to Spoken Japanese”!

    The interactive language course will be provided on Zoom every Saturday from 2:00 pm to 3:50 pm (JST, GMT+9) from Nov 7th to Dec 26th 2020.

    We are calling for around 50 participants as international students to join this course. (Only for AI3, SOI Asia Partner universities’ students.)
    We’d like to ask each partner university to call for 5 to 10 students.
    The students who complete the course will be awarded by e-certificate.

    Please find the detail in the attached course syllabus. (Click here > PDF)

    Online Registration Form (Deadline: Oct 24th, 2020):
    The link to the registration form is shared with partners only.

    Notes:
    1) The learning data of all the participants will be collected and used only for the investigation and research of Second or Foreign Language Education in Educational Technology.
    The private information of the participants will be protected and never used for other purposes.
    The participants require to read and agree with the student consent form on the online registration form.

    2) This course is irrelevant to the grading system of other courses on SOI Asia. Please decide whether or not to participate in this research based on your voluntary decision.
    Auditing the course without participating in the experiment is also your voluntary decision.
    Also, once you have given your consent, you are free to withdraw your consent at any time, and will not have any disadvantages.

    If you have concerns or queries, please feel free to contact us.
    Contact person: Ms. Yuka Shori Kataoka (shori [at] sfc.wide.ad.jp)
    *Please replace [at] with @ when contacting her.