TC10 Newsletter – IAPR TC10

[IAPR-TC10] Newsletter 160 – March 2026

Posted on 17/03/202623/03/2026 by Christophe Rigaud

We are delighted to present the 160^th edition of the IAPR-TC10 newsletter.

This month, we bring you upcoming events and opportunities to engage with fellow researchers and practitioners. Special focus on the next summer school on document analysis and most relevant ICPR/ICDAR competions and workshops.

Whether you are a long-time member or new to our community, we hope this newsletter inspires and informs your work. Thank you for being part of IAPR-TC10—let’s continue to advance the field together!

Best regards,
Christophe Rigaud
IAPR-TC10 Communications Officer

Table of content:

1) Upcoming deadlines and events
2) Open Call for Organizing DAR Events
3) IAPR TC10/TC11 Summer School on Document Analysis
4) ICDAR 2026 competition selection
5) ICDAR 2026 workshop selection
6) ICPR 2026 workshop selection
7) Job offer (1 new)

Call for contributions: feel free to contribute to TC10 newsletters, by sending any relevant news, event, notice, open position, dataset or link to us on iapr.tc10[at]gmail.com

Join us! If you are not already a member of the TC10 community, please consider joining the TC10 mailing list by clicking the join us button on the main page the website.

1) Upcoming deadlines and events

2026

Deadlines
- 20 March, summer school on document analysis (SSDA) registration deadline
- 3^rd May, MANPU workshop full paper submission deadline
- 16 May, TrustDoc workshop full paper submission deadline
- 20 May, HIP and WoRMS workshop full paper submission deadline
- 22^nd May, DAS workshop full paper submission deadline

Events:
- May 25-29, summer school SSDA 2026, Vall de Núria, Catalonia, Spain
- August 17-22, conference ICPR 2026, Lyon, France
- August 30- Sept. 04, conference ICDAR 2026, Vienna, Austria

2) Open Call for Organizing DAR Events

The IAPR technical committees on graphics recognition (TC10) and reading systems (TC11) are regularly organizing scientific events for the Document Analysis and Recognition (DAR) community, including the ICDAR flagship conference.

In addition to specific calls for bids to host one of the events, we encourage teams to announce their interest in organizing one of the following events:

ICDAR: International Conference on Document Analysis and Recognition (annually; next possibility in 2029)
DAS: International Workshop on Document Analysis Systems (satellite event of ICDAR in even years; next possibility in 2028)
HIP: International Workshop on Historical Document Imaging and Processing (satellite event of ICDAR; next possibility in 2027)
GREC: International Workshop on Graphics Recognition (satellite event of ICDAR in odd years; next possibility in 2027)
SSDA: Summer School on Document Analysis (next possibility in 2027).

Anyone interested in hosting one of these events is invited to announce their interest via email to kc.santosh@usd.edu and andreas.fischer@unifr.ch, in order to receive feedback and support for preparing a proposal.

3) IAPR TC10/TC11 Summer School on Document Analysis

6th summer school on document analysis focused on next-generation Document Understanding, including Retrieval-Augmented Generation (RAG), Vision-Language Models (VLMs), and structured knowledge representation.

The program combines foundational lectures to hands-on practice sessions and solving real-world industry challenges in collaborative teams.

Five days immersed in document analysis in an extraordinary mountain valley.

Location: Vall de Núria, Catalonia (Transport from Barcelona included in registration fee)

Key dates:

Pre-registration deadline: March 20
Acceptance notification: March 28
Summer School: May 25–29, 2026

For more information, and pre-registration: https://ssdag.cvc.uab.es/
Previous editions are listed here: https://iapr-tc10.univ-lr.fr/?page_id=151

4) ICDAR 2026 competition selection

Here are the 2 from 8 competitions that we think will most interest TC10 newsletter readers (full list available here: https://icdar2026.org/index.php/competitions/)

ICDAR2026 Competition on Multimodal Reasoning over Documents in Multiple Domains (DocVQA2026)

Building on the successful DocVQA series of competitions, this new competition aims to test the multi-modal reasoning abilities of state-of-the-art models on 8 different document domains: business reports, scientific papers, slides, scientific posters, maps, comics, infographics, and engineering drawings.

Important Dates:

January 30, 2026: Validation set is published.
March 3, 2026: Test set available on the evaluation server.
April 3, 2026: Competition results deadline.
April 17, 2026: Competition report submission due.
June 22, 2026: Competition winners announced.

For more information and to participate: https://www.docvqa.org/challenges/2026

ICDAR 2026 Competition on Writer and Pen Identification from Hand-Drawn Circles (CircleID)

CircleID is the ICDAR 2026 competition on identifying who drew a circle and which pen was used, using only scanned images of hand-drawn circles. Although a circle is a simple shape, it contains rich, subtle cues from both human motor behavior and pen/ink characteristics. The challenge is to learn representations that disentangle writer style from pen properties in static images.

Participants receive a new dataset of 46155 scanned, circle images collected under controlled conditions from more than 51 writers and 8 pens. Two tasks are evaluated: (1) writer identification (with an “unknown writer” class) and (2) pen classification.

For more information and to participate: https://thomasgorges.github.io/icdar2026-circleid/

5) ICDAR 2026 workshop selection

Here are the 3 from 8 workshop that we think will most interest TC10 newsletter readers (full list available here: https://icdar2026.org/index.php/workshops/)

DAS2026 Document Analysis System

The 2026 edition will take place in Vienna, Austria. DAS 2026 provides a platform for researchers, practitioners, and enthusiasts to contribute to addressing the critical issues in practical document analysis systems. Through DAS 2026, we highlight the transformative power of document analysis technologies and encourage a forward-looking dialogue on their future trajectory in light of the advances of Artificial Intelligence and Large Language Models, ensuring the workshop not only reflects the state of the art, but also inspires the next wave of technological advancements in document analysis systems.

Topics of interest:

Submission Types:

Typically, the workshop covers invited speaker talks along with oral, poster, tutorial, demo sessions and working group discussions. DAS 2026 will accept contributions of the following types:

Full Papers: Full papers should describe complete works of original research in topics relevant to the workshop and should not exceed 17 pages (including figures and references). Accepted submissions will be presented during the workshop (orally or by poster) and published in the Springer Lecture Notes in Computer Science (LNCS) series.

Short Papers: Short papers provide an opportunity to report on research in progress, to present live applied demos and novel positions on document analysis systems. Accepted short papers must not exceed 8 pages (including figures and references) and will appear in an extra booklet (non-archived), not in the official proceedings.

Demos: As usual, DAS 2026 will host a demonstration session where participants can show their technologies and scientific contributions in a hands-on environment. The demos will be aligned with the poster sessions of the workshop, providing adequate time for interaction. In order to participate, interested personnel should submit a short paper not exceeding 6 pages overviewing the detailed technical specifications and practical aspects of the proposed demo.

Important Dates (NO EXTENSIONS, all deadlines are AoE time):

Full paper submission: Friday, 22nd May, 2026
Full paper acceptance notification: Monday, 15th June, 2026
Full papers camera-ready: Monday, 22nd June, 2026
Short papers/demos submission: Monday, 29th June, 2026
Short papers/demos acceptance notification: Monday, 13th July, 2026
Short papers camera-ready: Monday, 20th July, 2026
Workshop Dates: Thursday, 3rd September, 2026 – Friday, 4th September, 2026

For more information and to submission process: https://das2026.seecs.edu.pk

HIP: 8th International Workshop on Historical Document Imaging and Processing

It is our pleasure to announce that the 8th International Workshop on Historical Document Imaging and Processing (HIP’26) will be held in conjunction with ICDAR2026, on 3-4 September 2026 in Vienna, Austria.

The workshop brings together researchers working with historical documents and intends to be complementary and synergistic to the work in analysis and recognition featured in the main sessions of ICDAR, the premier international forum for researchers and practitioners in the document analysis community.

Submissions are received until 20 May 2026 (Time zone: Anywhere on Earth) via CMT and undergo review by the members of the Program Committee.

Submissions must follow the ICDAR guidelines and template provided. It is not required to anonymize the submission, but authors are welcome to do so if they prefer it. Acceptance notifications will be sent out 22 June 2026, with camera ready submissions due on 29 June 2026.

Workshop topics include (but are not limited to):

Imaging and Image Acquisition
Digital Archiving Considerations
Document Restoration/Improving readability
Document Content Acquisition and Information Extraction
Family History Documents and Genealogies
Automated Classification, Grouping and Hyperlinking of Historical Documents
Digital Humanities applications of document analysis and recognition
Artificial Intelligence and Machine Learning for historical documents

For more information and submission process: https://blog.sbb.berlin/hip2026/

WoRMS: 7th International Workshop on Reading Music Systems

The workshop seeks to connect researchers from the field of Optical Music Recognition with potential users. We are open for everything that is related to music, that has been written. The relevant topics of interest for the workshop include, but are not limited to:

Music reading systems
Optical music recognition
Datasets and performance evaluation
Image processing on music scores
Sheet music search
Writer identification
Authoring, editing, storing and presentation systems for music scores
Multi-modal systems
Novel input-methods to produce written music
Web-based Music Information Retrieval services
Applications and projects
Use-cases related to written music

We strive to make the workshop as interactive as possible, with participants getting the opportunity not just to present their work, but to discuss current research and foster relationships within the community. Therefore, promising ideas, work-in-progress submissions and recently submitted or published works are equally welcome.

Important Dates

Deadline for submission: 20 May 2026
Notification of acceptance: 22 June 2026
Camera ready: 30 June 2026
Workshop: 4 September 2026, Vienna

For more information and submission process: https://sites.google.com/view/worms2026/home

6) ICPR 2026 workshop selection

Here are the 2 from 29 (!) workshops that we think will most interest TC10 newsletter readers (full list available here: https://icpr2026.org/workshops.html)

7th International Workshop on coMics ANalysis, Processing and Understanding

Comics is a medium constituted of images combined with text and other visual information in order to narrate a story. Nowadays, comic books are a widespread cultural expression all over the world. The market of comics continues to grow, for example, the market in Japan is about 4.25 billion USD in 2015. Moreover, from the research point of view, comics images are attractive targets because the structure of a comics page includes various elements (such as panels, speech balloons, captions, leading characters, and so on), the drawing of which depends on the style of the author and presents a large variability. Therefore comics image analysis is not a trivial problem and is still immature compared with other kinds of image analysis.

Important dates

Title / abstract submission due: April 26th, 2026
Paper submission due: May 3rd, 2026
Notification of acceptance: June 1st, 2026
Camera-ready paper due: June 14th, 2026
Workshop: August 22nd, 2026

Scope and Topics

The scope of this workshop includes, but is not limited to,

– Comics Image Processing
– Comics Analysis and Understanding
– Comics Recognition
– Comics Retrieval and Spotting
– Comics Enrichment
– Born Digital Comics
– Reading Behavior Analysis of Comics
– Comics Generation
– Copy protection
– Fraud detection
– Physical/Digital Comics Interfaces
– Cognitive Processing and Comprehension of Comics
– Linguistics Analysis of Comics

Contact : i-m-manpu2026-inquiry@ml.omu.ac.jp
More information: https://manpu2026.imlab.jp

TrustDoc Trustworthy Document Understanding: Privacy, Unlearning, Robustness, and Explainability

Pattern recognition is the foundation of modern document image understanding, supporting progress in document classification, handwritten text recognition, DocVQA, and multimodal modeling of text, layout, and visual structure. As these systems are increasingly deployed in real-world, high-stakes environments, they must meet new expectations for trust, safety, and regulatory compliance. Current document image models are required to support machine unlearning, to resist imperceptible document forgeries and membership inference attacks, to preserve the privacy of sensitive handwritten or scanned data, and to offer transparent and interpretable decisions. These demands raise fundamental research questions on how models memorise and forget document-specific features, how handwritten text recognition can remain robust under regime changes, and how multimodal document representations should be evaluated from ethical and trustworthiness perspectives.

The workshop invites contributions that advance this emerging area. It specifically welcomes contributions on topics including (but not limited to):

Machine unlearning in document AI

Robustness in document image recognition systems

Privacy in document image understanding

Explainability and interpretability

Evaluation, benchmarks, and best practices

Applications and case studies

Important Dates

Paper Submission Deadline: 16/05/2026 CET

Notification to Authors: 11/06/2026 CET

Camera-ready Deadline: 18/06/2026 CET

Workshop Date: 21/08/2026 CET

Publication

Accepted papers will be published in the ICPR Workshops Proceedings within the Lecture Notes in Computer Science (LNCS), Springer series. All accepted papers must be presented at the workshop.

Contact

Email: lkang@cvc.uab.es, dimos@cvc.uab.es
Website: https://sites.google.com/view/trustdoc/home

7) Job offer (1 new)

Biomedical AI Scientists (multiple)

Location: USD AI Research, The University of South Dakota

Contact: Professor and Chair, KC Santosh

[Note: We are happy to hire those you have machine learning foundation.]

The USD AI Research, Department of Computer Science at the University of South Dakota (USD) is seeking Computational Scientists in Biomedical and Clinical AI Research (Biomedical AI Scientists) to support ongoing projects within the South Dakota Biomedical Computation Collaborative (SDBCC), a federally funded program. The research of this position will focus on foundational machine learning models on large-scale biomedical and clinical datasets, including single- and multi-omics (genomics, transcriptomics, proteomics, etc.), medical imaging, and electronic health records, with a strong commitment to AI Standards and Innovation. The role involves working on the advancement of AI and machine learning, in line with USD’s mission as the flagship university of the state, and home to South Dakota’s leading programs in AI.

This is a grant-funded, three-year position, with the potential for extension based on performance and continued funding. To learn more about the lab’s research, please visit www.ai-research-lab.org. For more information, please contact Dr. KC Santosh, Professor and Chair of Computer Science, Founding Director of the USD AI Research, and Principal Investigator for AI at SD-BCC, at kc.santosh@usd.edu.

You reached the end, hope you enjoyed reading the IAPR-TC10 newsletter n°160.

[IAPR-TC10] Newsletter 159 – March 2025

Posted on 05/03/202508/03/2025 by Christophe Rigaud

Welcome to the March edition of the TC10 newsletter.

In this edition you will find a welcome message from the new TC10 Chair, Prof. KC Santosh! This issue also includes, the list of ICDAR 2025 competitions, the open call for organizing next DAR events and the last IJDAR issue articles list.

ICDAR 2025 full papers deadline has just been extended to March 14th, good luck with finalizing your articles!

Best regards,
Christophe Rigaud
IAPR-TC10 Communications Officer

Table of content:

1) Upcoming deadlines and events
2) Message from the new TC10 Chair
3) ICDAR 2025 competition list
4) Open Call for Organizing DAR Events
5) IJDAR paper alert
6) Job offer – 1 new

Call for contributions: feel free to contribute to TC10 newsletters, by sending any relevant news, event, notice, open position, dataset or link to us on iapr.tc10[at]gmail.com

Join us! If you are not already a member of the TC10 community, please consider joining the TC10 mailing list by clicking the join us button on the main page the website.

1) Upcoming deadlines and events

2025

Deadlines
- ~~Feb. 21~~ March 14, ICDAR 2025 full paper submission deadline

Events:
- June 17-20, conference IGS 2025, Polythechnique, Montréal, Canada
- September 16-21, conference ICDAR 2025, Wuhan, Hubei, China
- November 10-13, conference ACPR 2025, Gold Coast, Australia

2) Message from the new TC10 Chair

As the newly appointed Chair of IAPR’s Technical Committee 10 (TC10) on Graphics Recognition, I am honored to serve this dynamic and evolving research community. Graphics recognition plays a vital role in document image analysis, with applications spanning engineering drawings, historical manuscripts, maps, medical informatics, and more—ultimately serving to enhance human understanding and interaction with visual data. TC10 remains committed to fostering collaboration, advancing research, and supporting innovative developments in this field. Through our flagship events, including the ICDAR conference and GREC workshop, as well as community-driven initiatives such as datasets, competitions, and benchmarking efforts, we aim to strengthen engagement and drive progress.

Unlike before, I am committed to recruiting and preparing new talents for the community (not just limited to TC-10 committee) and creating special journal issues for upcoming GREC workshops. I encourage you to stay connected, contribute to our activities, and help expand our community.

Your participation matters! I will ensure that your presence and contributions are valued. Let’s work together to shape the future of graphics recognition!

KC Santosh, PhD
Professor (AI) and Chair, University of South Dakota (www.kc-santosh.org)
Founding Director, AI Research Lab (www.ai-research-lab.org)

3) ICDAR 2025 competition list

1. ICDAR 2025 Competition on Historical Map Text Detection, Recognition, and Linking

2. ICDAR 2025 Competition on Understanding Chinese College Entrance Exam Papers

3. ICDAR 2025 Competition on FEw-Shot Text line segmentation of ancient handwritten documents (FEST)

4. ICDAR 2025 Competition on End-to-End Document Image Machine Translation Towards Complex Layouts

5. ICDAR 2025 Competition on Multi-lingual Roadside Scene Text Recognition

6. ICDAR 2025 Competition on Automatic Classification of Literary Epochs

7. ICDAR 2025 Competition on Handwritten Text Recognition and Understanding

8. ICDAR 2025 Competition on Comics Understanding in the Era of Foundational Models

9. ICDAR 2025 Competition on Glyph Detection in 15th-Century European Printed Documents (to be confirmed)

4) Open Call for Organizing DAR Events

In addition to specific calls for bids to host one of the events, we encourage teams to announce their interest in organizing one of the following events:

ICDAR: International Conference on Document Analysis and Recognition (annually; next possibility in 2028)
DAS: International Workshop on Document Analysis Systems (satellite event of ICDAR in even years; next possibility in 2026)
HIP: International Workshop on Historical Document Imaging and Processing (satellite event of ICDAR in odd years; next possibility in 2025)
GREC: International Workshop on Graphics Recognition (satellite event of ICDAR in odd years; next possibility in 2025)
SSDA: Summer School on Document Analysis (biannually in odd years; next possibility in 2025).

KC Santosh (Chair, TC10)
Andreas Fischer (Chair, TC11)

5) IJDAR article alert (vol. 27, issue 3)

Volume 27, issue 3, September 2024. Please find below a link to the 18 articles:

Editorial for special issue on “advanced topics in document analysis and recognition”
Elisa H. Barney, Smith Marcus Liwicki, Simone Marinai

Exploring recursive neural networks for compact handwritten text recognition models
Enrique Mas-Candela & Jorge Calvo-Zaragoza

Self-training for handwritten word recognition and retrieval
Fabian Wolf & Gernot A. Fink

Neural models for semantic analysis of handwritten document images
Oliver Tüselmann & Gernot A. Fink

Evaluating learned feature aggregators for writer retrieval
Alexander Mattick, Martin Mayr, …, Vincent Christlein

CHWmaster: mastering Chinese handwriting via sliding-window recurrent neural networks
Shilong Li, Shusen Tang & Zhouhui Lian

Towards reduced-complexity scene text recognition (RCSTR) through a novel salient feature selection
Rina Buoy, Masakazu Iwamura, … , Koichi Kise

Parstr: partially autoregressive scene text recognition
Rina Buoy, Masakazu Iwamura, … , Koichi Kise

SemiDocSeg: harnessing semi-supervised learning for document layout analysis
Ayan Banerjee, Sanket Biswas, … , Umapada Pal

Approximate ground truth generation for semantic labeling of historical documents with minimal human effort – open access
Najoua Rahal, Lars Vögtlin & Rolf Ingold

Table image dewarping with key element segmentation
Ziyi Zhu, Zhi Tang Liangcai Gao

End-to-end semi-supervised approach with modulated object queries for table detection in documents
Iqraa Ehsan, Tahira Shehzadi, … , Muhammad Zeshan Afzal

A unified representation framework for the evaluation of Optical Music Recognition systems – open access
Pau Torras Sanket Biswas Alicia Fornés

ChemScraper: leveraging PDF graphics instructions for molecular diagram parsing
Ayush Kumar Shah, Bryan Amador, … , Richard Zanibbi

Generate, transform, and clean: the role of GANs and transformers in palm leaf manuscript generation and enhancement
Nimol Thuon, Jun Du, … , Pengfei Hu

Am I readable? Transfer learning based document image rectification
Pooja Kumari Sukhendu Das

DocXclassifier: towards a robust and interpretable deep neural network for document image classification
Saifullah Saifullah, Stefan Agne, … , Sheraz Ahmed

Towards privacy preserved document image classification: a comprehensive benchmark
Saifullah Saifullah, Dominique Mercier, … , Sheraz Ahmed

6) Job offers – 1 new

Postdoctoral Researcher in Artificial Intelligence,
AI Research Lab
University of South Dakota

URL: https://yourfuture.sdbor.edu/postings/41727

The Department of Computer Science at the University of South Dakota (USD) is currently seeking a talented individual to join our dynamic and innovative research team as Postdoctoral Researcher. The role involves working on the advancement of artificial intelligence (AI) and machine learning, in line with USD’s mission as the flagship university of the state, and home to South Dakota’s leading programs in AI.

The successful candidate will engage in cutting-edge research projects in machine learning, computer vision, and specialized areas such as xAI, AI ethics, human AI, sustainable AI, and green computing. They will be expected to contribute to ongoing research activities, publish in top-tier conferences and journals, and collaborate with researchers (in healthcare, for example).

Duties & responsibilities:

Work on machine learning models (human AI) and contribute to ongoing research activities
Publish work in peer-reviewed high impact scientific conferences and journals, such as ICCV, CVPR and IEEE Transactions
Assist in writing research grant proposals

Qualifications:

A Ph.D. in Computer Science, Computer Engineering, Artificial Intelligence, or a related field
Strong mathematical foundation
Strong programming skills in languages such as Python, Julia, MATLAB, Rust, and/or C++
Proven track record of research excellence, as evidenced by publications in top-tier conferences and journals in machine learning and computer vision
Strong written and oral communication skills
Be able to conduct independent research

The position is for a one-year term, with the possibility of extension based on performance and funding availability. Competitive salary and benefits package will be provided. To apply, please submit a cover letter (1 page), CV, research statement (1 page), copy of one publication, and contact information for three references. Applications will be reviewed on a rolling basis until the position is filled. Immediate hiring is possible.
https://yourfuture.sdbor.edu/postings/41727

For further information, please contact Prof. Dr. KC Santosh, Department Chair, at kc.santosh@usd.edu. To learn more about funding, please follow the link: https://sd-bcc.org.

[IAPR-TC10] Newsletter 158 – August 2024

Posted on 27/08/202403/09/2024 by Christophe Rigaud

Welcome to the August edition of the TC10 newsletter.

In few days will start our main event of the year, ICDAR 2024 in Athens, Greece. Six workshops including DAS and MANPU will be held on Friday and Saturday. Three tutorials will take place Sunday and the main conference will run from Monday to Wednesday, including four keynote speakers, orals and many competition and poster sessions. This year’s best young investigator award goes to… Dr. Vincent Christlein congratulations!

This issue also includes call for papers for the IJDAR journal
track of ICDAR 2025 and the open call for organizing next DAR events.

Looking forward to seeing you in Athens,

Christophe Rigaud
IAPR-TC10 Communications Officer

Table of content:

1) Upcoming deadlines and events
2) ICDAR 2024 Young Investigator Award – nomination
3) ICDAR 2024 Doctoral Consortium
4) ICDAR 2024 MANPU workshop – news
5) ICDAR 2024 DAS workshop – repost
6) ICDAR 2025 Call for Paper
7) ICDAR-IJDAR 2025 Journal Track – Call for Paper
8) Open Call for Organizing DAR Events
9) IJDAR paper alert
10) Job offers – repost

Call for contributions: feel free to contribute to TC10 newsletters, by sending any relevant news, event, notice, open position, dataset or link to us on iapr.tc10[at]gmail.com

Join us! If you are not already a member of the TC10 community, please consider joining the TC10 mailing list by clicking the join us button on the main page the website.

1) Upcoming deadlines and events

2024

Deadlines
- Nov 15, ICDAR 2025 Journal track paper submission deadline

Events:
- August 30, workshop MANPU 2024, Athens, Greece
- August 30-31, workshop DAS 2024, Athens, Greece
- August 30-4 Sep., conference ICDAR 2024, Athens, Greece
- December 1-5, conference ICPR 2024, Kolkata, India

2025 and later

Deadlines
- Feb 7 abstract submission deadline, ICDAR 2025
Events
- Sep 17-21, conference ICDAR 2025, Wuhan, China

2) IAPR/ICDAR Young Investigator Award – nomination

The IAPR/ICDAR Award Program is an established program designed to recognize individuals who have made outstanding contributions to the field of Document Analysis and Recognition in one or more of the following areas:

Research
Training of students
Research/Industry interaction
Service to the community

The IAPR/ICDAR Young Investigator Award is presented biannually in even years. Recipients need to be less than 40 years old at the time the award is made.

The recipient of the award 2024 is:

Dr. Vincent Christlein, for outstanding contributions to handwriting recognition applied to forensics and historical documents.

Congratulations!

The award will be presented at ICDAR 2024, followed by a keynote speech of Dr. Christlein.

Jean-Christophe Burie and Andreas Fischer
TC10 and TC11 Chairs

3) ICDAR 2024 Doctoral consortium

In 2011, the first Doctoral Consortium in the Document Analysis community was organized in conjunction with the International Conference on Document Analysis and Recognition (ICDAR). This has led to successful successor events at ICDAR 2013, ICDAR 2015, ICDAR 2017, ICDAR 2019, ICDAR 2021, and ICDAR 2023. The tradition of having a Doctoral Consortium as a satelite event to the ICDAR main conference will further be continued at ICDAR 2024 in Athens, Greece.

The goal of the ICDAR 2024 Doctoral Consortium is to create an opportunity for PhD students to test their research ideas, present their current progress and future plans, and receive constructive criticism and insights related to their future work and career perspectives. Students will have the opportunity to present an overview of their research plan during the poster sessions of the main conference. In addition, a mentor (a senior researcher who is active in the field) will be assigned to each student to provide individual feedback.

Participation in the ICDAR 2024 Doctoral Consortium will be limited to 25 students. Prospective participants are encouraged to submit their application by June 24, 2024. The Doctoral Consortium Chairs will then review all applications received. Preference will be given to students who are at a stage in their studies most likely to benefit (i.e., they have identified a research direction and published some initial results, but the thesis is not yet set in stone).

Participation to the Doctoral Consortium will be free for all accepted students. There will be no extra registration fees.

The ICDAR 2024 Doctoral Consortium will take place during the poster sessions of the main conference, i.e., on Monday, September 2, and Tuesday, September 3.

4) ICDAR 2024 MANPU workshop – news

The 6th International Workshop on coMics ANalysis, Processing and Understanding
Friday, August 30, 2024
Grand Hyatt Hotel, Athens, Greece
https://manpu2024.imlab.jp/#program

Organized in conjunction with ICDAR2024 The 18th International Conference on Document Analysis and Recognition, Grand Hyatt Hotel, Athens
Greece, August 30 - September 4, 2024

News: keynote speech, August 30 at 9:00am

The patterns of global comics
Neil Cohn (Tilburg University)

While sequential images are pervasive in society, from picture books to instruction manuals and storyboards, they appear most complex in comics around the world. Over the past two decades, an increasing focus on corpus analysis has begun to allow greater insights into how comics are structured and comprehended, across work blending manual and computational annotations. Here I will focus on the insights from two corpus projects: the Visual Language Research Corpus (360 comics, 9 countries, 48,000 panels) and the TINTIN Corpus (1,030 comics, 144 countries/territories, 76,000 panels). This analysis will highlight how the structures of comics change over time while interacting with the culture and languages of their authors, revealing distinctive “visual languages” that balance diverse and universal features, consistent with other linguistic systems.

https://manpu2024.imlab.jp/#keynote

5) ICDAR 2024 DAS workshop

The 16th International Workshop Document Analysis Systems
Friday, August 30 and Saturday, August 31, 2024
https://das2024.seecs.edu.pk/

Organized in conjunction with ICDAR2024 The 18th International Conference on Document Analysis and Recognition, Grand Hyatt Hotel, Athens Greece, August 30 - September 4, 2024

Document Analysis System (DAS) will be organized as a satellite workshop in conjunction with ICDAR 2024. Join us at DAS 2024 to explore the cutting-edge of document analysis technologies! This workshop is an unparalleled opportunity for researchers, practitioners, and technology enthusiasts to engage in a forward-looking discourse on document analysis systems’ evolution and future directions.

For more details, please have a look at the following link

https://das2024.seecs.edu.pk/submission.html

We are looking forward to your excellent contributions!

DAS Organizing Committee

6) ICDAR 2025 Call for Paper

September 17-21, 2025 Wuhan, Hubei, China
https://www.icdar2025.com/

ICDAR is the premier event for scientists and practitioners involved with document analysis and
recognition, a field of growing importance in our current age of digital transition. The 19th
edition of this flagship conference will beheld in Wuhan, September 17-21, 2025.

Conference Paper Submissions

There is both a standard conference paper track and a journal track at ICDAR 2025; details regarding the journal track maybe found in a separate Call for Papers.

Reviewing for conference papers will be double blind. Authors should not include their names, affiliations, or acknowledgements in submitted manuscripts, and should ensure that their identity is not revealed indirectly by citing their earlier work in the third person.

Papers maybe up to 15 pages long (including references) in Springer Lecture Notes in Computer Science (LNCS) format. Detailed submission instructions for authors can be found on the conference website (https://www.icdar2025.com/home).

Important Dates

Feb 7, 2025: Conference Title & Abstract submission deadline
Feb 21, 2025: Conference full paper upload and editing closed
Apr 11, 2025: Conference reviews due
Apr 18, 2025: Conference rebuttal due
Apr 25, 2025: Conference paper acceptance notation
May 16, 2025: Camera-Ready and final notation

Publication: Springer Lecture Notes in Computer Science

The conference proceedings will be published as part of the Springer Lecture Notes in Computer Science (LNCS) series. Accepted papers will be freely available through SpringerLink from the conference website for one year after publication, and will later be freely available through SpringerLink four years after publication.

Contact:

Xu-Cheng Yin <xuchengyin@ustb.edu.cn>
Dimosthenis Karatzas <dimos@cvc.uab.es>
Daniel Lopresti <lopresti@cse.lehigh.edu>

pc-chairs@icdar2025.org

7) 2025 ICDAR-IJDAR Journal Track – Call for Paper

https://www.icdar2025.com/submission/journal-track
https://link.springer.com/journal/10032/updates/27412600

Following a feature of ICDAR2019 through ICDAR2024, ICDAR 2025 will again include the option of a journal track that offers the rapid turnaround and dissemination times of a conference while providing the paper length, scientific rigor, and careful review process of an archival journal.

The ICDAR-IJDAR journal track invites high-quality submissions that present original work in the areas of Document Analysis and Recognition appropriate to both the International Conference on Document Analysis and Recognition (ICDAR) and the International Journal on Document Analysis and Recognition (IJDAR). Accepted papers will be published in a special issue of IJDAR and will receive an oral presentation slot at the ICDAR 2025 conference.

Authors who submit their work to the journal track commit themselves to present their results at the ICDAR conference in case of acceptance. Springer Nature, the publisher of IJDAR, will make the papers accepted for the journal track freely available in a time frame of four weeks around the conference, beyond being available in the archival journal.

Format: Papers submitted to the journal track should follow the format of IJDAR submissions and all of the standard guidelines.

Important Dates

Nov 15, 2024: Journal track paper submission deadline
Jan 10, 2025: Initial journal track decision announced
Mar 14, 2025: Journal track paper revise submission deadline
May 23, 2025: Final notification

IJDAR Journal Track Submission Guidelines:
Authors should submit their papers via the ICDAR 2025 collection on the
IJDAR website. Please choose ICDAR 2025 from the collection dropdown on the “Details” page of the submission process. Submitted papers should present original, unpublished work, relevant to one of the topics of the Special Issue. All submitted papers will be evaluated on the basis of relevance, significance of contribution, technical quality, scholarship, and quality of presentation, by at least three independent reviewers. It is the policy of the journal that no submission, or substantially overlapping submission, be published or be under review at another journal or conference at any time during the review process. Manuscripts will be subject to a peer reviewing process and must conform to the author guide
lines available on the IJDAR website.

Publication: Springer Lecture Notes in Computer Science
The conference proceedings will be published as part of the Springer Lecture Notes in Computer Science (LNCS) series. Accepted papers will be freely available through SpringerLink from the conference website for one year after publication, and will later be freely available through SpringerLink four years after publication.

8) Open Call for Organizing DAR Events

In addition to specific calls for bids to host one of the events, we encourage teams to announce their interest in organizing one of the following events:

ICDAR: International Conference on Document Analysis and Recognition (annually; next possibility in 2028)
DAS: International Workshop on Document Analysis Systems (satellite event of ICDAR in even years; next possibility in 2026)
GREC: International Workshop on Graphics Recognition (satellite event of ICDAR in odd years; next possibility in 2025)
SSDA: Summer School on Document Analysis (biannually in odd years; next possibility in 2025, see call for proposal in section 9 of this newsletter).

Anyone interested in hosting one of these events is invited to announce their interest via email to jean-christophe.burie@univ-lr.fr and andreas.fischer@unifr.ch, in order to receive feedback and support for preparing a proposal.

Jean-Christophe Burie (Chair, TC10)
Andreas Fischer (Chair, TC11)

9) IJDAR article alert (vol. 27, issue 2)

Volume 27, issue 2, June 2024. Please find below the 6 articles:

The image and ground truth dataset of Mongolian movable-type newspapers for text recognition
Min Lu Feilong Bao … Guanglai Gao

TableStrRec: framework for table structure recognition in data sheet images
Johan Fernandes Bin Xiao … Ala Abu Alkheir

Offline handwritten mathematical recognition using adversarial learning and transformers
Ujjwal Thakur & Anuj Sharma

Perceptual cue-guided adaptive image downscaling for enhanced semantic segmentation on large document images
Chulwoo Pack Leen-Kiat Soh & Elizabeth Lorang

SoftCTC—semi-supervised learning for text recognition using soft pseudo-labels – Special Issue Paper
Martin Kišš Michal Hradiš … Michal Kula

Attention-based multiple siamese networks with primary representation guiding for offline signature verification
Yu-Jie Xiong Song-Yang Cheng … Yu-Jin Zhang

10) Job offers – repost

For internship, please find an updated list maintained by IAPR of 38 companies with locations (some remote), requirements, etc.: https://iapr.org/about-us/internships/

PhD Positions Available (Fall 2024 or Spring 2025)

Document and Pattern Recognition Lab, RIT, USA

Area: Graphics-Oriented and Multi-Modal Information Retrieval and Information Extraction

Currently we are looking to recruit 2 PhD students to start their PhD as members of the Document and Pattern Recognition Lab in the CS Department in Fall 2024. The focus of our work is on recognizing and retrieving information in document, images, and videos, with an emphasis on graphical notations (e.g., math and chemistry).

An overview of some projects from the lab may be found online here. Former PhD students from the dprl work in a variety of industrial research positions (e.g., at Apple in the bay area) and as professors at universities (e.g., at DePaul University in Chicago, and the University of Southern Maine).

Information for Applicants:

· All applicants should carefully review the dprl lab pages at: https://www.cs.rit.edu/~dprl/index.html.

Have a look at the demonstrations, papers, dissertations, and theses published out of the lab.

· PhD applicants must hold or soon be completing a BSc or MSc in Computer Science (with foundations in theory, algorithms, and implementation).
· To apply, send your CV/resume along with a brief research proposal sketch (1-2 pages) that includes:

a specific research question,
how this is related to previous work in the dprl, and
a short sketch of how you would go about addressing the question
Note: You will not be committed to work proposed, this is for application purposes only.

· If your background and materials are a fit for the lab, I will email to set up two interviews over Zoom: the first for discussion, and the second as a technical interview

If you wish to apply or have any questions about the available positions, please do not hesitate to email me at rxzvcs@rit.edu. If you think you may be interested, please contact me soon, the deadline for formal applications to the program is in early-to-mid January of 2024.

Richard Zanibbi.

[IAPR-TC10] Newsletter 157 – March 2024

Posted on 27/03/202428/03/2024 by Christophe Rigaud

Welcome to the March edition of the TC10 newsletter.

In this issue you’ll find ICDAR 2024 satellite workshop (DAS, MANPU) and several competition announcements, ICPR 2024 deadline extension, IJDAR paper alert and a special issue at Multimedia Tools and Applications: Heritage Preservation in the Digital Age. There are also several approaching deadlines listed in the first section.

I wish you a pleasant reading,

Christophe Rigaud
IAPR-TC10 Communications Officer

Table of content:

1) Upcoming deadlines and events
2) MANPU@ICDAR 2024 Call for Paper
3) DAS@ICDAR 2024 Call for Paper
4) Call for Nominations for the 2024 IAPR Fellow Awards – repost
5) ICDAR 2024 Occluded RoadText Competition
6) ICDAR 2024 Competition on Handwriting Recognition of Historical Ciphers
7) ICDAR 2024 Competition on Map Text Detection, Recognition, and Linking
8) SSDA 2025: Call for proposal for the next summer school on document analysis – repost
9) I CPR 2024 call for paper – extended
10) International Computer Vision Summer School
11) Special issue at Multimedia Tools and Applications: Heritage Preservation in the Digital Age
12) Open Call for Organizing DAR Events
13) IJDAR paper alert
14) Job offers

Call for contributions: feel free to contribute to TC10 newsletters, by sending any relevant news, event, notice, open position, dataset or link to us on iapr.tc10[at]gmail.com

Join us! If you are not already a member of the TC10 community, please consider joining the TC10 mailing list by clicking the join us button on the main page the website.

1) Upcoming deadlines and events

2024

Deadlines
- ~~March 20~~ April 10, paper submission deadline, ICPR 2024, Kolkata, India – extended
- March 31, workshop proposal deadline for ICPR 2024
- March 31, deadline of the IAPR Fellow Awards Call for Nominations
- April 5, proposal submission for hosting SSDA 2025
- April 15, paper submission deadline for special issue: Heritage Preservation in the Digital Age – extended
- April 23, abstract submission deadline MANPU 2024 – extended
- April 29, paper submission deadline DAS 2024
- June 15, tutorial proposal deadline for ICPR 2024

Events:
- August 30, workshop MANPU 2024, Athens, Greece
- August 30-31, workshop DAS 2024, Athens, Greece
- August 30-4 Sep., conference ICDAR 2024, Athens, Greece
- December 1-5, conference ICPR 2024, Kolkata, India

2025 and later

2) MANPU@ICDAR 2024 Call for Paper

The 6th International Workshop on coMics ANalysis, Processing and Understanding
August 30, 2024
Grand Hyatt Hotel, Athens, Greece
https://manpu2024.imlab.jp

Organized in conjunction with ICDAR2024 The 18th International Conference on Document Analysis and Recognition, Grand Hyatt Hotel, Athens
Greece, August 30 - September 4, 2024

Why should we research on Comics?

For example, the detection and recognition of the characters in a comics page are not trivial since the main character can be a human being, an animal, or even an imaginary character. In this context, the “pattern recognition” task is a tricky problem. Comics analysis has aroused interest among researchers. The number of scientific papers dealing with comics analysis has significantly increased in international conferences and journals during the last ten years.

Important Dates

Title and abstract submission due:	April 23rd, 2024
Paper submission due:	April 30th, 2024
Notification of acceptance:	May 27th, 2024
Camera-ready paper due:	June 10th, 2024
Workshop:	August 30th, 2024

Scope and Topics

The scope of this workshop includes, but is not limited to,

Comics Image Processing
Comics Analysis and Understanding
Comics Recognition
Comics Retrieval and Spotting
Comics Enrichment
Born-digital comics
Reading Behavior Analysis of Comics
Comics Generation
Copy protection – Fraud detection
Physical/Digital Comics Interfaces
Cognitive Processing and Comprehension of Comics
Linguistics Analysis of Comics

Datasets

To evaluate the proposed works, participants will be able to use the following datasets that are publicly available. Researchers can request to download them at each website.

eBDtheque consists of 100 images with ground truth for panels, speech balloons, tails, text lines, leading characters.
website: http://ebdtheque.univ-lr.fr/

Manga109 consists of over 20 thousand images of 109 volumes (21,142 images).
website: http://www.manga109.org/en/

return to top

Paper Submissions

Submission and Review

All papers will have to be submitted through the EasyChair submission system on or before the submission deadline. Authors can update their papers before the submission deadline. MANPU 2024 will follow a single-blind review process. The accepted papers will be published in the “ICDAR pre-conference volume” edited by Springer in the Lecture Notes in Computer Science (LNCS) series.

Paper format and length.

Papers should be formatted with the style files/details available at Information for Authors of Springer Computer Science Proceedings. The LaTeX template for LNCS can be downloaded here. It is also available on Overleaf. Only PDF files are accepted. A complete paper should be submitted in the proper format. Papers accepted for the workshop will be allocated up to 15 pages (usually not counting references) in the proceedings. Submissions are expected to be in the range of 10-15 pages.

Submission site

Paper submissions for MANPU2024 will be handled through the Easy Chair conference management system. Please visit https://www.easychair.org/conferences/?conf=manpu2024.

General Co-Chairs,
Jean-Christophe Burie University of La Rochelle, France
Motoi Iwata Osaka Metropolitan University, Japan
Yusuke Matsui The University of Tokyo, Japan

Check last news: https://manpu2024.imlab.jp

3) DAS@ICDAR 2024 Call for Paper

The 16th International Workshop Document Analysis Systems
August 30-31, 2024
https://das2024.seecs.edu.pk/

Organized in conjunction with ICDAR2024 The 18th International Conference on Document Analysis and Recognition, Grand Hyatt Hotel, Athens Greece, August 30 - September 4, 2024

We welcome papers and demos in the following categories according to Springer formatting guidelines.

Full papers (12-15 pages) for comprehensive research findings,
Short papers (6-8 pages) for research in progress, and novel ideas
Demos. Must be accompanied by a short paper (6-8 pages)
Demos for Accepted ICDAR papers (4-8 pages) highlighting the application and technical detail of the accepted paper.

Important Dates

Full Paper submission: April 29, 2024
Full Paper Acceptance notification: May 27, 2024
Full Papers Camera-ready: June 07, 2024
Short Papers/Demos Submission: June 03, 2024
Short Papers/Demos Acceptance Notification: June 10, 2024
Short Papers Camera-ready: June 17, 2024

For more details, please have a look at the following link

https://das2024.seecs.edu.pk/submission.html

We are looking forward to your excellent contributions!

DAS Organizing Committee

4) Call for Nominations for the 2024 IAPR Fellow Awards – repost

Deadline: March 31, 2024

Full 2024 Nomination Instructions can be found here (.docx)

To initiate a nomination, a nominator must complete and submit an IAPR Fellow Nomination Form. Any member of an IAPR Member Society can serve as nominator, except for the nominee themself and the current members of the Executive and Fellow Committees.

Each nomination must be endorsed by at least one recommendation letter (submitted endorsement form), either from a member of an IAPR Member Society (different from the nominator) or from an IAPR Fellow.

All electronic documents (Nomination and Endorsement forms) must be submitted electronically and will be acknowledged by an email. Submission problems should be reported to the IAPR Webmaster, cc’ing the Fellow Committee Chair, Prof. Umapada Pal, Indian Statistical Institute, Kolkata, India:

To: webmaster@iapr.org
Subject: Submission problem — IAPR Fellowship 2024
CC: umapada@isical.ac.in; umapada_pal@yahoo.com

Click here for a list of members of the IAPR Fellow Committee

IAPR appreciates your efforts to support our fellowship program!

5) ICDAR 2024 Occluded RoadText Competition

The Occluded RoadText challenge is designed to evaluate and advance existing methodologies in scene text detection and recognition, specifically under conditions of partial occlusion.

The challenge introduces a new dataset featuring real-world traffic scenes with natural objects obscuring parts of the text. Participants will need to develop methods that can decipher partially visible text and leverage the context of the entire image, including other visible texts, objects and supplementary images.

The challenge contains three different tasks – 1. Text Localization, 2. Single Image End-to-End Recognition, and 3. Multi Image End-to-End Recognition

Competition Website

https://rrc.cvc.uab.es/?ch=29

Important Dates

* 15th January 2024: Competition Announced
* 19th February 2024: Validation data released
* 5th March 2024: Test data release
* 20th March 2024: Submission site opens
* 10th May 2024: Deadline for competition submissions

All deadlines are in the AoE time zone.

Organizers

* George Tom, Minesh Mathew, Ajoy Mondal, C. V. Jawahar – CVIT, IIIT Hyderabad
* Jerod Weinman – Grinnell College
* Dimosthenis Karatzas – Computer Vision Center, Universitat Autonoma de Barcelona, Spain

6) ICDAR 2024 Competition on Handwriting Recognition of Historical Ciphers

Call for participation

https://rrc.cvc.uab.es/?ch=27&com=introduction

Handwritten Text Recognition (HTR) in low resource scenarios (i.e. when the amount of labeled data is scarce) is a challenging problem. This is particularly the case of historical encrypted manuscripts, so called ciphers, which contain secret messages, and were typically used in military or diplomatic correspondence, records of secret societies, or private letters. In order to hide their contents, the sender and receiver created their own secret method of writing. The cipher alphabets oftentimes include digits, Latin or Greek letters, Zodiac and alchemical signs combined with various diacritics, as well as invented ones. The first step in the decryption process is the transcription of these manuscripts which is not easy due to the great variation of hand-writing styles, and cipher alphabets with a few number of pages. Although different strategies can be considered to deal with the insufficient amount of training data (e.g. few-shot learning, self-supervised learning) the performance of available HTR models is not yet satisfactory. Thus, we believe that a competition with a large number of symbol sets and scribes can boost the research of HTR in low resource scenarios.

Dates:

18 February 2024: test data released
10 May 2024: submission of results (extended deadline)

7) ICDAR 2024 Competition on Map Text Detection, Recognition, and Linking

Call for Participation

Dear colleagues,

The deadline for final submission has been extended to April 15th, so you still have a month left to register for and participate in the ICDAR 2024 Competition on Map Text Detection, Recognition, and Linking! If you are at the forefront of text detection and recognition, we invite you to join us: https://rrc.cvc.uab.es/?ch=28

Please also note that we are now accepting submissions for the final test set.

This competition aims to tackle the unique challenges of detecting and recognizing textual information (e.g., place names) and linking words to create location phrases from scanned historical map images.

Training, validation and test sets are already available, along with evaluation tools. Follow us on Twitter (@ICDAR24_MapText) to stay updated with the latest news!

The competition is organized by seasoned competition organizers from the following institutions: University of Minnesota (USA), Grinnell College (USA), Univ Gustave Eiffel, ENSG, IGN, LASTIG (France), EHESS (France), EPITA (France).

We look forward to seeing your innovative solutions in action!

— ICDAR 2024 MapText Organizers

8) SSDA 2025: Call for proposal for the next summer school on document analysis – repost

Important Dates

April 5, 2024 Proposal Submission Deadline

Submit Proposals via email to:

Foteini Simistira Liwicki (TC11 Representative), foteini.liwicki@ltu.se
Momina Moetesum (TC10 Representative), momina.moetesum@seecs.edu.pk

The “IAPR TC10/TC11 Summer School on Document Analysis” is intended to become the primary educational activity of IAPR TC11 (Reading Systems) and TC10 (Graphics Recognition). The School is meant to be a training activity where participants are exposed to the latest trends and techniques of Reading Systems and Graphics Recognition.

The aim of the School is to provide both an objective and clear overview and an in-depth analysis of the state-of-the-art research in selected topics of Reading Systems and Graphics Recognition. The School should aim to provide a stimulating opportunity for young researchers and PhD students in the field.

Individuals and groups who are interested in Reading Systems and Graphics Recognition are invited to submit proposals for organizing and hosting the 2025 IAPR TC10 / TC11 Summer School. As the previous summer schools were organized in Asia, Europe, and the Sub-continent, organizing teams from the Americas and Africa are encouraged to submit a bid in order to facilitate the envisioned rotational scheme of the IAPR TC10 / TC11 Summer School.

In order to fully plan their bid, it is expected that proposers familiarize themselves with the guidelines for organizing the School first. The Guidelines can be found at the TC11 Web site or click here.

Please consider submitting a proposal for this increasingly important event for the TC10/TC11 community. If you have questions, please do not hesitate to contact the TC11 and TC10 SSDA representatives: Foteini Simistira Liwicki (TC11 Representative) and Momina Moetesum (TC10 Representative).

Previous events:

As a reference, the 2023 Summer School on Document Analysis was held in Fribourg, Switzerland (URL: https://ssda2023.isc.heia-fr.ch/ )

9) ICPR 2024 call for paper – extended

Greetings from the ICPR-2024 organizing committee!

ICPR-2024 is the 27th event of the series which will be held at Kolkata, India during December 1-5, 2024 and ICPR-2024 will be celebrating 50years of ICPR.

Paper submission deadline of ICPR-2024 is extended upto April 10, 2024. Organizing committee invites prospective researchers to submit papers, workshop proposals, tutorial proposals, competition proposals, etc. Information about the calls for Papers/Workshops/Tutorials/Competitions are available at the ICPR-2024 website:

https://icpr2024.org/

The deadlines for the different calls are as follows:

·         Paper submission deadline:  April 10,  2024 (extended)
·         Competition Proposal submission deadline: March 31, 2024
·         Workshop proposal submission deadline: March 31, 2024
·         Tutorial proposal submission deadline: June 15, 2024

Submission and Review:

ICPR-2024 will follow a single-blind review process. Authors can include their names and affiliations in the manuscript.

Paper Format and Length:

Springer LNCS format with maximum 15 pages (including references) during paper submission. To take care of reviewers’ comments, one more page is allowed (without any charge) during revised/camera ready submission. Moreover, authors may purchase up to 2 extra pages.

Springer LNCS paper formatting instructions and templates for ICPR-2024 are available in the ICPR-2024 website (https://icpr2024.org/for-papers.html)

Please visit the ICPR-2024 website icpr2024.org for more details.

For any enquiry please contact the ICPR-2024 Secretariat via email at icpr2024@gmail.com and icpr2024@isical.ac.in

We look forward to your participation in ICPR-2024.

With regards,

Umapada Pal, Indian Statistical Institute, Kolkata, India
Josef Kittler, University of Surrey, UK
Anil Jain, Michigan State University, USA
(ICPR-2024 General Chairs)

Rama Chellappa, Johns Hopkins University, USA
Apostolos Antonacopoulos, University of Salford, UK
Cheng-Lin Liu, Institute of Automation of Chinese Academy of Sciences, China
Subhasis Chaudhuri, Indian Institute of Technology Bombay, India
(ICPR-2024 Program Chairs)

10) International Computer Vision Summer School

**Computer Vision in the Age of Large Language Models**

Sicily, Italy, 07-13 July 2024

Web: www.dmi.unict.it/icvss
Email: icvss@dmi.unict.it

APPLICATION DEADLINE: 31 March 2024

https://iplab.dmi.unict.it/icvss2024/Application

*MOTIVATION AND DESCRIPTION*

The eighteenth edition of the International Computer Vision Summer School aims to provide both an objective and clear overview and an in-depth analysis of the state-of-the-art research in Computer Vision. The last decade has seen a revolution in the theory and application of computer vision and machine learning. The next decade will see machine perception in embodied systems which learn representations for and from action and interaction.

The courses will be delivered by world renowned experts in the field, from both academia and industry, and will cover both theoretical and practical aspects of real Computer Vision problems as well as examples of their successful commercialization.

The school aims to provide a stimulating opportunity for young researchers and Ph.D. students. The participants will benefit from direct interaction and discussions with world leaders in Computer Vision. Participants will also have the possibility to present the results of their research, and to interact with their scientific peers, in a friendly and constructive environment.

*LIST OF SPEAKERS*

Zeynep Akata, University of Tübingen, DEU
Vijay Badrinarayanan, Wayve, USA
Michael Black, Max Planck Institute for Intelligent Systems, DEU
Andreas Geiger, University of Tübingen, DEU
Georgia Gkioxari, California Institute of Technology, USA
Abhishek Gupta, University of Washington, USA
Aaron Hertzmann, Adobe Research, USA
Derek Hoeim, University of Illinois at Urbana-Champaign, USA
Diane Larlus, Naver Labs Europe, FRA
Richard Newcombe, Meta Reality Labs Research, USA
Pietro Perona, California Institute of Technology, USA
Stefano Soatto, Amazon and University of California Los Angeles, USA
Andrea Tagliasacchi, Simon Fraser University, CAN
Antonio Torralba, Massachusetts Institute of Technology, USA

*SCHOOL DIRECTORS*

• Roberto Cipolla, University of Cambridge, United Kingdom

• Sebastiano Battiato, University of Catania, Italy

• Giovanni Maria Farinella, University of Catania, Italy

*SPECIAL SESSION – Industry meets Student*

In addition to the academic programme, we are organizing a special session to allow students to meet and learn about the opportunities and activities at the world leading research laboratories and companies which are exploiting computer vision.

*INDUSTRIAL PANEL*

Picture of last editions: https://photos.app.goo.gl/7pMQXeKvGhn7TSMaA

Previous editions link: https://iplab.dmi.unict.it/icvss2024/PreviousEditions

*MORE INFORMATION*

www.dmi.unict.it/icvss
icvss@dmi.unict.it

Facebook: https://www.facebook.com/groups/540991352579753/

11) Call for papers Special Issue: Heritage Preservation in the Digital Age

Heritage Preservation in the Digital Age: Advances in machine learning, monomodal and multimodal processing, and human-machine interaction 

Special issue at Multimedia Tools and Applications, Springer

Deadline: April 15, 2024
https://link.springer.com/journal/11042/updates/26502650

Aims and Scope

This special issue focuses on analyzing, processing and valorizing all types of data related to cultural heritage, including tangible and intangible heritage. As stated by UNESCO, cultural heritage provides societies with a wealth of resources inherited from the past, created in the present for the benefit of future generations. The massive digitization of historical analogue resources and production of born digital documents provide us with large volumes of varied multimedia heritage data (images, maps, text, video, 3D objects, multi-sensor data, etc.), which represent an extremely rich heritage that can be exploited in a wide variety of fields, from research in social sciences and computational humanities to land use and territorial policies, including urban modeling, digital simulation, archaeology, tourism, education, culture preservation, creative media and entertainment.

In terms of research in computer science, artificial intelligence, and digital humanities, they address challenging problems related to the diversity, specificity or volume of the media, the veracity of the data, and the different user needs with respect to engaging with this rich material and the extraction of value out of the data. These challenges are reflected in the corresponding sub-fields of machine learning, signal processing, mono/multi-modal techniques and human-machine interaction.

The objective of this special issue is to present and discuss the latest and most significant trends on analysis, understanding and promotion of heritage contents, focusing on advances on machine learning, signal processing, mono/multi-modal techniques, and human-machine interaction. We welcome research contributions for (but not limited to) the following topics:

Monomodal analysis: image, text, video, 3D, music, sensor data and structured referentials
Information retrieval for multimedia heritage
AI assisted archaeology and heritage data processing
Multi-modal deep learning and time series analysis for heritage data
Heritage modeling, visualization, and virtualization
Smart digitization and reconstruction of heritage data
Open heritage data and bench-marking

The scope of targeted applications is extensive and includes:

Analysis, archaeometry of artifacts
Diagnosis and monitoring for restoration and preventive conservation
Geosciences / Geomatics for cultural heritage
Education
Smart and sustainable tourism
Urban planning
Digital Twins

Important Dates

Submission deadline: ~~31 March 2024~~ —> 15 April 2024
Review period: 16 April – 30 July 2024
Notification: 31 July 2024
Author revision deadline: 15 October 2024
Final notification: 31 October 2024

Guest Editors

Valerie Gouet-Brunet, IGN-ENSG, University of Gustave Eiffel, France (valerie.gouet@ign.fr)
Ronak Kosti, Piscsart AI Lab, Germany (ronakfau@gmail.com)
Li Weng, School of Information Technology, Zhejiang Financial College, China (lweng@zfc.edu.cn)

Submission Guidelines

Authors should prepare their manuscript according to the Instructions for Authors available from the Multimedia Tools and Applications website. Authors should submit through the online submission site at https://www.editorialmanager.com/mtap/default.aspx and select “SI 1250- Heritage Preservation in the Digital Age: Advances in machine learning, monomodal and multimodal processing, and human-machine interaction” when they reach the “Article Type” step in the submission process.

SUMAC 2023. Please note that the authors of selected papers presented at workshop SUMAC2023 (ACM Multimedia 2023) are invited to submit an extended version of their contributions by taking into consideration both the reviewers’ comments on their conference paper, and the feedback received during presentation at the conference.

12) Open Call for Organizing DAR Events

In addition to specific calls for bids to host one of the events, we encourage teams to announce their interest in organizing one of the following events:

ICDAR: International Conference on Document Analysis and Recognition (annually; next possibility in 2027)
DAS: International Workshop on Document Analysis Systems (satellite event of ICDAR in even years; next possibility in 2026)
GREC: International Workshop on Graphics Recognition (satellite event of ICDAR in odd years; next possibility in 2025)
SSDA: Summer School on Document Analysis (biannually in odd years; next possibility in 2025, see call for proposal in section 9 of this newsletter).

Jean-Christophe Burie (Chair, TC10)
Andreas Fischer (Chair, TC11

13) IJDAR article alert (vol. 27, issue 1)

Volume 27, issue 1, March 2024. Please find below the 7 articles:

BPFormNet: a lightweight block pyramid network for form segmentation and classification
Hanyang Lin, Yongzhao Zhan & Chongshu Wu

Chart classification: a survey and benchmarking of different state-of-the-art methods
Jennil Thiyam, Sanasam Ranbir, Singh Prabin & Kumar Bora

Chinese text recognition enhanced by glyph and character semantic information
Shilian Wu, Yongrui Li & Zengfu Wang

Attribute-based document image retrieval
Melissa Cote Alexandra & Branzan Albu

A multifaceted evaluation of representation of graphemes for practically effective Bangla OCR
Koushik Roy, Md Sazzad Hossain, Pritom Kumar Saha, Shadman Rohan, Imranul Ashrafi, Ifty Mohammad Rezwan, Ifty Mohammad Rezwan, B. M. Mainul Hossain, Ahmedul Kabir & Nabeel Mohammed

A deep learning-based solution for digitization of invoice images with automatic invoice generation and labelling
Halil Arslan Yunus Emre Işık Yasin Görmez

Correction to: MRZ code extraction from visa and passport documents using convolutional neural networks
Yichuan Liu, Otkrist Joren, Gupta Hailey & Dan Raviv

14) Job offers – 1 new

For internship, please find an updated list maintained by IAPR of 38 companies with locations (some remote), requirements, etc.: https://iapr.org/about-us/internships/

PhD Positions Available (Fall 2024 or Spring 2025)

Document and Pattern Recognition Lab, RIT, USA

Area: Graphics-Oriented and Multi-Modal Information Retrieval and Information Extraction

Information for Applicants:

· All applicants should carefully review the dprl lab pages at: https://www.cs.rit.edu/~dprl/index.html.

Have a look at the demonstrations, papers, dissertations, and theses published out of the lab.

a specific research question,
how this is related to previous work in the dprl, and
a short sketch of how you would go about addressing the question
Note: You will not be committed to work proposed, this is for application purposes only.

· If your background and materials are a fit for the lab, I will email to set up two interviews over Zoom: the first for discussion, and the second as a technical interview

[IAPR-TC10] Newsletter 156 – November 2023

Posted on 07/11/202317/11/2023 by Christophe Rigaud

Welcome to the November edition of the TC10 newsletter.

In this issue, you will find a welcome message for the new educational officer and dataset curator, several calls and reports regarding past and next ICDAR conferences, SSDA summer school and also next ICPR and DAS editions. Along with two calls for nomination from the IAPR association, the editorial of the special Issue: “Advanced Topics in Document Analysis and Recognition” (14 articles) and finally, please find the summary of the last IJDAR issue and a two job offers.

I wish you a pleasant reading,

Christophe Rigaud
IAPR-TC10 Communications Officer

Table of content:

1) Upcoming deadlines and events
2) Welcome to the new TC10 educational officer and dataset curator
3) Call for Nominations for the 2024 IAPR Fellow Awards
4) Call for Nominations for the IAPR/ICDAR Young Investigator Award
5) ICDAR 2023: ScalDoc workshop report
6) ICDAR 2024: Journal-first Track Call for Papers and Reviewers
7) ICDAR 2024: Call for Tutorials
8) SSDA 2023: Report of the last summer school on document analysis
9) SSDA 2025: Call for proposal for the next summer school on document analysis
10) DAS 2024: Call for Proposals
11) ICPR 2024: Call for Papers
12) ICDAR 2027: Call for Hosting Proposals
13) Open Call for Organizing DAR Events
14) Special Issue: “Advanced Topics in Document Analysis and Recognition”
15) IJDAR paper alert
16) Job offers

Call for contributions: feel free to contribute to TC10 newsletters, by sending any relevant news, event, notice, open position, dataset or link to us on iapr.tc10[at]gmail.com

1) Upcoming deadlines and events

2023

Deadlines:
- November 14, paper submission firm deadline, Journal-First track of ICDAR 2024
- November 30, proposal submission for hosting DAS 2024
- December 15, nominations due for the IAPR/ICDAR Young Investigator Award
Events:

2024 and later

Deadlines
- January 20, paper submission deadline, ICPR 2024, Kolkata, India
- January 31, proposal submission for hosting ICDAR 2027
- April 5, proposal submission for hosting SSDA 2025
- March 31, deadline of the IAPR Fellow Awards Call for Nominations

Events:
- February 27-29, conference VISAPP, Rome, Italy
- August 30-4 sep., conference ICDAR 2024, Athens, Greece
- December 1-5, conference ICPR 2024, Kolkata, India

2) Welcome to the new TC10 educational officer and dataset curator

The TC10 Committee is delighted to welcome two new members in the team : Momina Moetesum as Educational Officer and Sanket Biwas as Dataset Curator.

Welcome to the TC10 committee.

Jean-Christophe Burie

Sanket Biwas is currently a Ph.D. candidate in the Computer Vision Center (CVC), Universitat Autònoma de Barcelona, Spain

Momina Moetesum is an Assistant Professor in Faculty of Computing at the School of Electrical Engineering and Computer Sciences (SEECS), National University of Sciences and Technology (NUST), Pakistan.

3) Call for Nominations for the 2024 IAPR Fellow Awards

Deadline: March 31, 2024

Full 2024 Nomination Instructions can be found here (.docx)

To: webmaster@iapr.org
Subject: Submission problem — IAPR Fellowship 2024
CC: umapada@isical.ac.in; umapada_pal@yahoo.com

Click here for a list of members of the IAPR Fellow Committee

IAPR appreciates your efforts to support our fellowship program!

4) Call for Nominations for the IAPR/ICDAR Young Investigator Award

Nominations Due: December 15, 2023

Research
Training of students
Research/Industry interaction
Service to the community

There are two awards categories, which have been presented together bi-annually in the past: The Young Investigator Award (less than 40 years old at the time the award is made) and the Outstanding Achievements Award. Because of the new yearly schedule of ICDAR, the two categories will alternate in odd and even years.

For ICDAR 2024, nominations are invited for the following award category:

IAPR/ICDAR Young Investigator Award

The award will consist of a token gift and a suitably inscribed certificate. The recipient will be invited to give the opening keynote speech at the ICDAR 2024 conference, introduced by the previous recipient of the award.

The nomination pack should include the following:

A nominating letter (1 page) including a brief citation to be included in the certificate.
Supporting letters (1 page each) from 3 active researchers from at least 3 different countries.

A nomination is usually put forward by a researcher (preferably from a different institution than the nominee) who is knowledgeable of the scientific achievements of the nominee, and who organizes letters of support.

The submission procedure is strictly confidential, and self-nominations are not allowed.

Please send nominations packs electronically to the TC10/11 chairs: Andreas Fischer andreas.fischer@hefr.ch and Jean-Christophe Burie jean-christophe.burie@univ-lr.fr. The deadline for receiving nominations is December 15, 2023 but early submissions are strongly encouraged.

5) ICDAR 2023: ScalDoc workshop report

The ICDAR 2023 workshop on Scaling-up Document Image Understanding aimed at opening the discussion on possible ways to widen our community through the use of data preparation efforts and the definition of large-scale (grand) challenges that drive progress in the field.
Fruitful discussions led to the creation of a Slack channel to gather academic and industrial researchers willing to collaborate on the creation of a “dataset of datasets”.

Please find full report here: https://iapr-tc10.univ-lr.fr/wp-content/uploads/2023/11/ScalDoc-Textual-Report.pdf

This effort should enable the training and evaluation of new foundation models for document understanding.

A first meeting have been scheduled on Wed. 2023-09-13, at 19:00 UTC+2
Don’t hesitate and join us, we are already more than 20!
Slack invitation: https://bit.ly/datasetofdatasets

6) ICDAR 2024: Journal-first Track Call for Papers and Reviewers

Following a feature of ICDAR2019 through ICDAR2023, ICDAR 2024 will again include the option of a journal track that offers the rapid turnaround and dissemination times of a conference while providing the paper length, scientific rigor, and careful review process of an archival journal.

The submission site in Springer is now open: https://www.springer.com/journal/10032/updates/26200090

Important Dates

Submissions Due: November 14, 2023 (firm deadline)

Note that the deadline has been moved to 14 November 2023 because of the delay in opening this site. Please check the above website for further submission details and important dates.

For more details and guidelines, please visit: https://www.springer.com/journal/10032/updates/26200090

Reviewers

We would like to encourage anyone interested in doing reviews for this
interesting journal track to contact Prof. Elisa Barney elisa.barney@ltu.se.

7) ICDAR 2024 Call for Tutorials

Call for Tutorials for ICDAR 2024

The ICDAR 2024 Organizing Committee invites proposals for tutorials that
will be held on August 30th to September 4th (the correct final date will
be communicated as soon as possible), before the main conference begins.

Important Dates

Proposals Due: Nov. 30, 2023
Acceptance Notification: Dec. 23, 2023
Dates of Tutorials: Aug. 30 – Sep. 4, 2024

ICDAR 2024 Tutorials should serve one or more of the following objectives:

Introduce students and newcomers to major topics of Document Analysis
and Recognition (DAR) research.
Provide instructions on established practices and methodologies.
Introduce expert non-specialists to a DAR subarea. Survey a mature
area of DAR research and/or practice.
Motivate and explain a DAR topic of emerging importance.
Overview DAR systems for industrial solutions (suggestion for
researchers in industry).
Introduce some recent innovative techniques for DAR research and
software quality, such as open-source libraries, high-level API, technical
frameworks for expert developments, etc. (suggestion for expert
programmers).

An ICDAR tutorial should aim to give a comprehensive overview of a specific
topic related to DAR. A good tutorial should be educational rather than
just a cursory survey of techniques. The topic should be of sufficient
relevance and importance to attract significant interest from the ICDAR
community. Typical tutorial audiences consist of PhD students studying
computer vision, image processing or pattern recognition, but also include
researchers and practitioners from both academia and industry. In order to
facilitate innovative collaboration and interaction between researchers in
academia and industry, the Tutorial Chairs strongly encourage proposals for
industrial tutorials, in which researchers in companies describe DAR
systems and overview industrial solutions to document analysis problems in
real use-case industrial scenarios.

Proposals should be up to 4 pages in length, and should contain the
following information:

Title of the tutorial.
Scope and motivation. A brief description of the tutorial, suitable
for inclusion in the conference registration brochure.
Preference for the duration (full day or half day). Due to agenda
constraints, half day tutorials are recommended. If a full day is
needed,provide a brief justification.
A detailed outline of the tutorial. Course description with list of
topics to be covered, along with a brief outline.
Relevance for ICDAR. A description of why the tutorial topic would be
of interest to a substantial part of the ICDAR audience.
Expected target audience in terms of composition and estimated number
of attendees. Prerequisite knowledge of the ICDAR audience for attending
the tutorial.
Short CV of organizers. A brief CV of the presenter(s), including
name, postal address, phone number, e-mail address, web page, background in
the tutorial area (projects, relevant publications or tutorial-level
articles on the subject), evidence of teaching experience.
The name and e-mail address of the corresponding presenter. The
corresponding presenter should be available for e-mail correspondence
during the evaluation process, in the case clarifications and discussions
on the scope and content of the proposal are needed.

Evaluation
The evaluation of the proposal will take into account its general interest
for ICDAR attendees, the quality of the proposal (e.g., a tutorial that
simply lists a set of concepts without any apparent rationale behind them
will not be approved) as well as the expertise and skills of the
presenters. We emphasize that the primary criteria for evaluation will be
whether a proposal is interesting, well-structured, and motivated in
relation to Document Analysis and Recognition, rather than the perceived
experience/standing of the proposer. Last but not least, the tutorial
should attract a meaningful audience, cover hot topics and incorporate new
knowledge to the community. Those submitting a proposal should keep in mind
that tutorials are intended to provide an overview of the field; they
should present reasonably well established information in a balanced way.
Tutorials should not be used to advocate a single avenue of research, nor
should they promote a product.

Notes
Tutorial slides must be provided to us for inclusion on the conference
website and also on the TC-10 and TC-11 websites, as educational material.
The ICDAR main conference organizers will handle the tutorial registration
and provide the space, coffee breaks and other facilities required to
organize tutorials (e.g. a room, a projector and a screen).

Submission Guidelines & Inquiries
All proposals should be submitted by electronic mail to the Tutorial
Chairs: – Alicia Fornes afornes@cvc.uab.es – Vincent Christlein
vincent.christlein@fau.de

Feedback, comments and/or suggestions would be provided within two weeks of
receiving the proposal. Final acceptance (or rejection) would be decided by
December 23, 2023.
Inquiries should be sent to tutorials-chairs@icdar2024.net or the above
emails.

8) SSDA 2023: Report of the last summer school on document analysis

We are happy to announce that the 5th IAPR TC10/TC11 summer school on document analysis was a great success! It took place from July 3 to 7, 2023 in Fribourg and Moléson, Switzerland. 21 PhDs and young researchers from 8 different countries benefited from high-quality lectures by experts in the field:

• Apostolos Antonacopoulos, University of Salford, United Kingdom, Large-scale Recognition of Information-rich Documents: From Unreadable Data to Structured Information
• Jean-Christophe Burie, University of La Rochelle, France, Analysis and understanding of comics: From the detection of basic elements to the creation of semantic links with classic and deep
learning-based approaches.
• Gernot Fink, TU Dortmund University, Germany, Deep Learning for Word Spotting: Foundations and Current Developments.
• Andreas Fischer, University of Fribourg and University of Applied Sciences and Arts Western Switzerland, Structural methods for document analysis and recognition: From rule-based models to data-driven deep learning.
• Alicia Fornes, Universitat Autonoma de Barcelona and CVC, Spain, Handwriting Recognition in Low Resource Scenarios.
• C.V. Jawahar, Towards a deeper understanding of documents, III-T Hyderabad, India.
• Koichi Kise, Osaka Prefecture University, Japan, Reading of Reading for Actuating: Augmenting Human Learning by Experiential Supplements.
• Rich Kent, CTO of Taina, UK, You can’t hide from tax anymore!, A real world example of how one company has used document analysis and recognition to change the tax industry

The video of some lectures will be available soon on the website: https://ssda2023.isc.heia-fr.ch/.

Alongside the courses, participants were able to actively take part during a pitch and a poster session. A handwriting recognition competition was also held. Many social activities, scientific exchanges and campfires were organized, helping to create and/or strengthen links between participants.

Congratulations to Marco Peers, TU Vienna, who won the best poster award and the prize for excellence.

Finally, we would like to thank all the organizers and sponsors who made this event possible. Special thanks to IAPR for funding 4 grants and reducing accommodation costs for students to DIVA, DIUF, Centenary Fund from University of Fribourg and HEIA for their financial and administrative contributions. Thank you to the speakers for your time and knowledge sharing.

See you soon for the 6th edition!

Anna Scius-Bertrand,
SSDA 23 organizing committee.

9) SSDA 2025: Call for proposal for the next summer school on document analysis

Important Dates

April 5, 2024 Proposal Submission Deadline

Submit Proposals via email to:

Foteini Simistira Liwicki (TC11 Representative), foteini.liwicki@ltu.se
Momina Moetesum (TC10 Representative), momina.moetesum@seecs.edu.pk

Previous events:

As a reference, the 2023 Summer School on Document Analysis was held in Fribourg, Switzerland (URL: https://ssda2023.isc.heia-fr.ch/ )

10) DAS 2024: Call for Proposals

Deadline: November 30, 2023

Submission Method: email to jean-christophe.burie@univ-lr.fr and andreas.fischer@unifr.ch

Document Analysis Systems (DAS) is an IAPR sponsored workshop focusing on system-level issues and approaches. In DAS 2022, it was decided by the participants to hold DAS as a satellite workshop with annual ICDAR starting from 2024 onwards. We are seeking proposals to host the 16th Document Analysis Systems workshop co-located with ICDAR in 2024.

Anyone interested in submitting a proposal to host DAS 2024 in Greece should drop an email to jean-christophe.burie@univ-lr.fr and andreas.fischer@unifr.ch by November 30, 2023.

We are looking forward to receiving high quality proposals and making the first chapter of DAS-ICDAR 2024 a big success!

Jean-Christophe Burie (Chair, TC10)
Andreas Fischer (Chair, TC11)

11) ICPR 2024: Call for Papers

Greetings from the ICPR-2024 organizing committee.

The International Conference on Pattern Recognition (ICPR) is the flagship conference of the International Association of Pattern Recognition and the premier conference in Pattern Recognition, covering Computer Vision, Machine learning, Image, Speech, Sensor Pattern Processing etc. ICPR-2024 is the 27th event of the series which will be held at Kolkata, India during December 1-5, 2024. This conference provides a great opportunity to nurture new ideas and collaborations for students, academics, and industry researchers. ICPR-2024 will cover the following six tracks:

– Artificial Intelligence, Pattern Recognition and Machine Learning

– Computer and Robot Vision

– Image, Speech, Signal and Video Processing

– Biometrics and Human Computer Interaction

– Document Analysis and Recognition

-Biomedical Imaging and Bioinformatics

The main conference has several highlights, including keynotes by top experts, invited talks by academic and industry professionals, oral paper presentation, poster paper presentation, etc. ICPR-2024 will also have many workshops and Tutorials.

Prospective authors are invited to submit papers.

Important deadlines:

Paper submission open: January 20, 2024
Paper submission deadline: March 20, 2024
Reviews sent to authors (Acceptance/rejection/Revision): June 20, 2024
Revision/rebuttal submission deadline: July 10, 2024
Final Acceptance notification: August 5, 2024
Camera-ready submission: August 31, 2024

Submission and Review:

ICPR-2024 will follow a single-blind review process. Authors can include their names and affiliations in the manuscript.

Paper Format and Length:

Springer LNCS paper formatting instructions and templates for ICPR-2024 are available in the icpr2024 website (https://icpr2024.org/cfp.html)

The PDF version of the CFP has been attached for your kind reference. Please visit the ICPR-2024 website icpr2024.org for more details.

Contact:

For any enquiry please contact the ICPR-2024 Secretariat via email at icpr2024@gmail.com and icpr2024@isical.ac.in

Please also circulate it among your colleagues.

We look forward to your participation in ICPR-2024.

With regards

Umapada Pal, Indian Statistical Institute, Kolkata, India

Josef Kittler, University of Surrey, UK

Anil Jain, Michigan State University, USA

(ICPR-2024 General Chairs)

Rama Chellappa, Johns Hopkins University, USA

Apostolos Antonacopoulos, University of Salford, UK

Cheng-Lin Liu, Institute of Automation of Chinese Academy of Sciences, China

Subhasis Chaudhuri, Indian Institute of Technology Bombay, India

(ICPR-2024 Program Chairs)

12) ICDAR 2027: Call for Hosting Proposals

Deadline: January 31, 2024

Submission Method: Email to the TC10/11 chairs Andreas Fischer andreas.fischer@hefr.ch and Jean-Christophe Burie jean-christophe.burie@univ-lr.fr

The International Conference on Document Analysis and Recognition (ICDAR) is the flagship event of TC10/11. It has been established as a bi-annual conference in 1991 and is now held annually, starting 2023. The aim of ICDAR is to bring together international experts to share their experiences and to promote research and development in all areas of Document Analysis and Recognition. The ICDAR Advisory Board is seeking proposals to host the 21st International Conference on Document Analysis and Recognition, to be held in 2027 (ICDAR 2027).

Any consortium interested in making a proposal to host an ICDAR should first familiarise themselves with the “Guidelines for Organizing and Bidding to Host ICDAR” document which is available on the TC10 and TC11 websites (https://iapr-tc10.univ-lr.fr and http://www.iapr-tc11.org, respectively).

A link to the most current version of the guidelines appears below. Please check on the website of TC11 for the latest version.

Note that the current guidelines still refer to a bi-annual event. An updated version is coming soon that will reflect the change to an annual conference.

http://www.iapr-tc11.org/mediawiki/images/ICDAR_Guidelines_2016_02_27.pdf

The submission of a bid implies full agreement with the rules and procedures outlined in that document.

The submitted proposal must define clearly the items specified in the guidelines (Section 5.2).

It has been the tradition that the location of ICDAR conferences follows a rotating schedule among different continents. Hence, proposals from the Americas are strongly encouraged. However, high quality bids from other locations, for example, from countries where we have had no ICDAR before, will also be considered. Proposals will be examined by the ICDAR Advisory Board.

Proposals should be emailed to the TC10/11 chairs Andreas Fischer andreas.fischer@hefr.ch and Jean-Christophe Burie jean-christophe.burie@univ-lr.fr by January 31, 2024.

ICDAR Advisory Board,
Andreas Fischer (Chair, TC11), Jean-Christophe Burie (Chair, TC10), Anna Esposito (Chair, IAPR C&M), Koichi Kise, Dimosthenis Karatzas, Srirangaraj Setlur

13) Open Call for Organizing DAR Events

In addition to specific calls for bids to host one of the events, we encourage teams to announce their interest in organizing one of the following events:

ICDAR: International Conference on Document Analysis and Recognition (annually; next possibility in 2027)
DAS: International Workshop on Document Analysis Systems (satellite event of ICDAR in even years; next possibility in 2024)
GREC: International Workshop on Graphics Recognition (satellite event of ICDAR in odd years; next possibility in 2025)
SSDA: Summer School on Document Analysis (biannually in odd years; next possibility in 2025)

Jean-Christophe Burie (Chair, TC10)
Andreas Fischer (Chair, TC11)

14) Special Issue: “Advanced Topics in Document Analysis and Recognition”

Editorial from Koichi Kise, Richard Zanibbi, Rajiv Jain & Gernot A. Fink:

The ongoing advancement of deep learning techniques, such as the Transformer and Large Language Models, continues to enhance both the accuracy and efficiency of methods in the field of Document Analysis and Recognition, while also broadening its scope. The primary objective of this Special Issue is to keep pace with these developments and showcase the latest advancements in document analysis and recognition. Upholding our tradition established for journal track papers from the ICDAR conference with issues released in 2019 and 2021, we present the third edition of the Special Issue, under the same title.

Despite the persistent effects of the COVID-19 pandemic, we initiated a call for papers, disseminating this widely through the web pages of IJDAR and ICDAR. By November 2022, we received 33 submissions that were deemed relevant to the scope of our work. Each submission was assigned to one of ourselves as guest editor, with careful attention paid to avoiding any conflicts of interest. We procured reviews from experts in the field, adhering to the journal’s standard practices. After a rigorous review process, often involving two or three rounds, we accepted 13 papers for publication in this special issue. These papers reflect both the breadth and depth of current research in the field of Document Analysis and Recognition.

The accepted papers can be grouped into several categories: document image processing and classification (two papers), historical document analysis (four papers), character recognition (one paper), online handwriting analysis (two papers), layout analysis (two papers), and applications (two papers). We will briefly summarize the accepted papers in… See more

15) IJDAR article alert (vol. 26, issue 4)

Volume 26, issue 4, December 2023. Please find below the 5 articles:

Trajectory-based recognition of in-air handwritten Assamese words using a hybrid classifier network
Ananya Choudhury & Kandarpa Kumar Sarma

HWNet v3: a joint embedding framework for recognition and retrieval of handwritten text
Praveen Krishnan, Kartik Dutta & C. V. Jawahar

An end-to-end pipeline for historical censuses processing – open access
Rémi Petitpierre, Marion Kramer & Lucas Rappo

A brief review of state-of-the-art object detectors on benchmark document images datasets
Trong Thuan Nguyen, Hai Le, Truong Nguyen, Nguyen D. Vo & Khang Nguyen

Review of chart image detection and classification
Filip Bajić & Josip Job

16) Job offers – 1 new

Post-doctoral position at the Computer Vision Center, Barcelona

We are seeking two postdoctoral researchers to join the Vision, Language and Reading group at the Computer Vision Center (CVC), in Barcelona, Spain, one focused on FEDERATED LEARNING AND DIFFERENTIAL PRIVACY and another in COMPUTER VISION.

The position is available for a minimum of 2 years and is linked to the “European Lighthouse on Secure and Safe AI” (ELSA), a European Project funded by Horizon Europe and backed by the ELLIS network of excellence. The project covers research topics that include robustness, privacy and human agency and will develop use cases in areas such as autonomous driving, robotics, health and document intelligence.

CANDIDATE’S PROFILE

The candidate should possess a PhD in machine learning or computer vision and have a strong publication record. We are looking for candidates who have publications in top conferences like CVPR, ECCV, ICCV, ICDAR, NeurIPS, ICML, ICLR.

The candidate should have a strong background in machine learning and computer vision. Experience on document image analysis and/or visual question answering would be positive. The applicants are expected to be fluent in both oral and written communication in English. They should work well in a team while demonstrating initiative and independence. The candidate is expected to co-supervise PhD students.

The successful candidate is expected to contribute to the design and development of AI solutions for document understanding, employing privacy preserving techniques and infrastructures set up by the ELSA project.

THE COMPUTER VISION CENTER

The selected candidate will work in the Computer Vision Centre (CVC), Barcelona, a research institute comprising more than 130 researchers and support staff, dedicated to computer vision research and knowledge transfer. With a strong international projection and links to the industry, the Computer Vision Centre offers an exciting environment for scientific career development. The Computer Vision Centre has a plan for expansion of its permanent research staff base and has received the “HR Excellence in Research” award as a provider and supporter of a stimulating and favourable working environment.

The direct responsible for these posts will Dr Dimosthenis Karatzas, leading the Vision, Language and Reading research group at the CVC.

Barcelona is a vibrant city and an important Artificial Intelligence hub. The high quality of life is combined with an open and international looking character of the city. Barcelona is very well connected by air, sea and ground transportation. The region of Catalonia boosts its own AI strategy, in which the CVC is a key player.

RESEARCH CONTACT

If you are interested in the position, please contact Dr Dimosthenis Karatzas for more information and applications (dimos@cvc.uab.es)

APPLICATION PROCESS

Apply by filling in the online form at:
Computer Vision: http://www.cvc.uab.es/blog/2023/01/11/postdoc-position-in-computer-vision/
FL and DP: http://www.cvc.uab.es/blog/2023/01/11/postdoc-position-in-computer-vision/

MORE INFO

ELSA project: https://elsa-ai.eu/
Computer Vision Center: http://www.cvc.uab.es/
Vision, Language and Reading group: https://www.vlr.ai/

Ph.D Student : Multimodal models for Document Image Understanding, LITIS, France

Link: https://www.litislab.fr/en/job/phd-student-multimodal-models-document-image-understanding

Keywords

Deep Learning, Vision, OCR, Document Understanding, Natural language processing

Context and main objectives

The digital transformation of libraries, which has been based on OCR (Optical Character Recognition) technology for more than 20 years, faces some limitations both in terms of quality, due to the diversity of the collections and the limitations of OCR technology, and in terms of added value due to a lack of structuring and high-level indexing. Named entity extraction is still little used because it relies on language processing technologies, which were not very adaptable until recently. More generally, the semantic indexing of collections is underdeveloped and integrated with metadata. We propose to develop multimodal models (text + image) for the extraction of information from collections of digitized documents in large libraries. The literature shows that work in this direction is still underdeveloped, and that it is mainly aimed at processing commercial documents (invoices etc…).

The proposed project aims to disrupt the traditional sequential document processing workflow by combining Vision models and Large Language Models (LLM) to provide a more streamlined and efficient approach. The standard two-stages architectures based on OCR + NER (Optical Character Recognition, Named Entity Recognition) are now giving way to end-to-end multimodal approaches known as Document Understanding, which are more versatile and easily adaptable to new corpora, making it easier and more cost-effective to set up and run document processing projects. As a result, this accessible, user-friendly approach will democratize access to advanced AI technologies for a wider range of institutions, contributing to the evolution of the technology value chain in the Libraries, Archives and Museums (LAM) sector and opening up new opportunities for research and discovery.

The proposed work program funded by the FINLAM project (Foundation INtegrated models for Libraries Archives and Museum, ANR 2023), relies on the expertise of LITIS to study the most relevant multimodal architectures to integrate the language knowledge conveyed by the large language models developed recently and to study the modalities of specialization/adaptation of these models in conjunction with the learning of a generic optical encoder, benefiting from the annotated collections available at the French national library (Bibliothèque nationale de France – BnF). User interaction will be considered according to different scenarii of closed and open queries.

Application

Thierry Paquet, Thierry.Paquet@univ-rouen.fr

Pierrick Tranouez, Pierrick.Tranouez@univ-rouen.fr

Clément Chatelain, Clement.Chatelain@insa-rouen.fr

[IAPR-TC10] Newsletter 155 – May 2023

Posted on 24/05/202331/05/2023 by Christophe Rigaud

Welcome to the May edition of the TC10 newsletter.

In this issue, you will find several announcements related to ICDAR 2023 (registration is open): the call for doctoral consortium, the call for nominations for outstanding achievements award and the call for hosting ICDAR 2026. GREC 2023 post conference workshop will be held on August 24^th in San José, California, USA. Registration to the next summer school on document analysis (SSDA) is also open and will take place in Fribourg, Switzerland this coming July. Finally, please find the summary of the last IJDAR issue and a new job offer in Barcelona.

I wish you a pleasant reading,

Christophe Rigaud
IAPR-TC10 Communications Officer

Table of content:

1) Upcoming deadlines and events
2) ICDAR 2023: Call for Doctoral Consortium
3) ICDAR 2023: Call for Nominations for the IAPR/ICDAR Outstanding Achievements Award
4) ICDAR 2026: Call for Hosting Proposals
5) GREC 2023 – repost
6) 5^th TC10/TC11 Summer School on Document Analysis (SSDA 2023)
7) 1^st Workshop on Automatically Domain-Adapted and Personalized Document Analysis (ADAPDA)
8) 27^th International Conference on Theory and Practice of Digital Libraries
9) IJDAR paper alert
10) Job offers – 1 new

Call for contributions: feel free to contribute to TC10 newsletters, by sending any relevant news, event, notice, open position, dataset or link to us on iapr.tc10[at]gmail.com

1) Upcoming deadlines and events

2023

Deadlines:
- June 5, full paper submission deadline TPDL 2023
- June 26, doctoral consortium application deadline ICDAR 2023
- June 30, hosting proposal due ICDAR2026
Events:
- July 3-7, 2023, summer school SSDA 2023, Fribourg, Switzerland
- August 21-23, conference ICDAR 2023, San José, California, USA
- August 24, doctoral consortium of ICDAR 2023, San José, California, USA
- August 24, workshop GREC 2023, San José, California, USA

2024 and later

Events:
- September 2024, conference ICDAR 2024, Athens, Greece

2) ICDAR 2023: Call Doctoral Consortium

In 2011, the first Doctoral Consortium in the Document Analysis community was organized in conjunction with the International Conference on Document Analysis and Recognition (ICDAR). This has led to successful successor events at ICDAR 2013, ICDAR 2015, ICDAR 2017, ICDAR 2019, and ICDAR 2021. The tradition of having a Doctoral Consortium as a satelite event to the ICDAR main conference will further be continued at ICDAR 2023 in San José, California, USA.

The goal of the ICDAR 2023 Doctoral Consortium is to create an opportunity for PhD students to test their research ideas, present their current progress and future plans, and receive constructive criticism and insights related to their future work and career perspectives. A mentor (a senior researcher who is active in the field) will be assigned to each student to provide individual feedback. In addition, students will have the opportunity to present an overview of their research plan during a special poster session.

Participation in the ICDAR 2023 Doctoral Consortium will be limited to 25 students. Prospective participants are encouraged to submit their application by June 26, 2023. The Doctoral Consortium Chairs will then review all applications received. Preference will be given to students who are at a stage in their studies most likely to benefit (i.e., they have identified a research direction and published some initial results, but the thesis is not yet set in stone).

Participation to the Doctoral Consortium will be free for all accepted students. There will be no extra registration fees.

The ICDAR 2023 Doctoral Consortium will take place the day after the main conference, i.e., on Thursday, August 24.

SUBMISSION PROCEDURE

Students willing to participate should submit an application in a SINGLE PDF file. The submission should be prepared using the instructions and style files for Springer computer science proceedings. The LaTeX template for LNCS can be downloaded here. It is also available on Overleaf.

The PDF file should contain the following information:

Student’s name
University
Title of your thesis
Supervisor of the thesis
Starting and expected finalization date of the PhD
Short research plan (1-2 pages about the work)
Short CV (1-2 pages)

The research plan should contain an overview of the PhD topic, the steps made so far (including a list of publications), and the actions planned before finishing the PhD, especially novel research ideas to be pursued.

Submission should be done before June 26, 2023 to: https://easychair.org/conferences/?conf=dcicdar2023

IMPORTANT DATES

Submission deadline: June 26, 2023
Acceptance notification and Mentor assignment: July 10, 2023
Mentoring period: July 10 to August 24, 2023
Doctoral consortium: August 24, 2023

We are looking forward to your participation!

Andreas Fischer and Veronica Romero, Doctoral Consortium Chairs, dc-chairs@icdar2023.org

3) ICDAR 2023: Call for Nominations for the IAPR/ICDAR Outstanding Achievements Award

Important Dates

Nominations Due:  April 30, 2023

Research
Training of students
Research/Industry interaction
Service to the community

For ICDAR 2023, nominations are invited for the following award category:

IAPR/ICDAR Outstanding Achievements Award

The award will consist of a token gift and a suitably inscribed certificate. The recipient will be invited to give the opening keynote speech at the ICDAR 2023 conference, introduced by the previous recipient of the award.

The nomination pack should include the following:

A nominating letter (1 page) including a brief citation to be included in the certificate.
Supporting letters (1 page each) from 3 active researchers from at least 3 different countries.

The submission procedure is strictly confidential, and self-nominations are not allowed.

Please send nominations packs electronically to the TC10/11 chairs: Andreas Fischer andreas.fischer@hefr.ch and Jean-Christophe Burie jean-christophe.burie@univ-lr.fr. The deadline for receiving nominations is April 30, 2023 but early submissions are strongly encouraged.

4) Call for Hosting Proposals: ICDAR 2026

Deadline: June 30, 2023

Submission Method: Email to the TC10/11 chairs Andreas Fischer andreas.fischer@hefr.ch and Jean-Christophe Burie jean-christophe.burie@univ-lr.fr

The International Conference on Document Analysis and Recognition (ICDAR) is the flagship event of TC10/11. It has been established as a bi-annual conference in 1991 and is now held annually, starting 2023. The aim of ICDAR is to bring together international experts to share their experiences and to promote research and development in all areas of Document Analysis and Recognition. The ICDAR Advisory Board is seeking proposals to host the 20th International Conference on Document Analysis and Recognition, to be held in 2026 (ICDAR 2026).

A link to the most current version of the guidelines appears below. Please check on the website of TC11 for the latest version.

Note that the current guidelines still refer to a bi-annual event. An updated version is coming soon that will reflect the change to an annual conference.

http://www.iapr-tc11.org/mediawiki/images/ICDAR_Guidelines_2016_02_27.pdf

The submission of a bid implies full agreement with the rules and procedures outlined in that document.

The submitted proposal must define clearly the items specified in the guidelines (Section 5.2).

Proposals should be emailed to the TC10/11 chairs Andreas Fischer andreas.fischer@hefr.ch and Jean-Christophe Burie jean-christophe.burie@univ-lr.fr by June 30, 2023.

ICDAR Advisory Board,
Andreas Fischer (Chair, TC11), Jean-Christophe Burie (Chair, TC10), Anna Esposito (Chair, IAPR C&M), C.V. Jawahar, Dimosthenis Karatzas, Koichi Kise

5) GREC 2023 – repost

15th International Workshop on Graphics Recognition (GREC)

August 24, 2023
San Jose, USA - post conference workshop of ICDAR2023
http://grec2023.univ-lr.fr/

GREC workshops provide an excellent opportunity for researchers and practitioners at all levels of experience to meet colleagues and to share new ideas and knowledge about graphics recognition methods. Graphics Recognition is a sub-field of document image analysis that deals with graphical entities in engineering drawings, comics, musical scores, sketches, maps, architectural plans, mathematical notation, tables, diagrams, etc.

The aim of this workshop is to maintain a very high level of interaction and creative discussions between participants, maintaining a “workshop” spirit, and not being tempted by a “mini-conference” model.

GREC 2023 will continue the tradition of past workshops held at the Penn State University (USA), Nancy (France), Jaipur (India), Kingston (Canada), Barcelona (Spain), Hong Kong (China), Curitiba (Brazil), La Rochelle (France), Seoul (Corea), Lehigh (USA), Nancy (France), Kyoto (Japan), Sydney (Australia) and Lausanne (Switzerland).

The workshop will comprise several sessions dedicated to specific topics related to graphics in document analysis and graphic recognition. For each session, there will be an invited presentation describing the state of the art and stating the open questions for the session’s topic, followed by a number of short presentations that will contribute by proposing solutions to some of the questions or presenting results of the speaker’s work. Each session will be concluded by a panel discussion.

Important dates

Abstract submission: April 17th, 2023
Full paper submission: April 24th, 2023 – 11:59PM Pacific Time Zone

Acceptance notification: May 24th, 2023
Camera ready due: May 31th, 2023

Workshop: August 24th, 2023

More information at: http://grec2023.univ-lr.fr/
Registration is open: https://icdar2023.org/registration/

General Chair: Jean-Christophe Burie
Program Co-Chair: Nathalie Girard, Jorge Calvo-Zaragoza, Samit Biswas

6) 5th TC10/TC11 Summer School on Document Analysis (SSDA 2023)

Website: https://ssda2023.isc.heia-fr.ch
Dates: July 3-7, 2023
Location: Fribourg and Moléson-sur-Gruyères, Switzerland

Important Dates

May  4, 2023  Registration open
May 22 June 8, 2023  Application for financial assistance
Jun  5, 2023  End for Early bird registration
Jun 23, 2023  Registration closed

We are pleased to announce that the 5th TC10/TC11 Summer School on Document Analysis and Recognition (SSDA), endorsed by TC-10 (Graphics Recognition) and TC-11 (Reading Systems), will be held in Switzerland from July 3 to 7, 2023.

SSDA 2023 continues the tradition of the past summer schools held in Jaipur (India) in 2017, La Rochelle (France) in 2018, Islamabad (Pakistan) in 2019, and Luleå (Sweden) in 2021.

A link for online registration will be available here on May 4, 2023. As it will be a in person event, the number of places is limited. First come, first served. We would like to welcome you all, but in case all the places are taken, a waiting list will be set up. Several grants are available for students, specifically for students coming from countries with low financial status. More information here. The deadline for submitting an application for financial support is May 22, 2023.

The scientific theme of the SSDA 2023 is Statistical and Structural Methods for Document Analysis. The theme highlights the importance of both statistical and structural pattern recognition approaches. The latter plays a central role for understanding not only the individual elements of a document but also their relations. With the advent of Transformers for sequence analysis and geometric deep learning for graph-based analysis, new perspectives are opening up in the field of structural deep learning. Topics of interest include document image processing, indexing and retrieval of documents, handwriting recognition, historical document analysis, document analysis for literature search, document summarization and translation, and document understanding, among others.

The SSDA is also a great occasion to get in touch with experienced researchers from the field of Document Analysis and Recognition (DAR). The following invited speakers have already agreed to hold a lecture at the summer school (list is still subject to change):

Apostolos Antonacopoulos, University of Salford, United Kingdom
Jean-Christophe Burie, University of La Rochelle, France
Véronique Eglin, INSA, France
Gernot Fink, TU Dortmund University, Germany
Andreas Fischer, University of Fribourg and University of Applied Sciences and Arts Western Switzerland
Alicia Fornès, Universitat Autònoma de Barcelona, Spain
C.V. Jawahar, International Institute of Information Technology Hyderabad, India

Besides traditional lectures, a focus will be put on practical sessions, with a programming competition in document analysis. The summer school is also an opportunity to meet young and not so young researchers in the community. Therefore, participants are asked to prepare a short pitch and a poster about their research.

Two awards will be presented on the last day of the summer school: the Best Poster Award and the Excellence Award, both with a financial reward. The winner of the Excellence Award will be selected among the participants for their knowledge, collaboration ability, and their performance in the competition.

In addition, to create and reinforce interactions between participants, several social activities will be organized depending on the weather, such as a visit of an alpine cheese factory, a visit of the medieval town of Gruyère and its castle, and a panoramic walk up to 2000 meters of altitude to the summit of the Moléson mountain.

As Switzerland’s cost of living is expensive, in order to reduce the fees for students and to provide a stimulating work environment, the SSDA will be organized as follows: on the first and the last day (Monday and Friday), it will take place on the university campus in Fribourg. During the week (Monday evening until Friday morning), the summer school will be held in a chalet in the mountains near Fribourg. The registration fees are all-inclusives: scientific program, social activities, accommodation, all meals and coffee breaks.

Registration: https://ssda2023.isc.heia-fr.ch/registration/

Contact: ssda2023@hefr.ch

7) 1st Workshop on Automatically Domain-Adapted and Personalized Document Analysis (ADAPDA)

August 25, 2023
San Jose, USA - post conference workshop of ICDAR2023
Website: https://sites.google.com/view/adapdaicdar23

SCOPE

Document Analysis (DA) technologies are becoming increasingly pervasive in our daily life due to the digitalization of documents (both in the cultural and industrial domains) and the widespread use of paper tablets, pads, and smartphones to take notes and sign documents. In this respect, high-performing DA algorithms are needed that are able to deal with digitalized documents from different writers, in different languages (including ancient languages, modern slang terms, or writer-preferred abbreviations and symbols), and with different visual characteristics (due to the paper support and the writing tool), often very peculiar to the application domain. In this respect, domain adaptation and automatic personalization strategies are worth investigating to boost the performance of DA techniques in the scenarios mentioned above, which are of great cultural, practical, and economic interest. Nonetheless, in some application contexts, writer-specific data may contain sensitive information (either personal or business-related). In the sight of this, specific privacy-preserving solutions and lightweight adaptation strategies that can be performed onboard on personal devices must be considered when designing personalized DA techniques. This workshop aims at gathering expertise and novel ideas for personalized DA tasks such as, for example, Handwritten Text Recognition, Handwritten Text Generation, Writer Identification, Writer and Signature Verification, and Handwriting Analysis. In particular, it welcomes contributions on training and adaptation strategies of writer, language, and visual-specific models, new benchmarks, and data collection strategies to explore the mentioned tasks in a personalized setting, as well as related works on the personalized DA topic.

TOPICS

The workshop calls for submissions addressing, but not limited to, the following topics:

○ Adaptation for on-line Handwritten Text Recognition
○ Adaptation for off-line Handwritten Text Recognition
○ Adaptation for Handwritten Text Generation
○ Adaptation for Document Analysis System
○ Domain-specific text and symbol recognition
○ Adaptation for Human document interaction
○ Adaptation for Mobile text recognition
○ Domain-specific document analysis
○ Privacy-preserving domain adaptation of handwritten data
○ Writer, Language, and Visual adaptation of Document Analysis models
○ Novel benchmarks and datasets for personalized Document Analysis

IMPORTANT DATES

○ 20/05/2023 AoE: Paper submission deadline
○ 28/05/2023: Paper decisions to authors
○ 04/06/2023 AoE: Camera-ready deadline

SUBMISSION INSTRUCTIONS

Submissions should follow the Springer Lecture Notes in Computer Science (LNCS) format and can be either:

○ Short papers in the range 2-6 pages (references excluded)
○ Long papers in the range 6-12 pages (references excluded)

presenting on-going projects, datasets, final or preliminary results as well as innovative methodologies and tools.

You can submit your manuscript here: https://cmt3.research.microsoft.com/ADAPDA2023

8) 27th International Conference on Theory and Practice of Digital Libraries

Dates: 26-29 September 2023
Full paper deadline: 5 June 2023 - extended
Short paper deadline: 12 June 2023
Innovation paper: 12 June 2023
Website: https://tpdl2023.dei.unipd.it/index.html

The International Conference on Theory and Practice of Digital Libraries (TPDL) is a yearly date for researchers on Digital Libraries and related topics. For over twenty-five years TPDL has been an international reference forum focused on digital libraries and associated technical, practical, and social issues. TPDL encompasses the many meanings of the term “digital libraries”, including new forms of information institutions; operational information systems with all manner of digital content; new means of selecting, collecting, organizing, and distributing digital content; and theoretical models of information media, including document genres and electronic publishing. Digital libraries may be viewed as a new form of information institution or as an extension of the services libraries currently provide.

Representatives from academia, government, industry, research communities, research infrastructures, and others are invited to participate in this annual conference. The conference draws from a broad and multidisciplinary array of research areas including computer science, information science, librarianship, archival science and practice, museum studies and practice, technology, social sciences, cultural heritage and humanities, and scientific communities.

This year its focus is on bridging the wide field of Research and Information Science with the related field of Digital Libraries. Indeed, TPDL historically approached on “Digital libraries” embracing the field at large also comprehending three key areas of interest that can be synthesized as scholarly communication (e.g. research data, research software, digital experiments, digital libraries), e-science/computationally-intense research (e.g. scientific workflows, Virtual Research Environments, reproducibility) and library, archive and information science (e.g. governance, policies, open access, open science).

Contribution Types

Research Papers (up to 12 pages + unlimited references) present high-quality, original research of relevance to the TPDL community. Submissions should detail their methods and techniques sufficiently to enable replication and reuse. Accepted papers will be published in the conference proceedings and presented as long conference talks.

Practitioner Papers (up to 12 pages + unlimited references) present high-quality applied work of relevance to the TPDL community. Submissions should focus on results of direct relevance to practitioners and institutions in the TPDL community. Methods, tools, and techniques should be detailed sufficiently to enable application by other institutions. Accepted papers will be published in the conference proceedings and presented as long conference talks.

Submission Guidelines

Submissions should detail their methods and techniques sufficiently to enable replication and reuse. Accepted papers will be published in the conference proceedings and presented as long conference talks. Papers must be in the Springer LNCS style. The review process is single-blind, so you can include the author names in the manuscript you submit.

Topics of Interest

Submissions are welcome concerning theory, architectures, data models, tools, services, infrastructures about the following topics (but not limited to):

Publishing science

FAIR data and software
Research objects
Nanopublications
Digital Preservation and Curation
Supporting Science Reproducibility
Metadata
Research Data Management
Research Output Management

Data

Data Search
Research Data Discovery
Data citation
Data and Information Lifecycle (creation, store, share, and reuse)
Data and Document Provenance
Linked Data and Open Data
Data Repositories and Archives
Data and Research Infrastructure
Data Stewardship
Data cleaning
Data Integration
Data curation

Discovering science

Information Retrieval
Recommendation systems
Document (Text) Analysis in support of discovery
Multimodal and Multilingual Data Access

Monitoring and assessment of science

Science of Science
Scientometrics and bibliometrics
Scholarly Communication Knowledge Graphs

Knowledge creation

AI / Machine Learning/ Data mining for DLs
Knowledge Bases
Entity Extraction and Linking
Ontology development

Digital Humanities

Digital Cultural Heritage
Digital Terminology
Computational Linguistics
Digital History
Digital Archeology
Knowledge Organization for Digital Humanities
Digital Research Methods on Cultural Heritage
Digital interfaces for Digital Humanities Research and Practice

Human-Computer Interaction

User Interface and Experience in Cultural Heritage Institutions
Information Interaction for Cultural Heritage Applications
User Participation and Experience
Information Visualization and Visual Analytics

More info: https://tpdl2023.dei.unipd.it/index.html

9) IJDAR article alert (vol. 26, issue 2)

Volume 26, issue 2, June 2023. Please find below the titles of its 5 articles:

WriterINet: a multi-path deep CNN for offline text-independent writer identification
Chahi, A., El merabet, Y., Ruichek, Y. et al.

Retrieval-based language model adaptation for handwritten Chinese text recognition
Hu, S., Wang, Q., Huang, K. et al.

Tables to LaTeX: structure and content extraction from scientific tables
Kayal, P., Anand, M., Desai, H. et al.

Refocus attention span networks for handwriting line recognition
Hamdan, M., Chaudhary, H., Bali, A. et al.

Adaptive dewarping of severely warped camera-captured document images based on document map generation
Nachappa, C.H., Rani, N.S., Pati, P.B. et al.

10) Job offers – 1 new

Post-doctoral position at the Computer Vision Center, Barcelona

We are seeking one postdoctoral researcher to join the Vision, Language and Reading group at the Computer Vision Center (CVC), in Barcelona, Spain, focused on FEDERATED LEARNING AND DIFFERENTIAL PRIVACY.

CANDIDATE’S PROFILE

THE COMPUTER VISION CENTER

The direct responsible for these posts will Dr Dimosthenis Karatzas, leading the Vision, Language and Reading research group at the CVC.

RESEARCH CONTACT

If you are interested in the position, please contact Dr Dimosthenis Karatzas for more information and applications (dimos@cvc.uab.es)

APPLICATION PROCESS

MORE INFO

ELSA project: https://elsa-ai.eu/
Computer Vision Center: http://www.cvc.uab.es/
Vision, Language and Reading group: https://www.vlr.ai/

[IAPR-TC10] Newsletter 154 – March 2023

Posted on 03/03/202306/03/2023 by Christophe Rigaud

Welcome to the March edition of the TC10 newsletter.

In this issue, you will find the call for papers of GREC 2023 workshop that will be held in conjunction of ICDAR 2023 in San José and the list of other 8 workshops and 19 competitions of ICDAR 2023. The extended call for proposal of the document analysis summer school and its next edition which is already looking for a host as well. Finally, please find the summary of the last IJDAR issue and a new job offers in Barcelona.

I wish you a pleasant reading,

Christophe Rigaud
IAPR-TC10 Communications Officer

Table of content:

1) Upcoming deadlines and events
2) Call for papers of GREC 2023
3) Competitions of ICDAR 2023
4) Workshops of ICDAR 2023
5) Summer school call for proposal SSDA 2023 and 2024 – extended
6) IJDAR paper alert
7) Job offers – 1 new

Call for contributions: feel free to contribute to TC10 newsletters, by sending any relevant news, event, notice, open position, dataset or link to us on iapr.tc10[at]gmail.com

1) Upcoming deadlines and events

2023

Deadlines:
- March 31, hosting proposal due SSDA2023 – extended
- April 17/24, abstract/full paper submission deadline GREC 2023
Events:
- August 21-23, 2023, conference ICDAR 2023, San José, California, USA
- August 24-26, 2023, workshops, tutorials, and doctoral consortium of ICDAR 2023
- August 25, workshop GREC 2023, San José, California, USA

2024 and later

Events:
- September 2024, conference ICDAR 2024, Athens, Greece

2) Call for papers of GREC 2023

15th International Workshop on Graphics Recognition (GREC)

August 25, 2023
San Jose, USA
http://grec2023.univ-lr.fr/

We encourage the authors to submit papers on the topics detailed below.

Topics

– Analysis and interpretation of graphical documents, such as: Engineering drawings, floor plans, mathematical expressions, comics, maps, music scores, patents, diagrams, charts, tables, etc.
– Recognition of graphic elements, such as symbols, logos, stamps, drop caps, drawings, etc.
– Identification and localization of graphical mark-ups and annotations in written documents.
– Raster-to-vector techniques.
– Graphics-based information retrieval.
– Graphics recognition in Comics.
– Historical graphics recognition and indexing.
– Forensics (Writer identification/verification) in graphic documents.
– Description of complete systems for interpretation of graphic documents.
– Datasets and performance evaluation in graphics recognition.
– Authoring, editing, storing and presentation systems for graphics multimedia documents.
– 3D models from multiple 2D views (line drawings).
– Digital ink processing.
– Sketch recognition and understanding.
– Camera-based graphics recognition.
– Graphics recognition in born digital documents.
– Analysis of graphics on new digital interfaces.
– Graphics detection and recognition in real scenes.
– Graphics analysis in medical images.

Guidelines for authors

For this edition, authors are invited to submit two types of paper :
– Full papers describing complete works of research (12-15 pages). They will undergo a rigorous review process with a minimum of 2 reviews considering the originality of work.
– Short papers providing an opportunity to report on research in progress and to present novel positions on graphic recognition (up to 6 pages).

Accepted papers (full and short papers) will be published in a Springer LNCS volume dedicated to all ICDAR workshops.

The submitted papers will respect the same policy and conditions of ICDAR 2023 conference papers. Papers should be formatted according to the instructions and style files provided by Springer. The LaTeX template for LNCS can be downloaded on the GREC Website (see Guidelines for authors at: https://grec2023.univ-lr.fr/index.php/guidelines-for-authors/).

Important dates

Abstract submission: April 17th, 2023
Full paper submission: April 24th, 2023 – 11:59PM Pacific Time Zone

Acceptance notification: May 24th, 2023
Camera ready due: May 31th, 2023

Workshop: August 25th, 2023

More information at: http://grec2023.univ-lr.fr/

General Chair: Jean-Christophe Burie
Program Co-Chair: Nathalie Girard, Jorge Calvo-Zaragoza, Samit Biswas

3) Competitions of ICDAR 2023

Nineteen competitions are now up and running! Please find the list on ICDAR 2023 Competitions webpage and below to visit their websites, discover proposed tasks, datasets and participation deadlines.

Born Digital Video Text QA (BDVT-QA)

Detecting Tampered Text in Images (DTT)

Detection & Classification of Writing Activity in Air

Detection and Recognition of Greek Letters on Papyri
This competition investigates the performance of glyph detection and recognition on a very challenging type of historical document: Greek papyri. The detection and recognition of Greek letters on papyri is a preliminary step for computational analysis of handwriting that can lead to major steps forward in our understanding of this major source of information on Antiquity. It can be done manually by trained papyrologists. It is however a time-consuming task that would need automatising. We provide two different tasks: localization and classification or classification only.The document images are provided by several institutions and are representative of the diversity of book hands on papyri (a millennium time span, various script styles, provenance, states of preservation, means of digitization and resolution).

Document Information Localization and Extraction (DocILE)

Document UnderstanDing of Everything (DUDE) 😎
The ICDAR 2023 competition on Document Understanding of Everything (DUDE) proposes a new dataset for benchmarking Document Understanding systems under real-world settings that have been previously overlooked. In contrast to previous datasets, we extensively source multi-domain, multi-purpose, and multi-page documents of various types, origins, and dates. Importantly, we bridge the yet unaddressed gap between Document Layout Analysis and Question Answering paradigms by introducing complex layout-navigating questions and unique problems that often demand advanced information processing or multi-step reasoning. Finally, the multi-phased evaluation protocol also assesses the few-shot capabilities of models by testing their generalization power to previously unseen questions and domains, a condition essential to business use cases prevailing in the field. Submission deadline: 15 March 2023 AoE.

Harvesting Answers and Raw Tables from Infographics (CHART-Infographics)

Hierarchical Text Detection and Recognition (HierText)

Indic Handwriting Text Recognition (IHTR)

Language-Guided Document Editing (DocEdit)

Reading the Seal Title (ReST)

Recognition of Handwritten Mathematical Expressions (CROHME)

Recognition of Multi-line Handwritten Mathematical Expressions

RoadText Video Text Detection, Tracking and Recognition (RoadText)

Robust Layout Segmentation in Corporate Documents (DocLayNet)

Structured Text Extraction from Visually-Rich Document Images (SVRD)

Text-based Video Question Answering on News Videos (NewsVideoQA)

Video Text Reading Competition for Dense and Small Text (DSText)

Visual Question Answering on Business Document Images (VQAonBD)

4) Workshops of ICDAR 2023

Nine workshops will be held in conjunction with ICDAR2023. Here is a list of their websites for more information and important dates.

Automatically Domain-Adapted and Personalized Document Analysis (ADAPDA)

This workshop aims at gathering expertise and novel ideas for personalized Document Analysis tasks (training and adaptation strategies of writer, language, and visual-specific models, new benchmarks, and data collection strategies), both on-line and off-line, with attention to privacy-preserving solutions.

Camera-Based Document Analysis and Recognition – 10th edition (CBDAR 2023)

The ICDAR 2023 Workshop on Camera-Based Document Analysis and Recognition (CBDAR 2023) will be the successor of the previous nine CBDAR workshops. The CBDAR series has a special focus on the analysis of camera captured documents and text. CBDAR is a forum for presenting up-to-date research, sharing experiences, and fomenting discussions on future directions in camera based document analysis.

Computational Document Forensics – 4th edition (IWCDF)

The Fourth International Workshop on Computational Document Forensics (IWCDF 2023) aims at presenting the most recent theoretical and practical advances related to digital document forgery while fostering discussions between academy and industry.

Computational Paleography – 2nd edition (IWCP)

The goal of this workshop is to bridge the gap between the different research fields analyzing handwritten scripts in ancient artifacts. It is primarily targeted at computer scientists, natural scientists, and humanists involved in the study of ancient writing systems and their materials, but it is not limited to these groups. By promoting discussion among these three communities, the workshop aims to encourage future interdisciplinary collaborations that will address current research questions about ancient manuscripts.

Graphics Recognition – 15th edition (GREC 2023)

GREC 2023 will provide an excellent opportunity for researchers and practitioners at all levels of experience to meet colleagues and to share new ideas and knowledge about graphics recognition methods. Graphics Recognition is a sub-field of document image analysis that deals with graphical entities in engineering drawings, comics, musical scores, sketches, maps, architectural plans, mathematical notation, tables, diagrams, etc.

Historical Document Imaging and Processing – 7th edition (HIP’23)

The 7th International Workshop on Historical Document Imaging and Processing (HIP’23) will bring together researchers from various fields working on document image acquisition, restoration, analysis, indexing, and retrieval to make these documents accessible in digital libraries. It is the seventh satellite workshop of ICDAR dedicated to this topic, following HIP’11 in Beijing, HIP’13 in Washington, HIP’15 in Nancy, HIP’17 in Kyoto and HIP’19 in Sydney and HIP’21 in Lausanne (hybrid) that were a significant success with strong participation. HIP aims to provide the researchers with a forum that is complementary and synergetic to the main sessions at ICDAR on document analysis and recognition.

Machine Learning – 4th edition (WML)

Since 2010, the year of initiation of the annual ImageNet Competition where research teams submit programs that classify and detect objects, machine learning has gained significant popularity. In the present age, Machine learning, in particular deep learning, is incredibly powerful to make predictions based on large amounts of available data. There are many applications of machine learning in Computer vision, pattern recognition including Document analysis, Medical image analysis etc. In order to facilitate innovative collaboration and engagement between document analysis community and other research communities like computer vision and images analysis etc. here we plan to organize this workshop of Machine learning.

Machine Vision and NLP for Document Analysis – 1st edition (VINALDO)

The first edition of the machine VIsion and NAtural Language processing for DOcument analysis (VINALDO) workshop comes as an extension of the GLESDO workshop, where we encourage the description of novel problems or applications for document analysis in the area of information retrieval that has emerged in recent years. We also encourage works that include NLP tools for extracted text, such as language models and Transforms. Finally, we also encourage works that develop new scanned document datasets for novel applications.

Scaling-up Document Image Understanding (ScalDOC)

Document Analysis has long suffered from both a fragmented landscape of task-specific datasets and a strong focus on narrow-focused information extraction and document conversion tasks. The Scaling-up document Image understanding workshop aims to open the discussion on possible ways for the community to align data preparation efforts and define large-scale (grand) challenges that drive progress in the field.. This is meant to be one of a series of such events to be organized on our scientific forums in the near future. The aspiration is to set the seed for an initiative to create our own community’s document-oriented “ImageNet”, over which multiple long-term grand challenges can be defined.

5) Summer school call for proposal SSDA 2023 and 2024 – extended

Call for proposals: IAPR TC10/TC11 Summer School on Document Analysis

Important Dates:

January 23, 2023 Proposal Submission Deadline
March 31, 2023 new Proposal Submission Deadline

Note that proposals for organizing the summer school in 2024 are also welcome.

Please submit proposals via email to:

• Foteini Simistira Liwicki (TC11 Representative), foteini.liwicki@ltu.se
• KC Santosh (TC10 Representative), Santosh.KC@usd.edu

Part of the mission of International Association for Pattern Recognition (IAPR) TC11 and TC10 is to promote high quality educational activities related to Reading Systems and Graphics Recognition. Responding to this need, TC10 and TC11 have established a series of summer schools. After the successful organization of summer schools in Jaipur, India, La Rochelle, France, Islamabad, Pakistan, and Luleå, Sweden, we are now soliciting proposals for the organization of the fifth “IAPR TC10/TC11 Summer School on Document Analysis” (SSDA) in 2023.
The “IAPR TC10/TC11 Summer School on Document Analysis” is intended to become the primary educational activity of IAPR TC11 (Reading Systems) and TC10 (Graphics Recognition). The School is meant to be a training activity where participants are exposed to the latest trends and techniques of Reading Systems and Graphics Recognition.
The aim of the School is to provide both an objective and clear overview and an in-depth analysis of the state-of-the-art research in selected topics of Reading Systems and Graphics Recognition. The School should aim to provide a stimulating opportunity for young researchers and PhD students in the field.
Individuals and groups who are interested in Reading Systems and Graphics Recognition are invited to submit proposals for organizing and hosting the 2023 IAPR TC10 / TC11 Summer School. Note that submissions for organizing and hosting the 2024 IAPR TC10 / TC11 Summer School are also welcome. As the previous summer schools were organized in Asia, Europe, and the Sub-continent, organizing teams from the Americas are encouraged to submit a bid in order to facilitate the envisioned rotational scheme of the IAPR TC10 / TC11 Summer School.
In order to fully plan their bid, it is expected that proposers familiarize themselves with the guidelines for organizing the School first. The Guidelines can be found at the TC11Web site: http://www.iapr-
tc11.org/mediawiki/index.php/Guidelines_for_Organising_and_Bidding_to_Host_the_TC
10_/_TC11_Summer_School

The submission of a bid implies full agreement with the rules and procedures for organizing the School. Especially, this means that organizers will apply for IAPR support and that the event will use the series title “IAPR TC10/TC11 Summer School on Document Analysis” with an optional sub-title denoting a special focus of the respective event.
Please consider submitting a proposal for this increasingly important event for the TC10/TC11 community. If you have questions, please do not hesitate to contact the TC11 and TC10 SSDA representatives: Foteini Simistira Liwicki (TC11 Representative) and KC Santosh (TC10 Representative).

Previous events:

As a reference, the 2021 Summer School on Document Analysis was held in Luleå, Sweden with the theme Digital Transformation in a changing world (URL: https://www.ltu.se/research/subjects/Maskininlarning/Workshoppar/SSDA-2021?l=en)

6) IJDAR article alert (vol. 26, issue 1)

Volume 26, issue 1, March 2023. Here are the 5 articles of this issue:

YOLO-table: disclosure document table detection with involution
Daqian Zhang, Ruibin Mao, Runting Guo, Yang Jiang, Jing Zhu

Open writer identification from offline handwritten signatures by jointing the one-class symbolic data analysis classifier and feature-dissimilarities
Mohamed Anis Djoudjai, Youcef Chibani

Sequence-aware multimodal page classification of Brazilian legal documents
Pedro H. Luz de Araujo, Ana Paula G. S. de Almeida, Fabricio Ataides Braz, Nilton Correia da Silva, Flavio de Barros Vidal, Teofilo E. de Campos

Pho(SC)-CTC—a hybrid approach towards zero-shot word image recognition
Ravi Bhatt, Anuj Rai, Sukalpa Chanda, Narayanan C. Krishnan

Cover-based multiple book genre recognition using an improved multimodal network
Assad Rasheed, Arif Iqbal Umar, Syed Hamad Shirazi, Zakir Khan, Muhammad Shahzad

7) Job offers – 1 new

2x Post-doctoral positions at the Computer Vision Center, Barcelona

We are seeking two postdoctoral researchers to join the Vision, Language and Reading group at the Computer Vision Center (CVC), in Barcelona, Spain, focused on (1) COMPUTER VISION and (2) FEDERATED LEARNING AND DIFFERENTIAL PRIVACY.

The positions are available for a minimum of 2 years, and are linked to the “European Lighthouse on Secure and Safe AI” (ELSA), a European Project funded by Horizon Europe and backed by the ELLIS network of excellence. The project covers research topics that include robustness, privacy and human agency and will develop use cases in areas such as autonomous driving, robotics, health and document intelligence. The candidate researchers will focus on privacy aware methods for document understanding.

CANDIDATE ’S PROFILE

THE COMPUTER VISION CENTER

The direct responsible for these posts will Dr Dimosthenis Karatzas, leading the Vision, Language and Reading research group at the CVC.

RESEARCH CONTACT

If you are interested in the position, please contact Dr Dimosthenis Karatzas for more information and applications (dimos@cvc.uab.es)

APPLICATION PROCESS

MORE INFO

ELSA project: https://elsa-ai.eu/
Computer Vision Center: http://www.cvc.uab.es/
Vision, Language and Reading group: https://www.vlr.ai/

[IAPR-TC10] Newsletter 153 – December 2022

Posted on 15/12/202222/12/2022 by Christophe Rigaud

Welcome to the December edition of the TC10 newsletter.

In this issue, you will find call for workshop and tutorial of ICDAR 2023, the call for proposal of DAS2024, the last IJDAR issue summary and a new job offers in Barcelona (Spain).

I wish you a pleasant reading,

Christophe Rigaud
IAPR-TC10 Communications Officer

Table of content:

1) Upcoming deadlines and events
2) ICDAR2023 call for workshop
3) ICDAR2023 call for tutorials
4) DAS2024 call for proposal
5) IJDAR article alert (vol. 25, issue 4)
6) Job offers – 1 new

Call for contributions: feel free to contribute to TC10 newsletters, by sending any relevant news, event, notice, open position, dataset or link to us on iapr.tc10[at]gmail.com

1) Upcoming deadlines and events

2022

Deadlines:
- ~~December 15~~ January 8, workshop proposal due ICDAR 2023
- January 11, tutorial proposal deadline ICDAR 2023
- January 15, paper submission deadline ICDAR 2023
- February 28, DAS2024 hosting proposal due

2023 and later

Deadlines:
- January 15, paper submission deadline ICDAR 2023, San José, California, USA

Events:
- August 21-26, 2023, conference ICDAR 2023, San José, California, USA
- September 2024, conference ICDAR 2024, Athens, Greece

2) ICDAR2023 Call for Workshop

The ICDAR2023 Organizing Committee invites proposals for workshops that will be held after the ICDAR-2023 main conference. Researchers interested in organizing workshops at ICDAR2023 are invited to submit a proposal, including the following information.

Title
Preference for the duration (full day or half day) and date (August 24, 25 and/or 26, 2023)
Scope and motivation
Topics
Relevance for ICDAR
Potential committee
Short CV of organizers

In order to facilitate innovative collaboration and engagement between members of other research communities and the document analysis community, in addition to the topics directly related to document analysis, Workshop Chairs also strongly encourage other research communities to submit proposals on recent other important topics which are used by the document analysis community. Workshops proposing discussions about topics dealing with an open vision of the notion of documents are also welcome.

Workshop Chairs also encourage the proposal of workshops related to document analysis systems, as well as industrial solutions and challenges.

Notes:

The ICDAR main conference organizers will handle the workshop registration and provide workshop space, coffee breaks and other facilities required to organize workshops (e.g. a room, a projector and a screen). In addition, a free registration will be provided to an organizer of each workshop
If a workshop incurs costs for invited speakers, the workshop organizers are required to bear that cost. The workshop organizers may solicit sponsorship to cover the relevant costs. They may use their free registration for the invited speaker
Please set the camera-ready due date of your workshop at April 15, 2023 or prior

About proceedings:
Workshop proceedings will be published by the through the Springer Lecture Notes in Computer Science series and proceeding costs will also be borne by the ICDAR main organizers
- The front matter (preface, title page and so on) and the paper order with session dividers are provided from the workshop organizers to the ICDAR2023 Publication Chairs. (Its due date will be set around the same date as camera-ready papers.)
- If the proceedings related conditions above are not met by an organizer, the proceedings will not be published through the ICDAR main organizers, and the organizer will need to be published locally by their own.

Submission Guidelines and Inquiries

All proposals should be submitted by email to the Workshop Chairs (Alicia Fornes – Computer Vision Center, Spain and Mickaël Coustaty – L3i laboratory, La Rochelle, France) via: afornes***cvc.uab.es and mickael.coustaty***univ-lr.fr (please replace *** with atmark).

For any inquiries you may have regarding the workshops, please contact the Workshop Chairs via the above emails.

Important Dates

Workshop Proposal Due date: December 15, 2022
Acceptance Notification: December 30, 2022
Dates of Workshops: August 24 – 26, 2023

3) ICDAR2023 Call for Tutorial

Important Dates

Jan 11 2023 Proposal Due
Jan 31 2023 Acceptance Notification
Aug 24-26 2023 Dates of Tutorials

The ICDAR 2023 https://icdar2023.org Organizing Committee invites proposals for tutorials that will be held on August 24-26th (the correct final date will be communicated as soon as possible), before the main conference begins.

ICDAR 2023 Tutorials should serve one or more of the following objectives and we are especially interested in an NLP tutorial and practicalities of network implementation:

Introduce students and newcomers to major topics of Document Analysis and Recognition (DAR) research.
Provide instructions on established practices and methodologies.
Introduce expert non-specialists to a DAR subarea. Survey a mature area of DAR research and/or practice.
Motivate and explain a DAR topic of emerging importance.
Overview DAR systems for industrial solutions (suggestion for researchers in industry).
Introduce some recent innovative techniques for DAR research and software quality, such as open-source libraries, high-level API, technical frameworks for expert developments, etc. (suggestion for expert
programmers).

In order to facilitate innovative collaboration and interaction between researchers in academia and industry, the Tutorial Chairs strongly encourage proposals for industrial tutorials, in which researchers in companies describe DAR systems and overview industrial solutions to document analysis problems in real use-case industrial scenarios.

Proposals should be up to 4 pages in length, and should contain the following information:

Title of the tutorial.
Scope and motivation. A brief description of the tutorial, suitable for inclusion in the conference registration brochure.
Preference for the duration (full day or half day). Due to agenda constraints, half day tutorials are recommended. If a full day is needed, provide a brief justification.
A detailed outline of the tutorial. Course description with list of topics to be covered, along with a brief outline.
Relevance for ICDAR. A description of why the tutorial topic would be of interest to a substantial part of the ICDAR audience. Expected target audience in terms of composition and estimated number of attendees. Prerequisite knowledge of the ICDAR audience for attending the tutorial.
Short CV of organizers. A brief CV of the presenter(s), including name, postal address, phone number, e-mail address, web page, background in the tutorial area (projects, relevant publications or tutorial-level articles on the subject), evidence of teaching experience.
The name and e-mail address of the corresponding presenter. The corresponding presenter should be available for e-mail correspondence during the evaluation process, in the case clarifications and discussions on the scope and content of the proposal are needed.

Evaluation

The evaluation of the proposal will take into account its general interest for ICDAR attendees, the quality of the proposal (e.g., a tutorial that simply lists a set of concepts without any apparent rationale behind them will not be approved) as well as the expertise and skills of the presenters. We emphasize that the primary criteria for evaluation will be whether a proposal is interesting, well-structured, and motivated in relation to Document Analysis and Recognition, rather than the perceived experience/standing of the proposer.

Last but not least, the tutorial should attract a meaningful audience, cover hot topics and incorporate new knowledge to the community. Those submitting a proposal should keep in mind that tutorials are intended to provide an overview of the field; they should present reasonably well established information in a balanced way. Tutorials should not be used to advocate a single avenue of research, nor should they promote a product.

Notes:

Tutorial slides must be provided to us for inclusion on the conference website and also on the TC-10 and TC-11 websites, as educational material.

The ICDAR main conference organizers will handle the tutorial registration and provide the space, coffee breaks and other facilities required to organize tutorials (e.g. a room, a projector and a screen).

Submission Guidelines & Inquiries

All proposals should be submitted by electronic mail to the Tutorial Chairs:

Elisa Barney (elisa.barney@ltu.se)
Laurence Likforman-Sulem (likforman@telecom-paris.fr)

Feedback, comments and/or suggestions would be provided within two weeks of receiving the proposal. Final acceptance (or rejection) would be decided by January 31, 2022.

Inquiries should be sent to tutorials-chairs@icdar2023.org or the above emails.

4) DAS2024 Call for Proposal

Deadline: February 28, 2023

Submission Method: email to andreas.fischer@unifr.ch / jean-christophe.burie@univ-lr.fr

Anyone interested in submitting a proposal to host DAS 2024 in Greece should drop an email to andreas.fischer@unifr.ch and jean-christophe.burie@univ-lr.fr by February 28, 2023.

We are looking forward to receiving high quality proposals and making the
first chapter of DAS-ICDAR 2024 a big success!

Andreas Fischer (Chair, TC11)
Jean-Christophe Burie (Chair, TC10)

5) IJDAR article alert (vol. 25, issue 4)

Volume 25, Issue 4, December 2022
https://link.springer.com/journal/10032/volumes-and-issues/25-4

Advances in handwriting recognition
Utkarsh Porwal, Alicia Fornés and Faisal Shafait

Character spotting and autonomous tagging: offline handwriting recognition for Bangla, Korean and other alphabetic scripts
Nishatul Majid and Elisa H. Barney Smith

BCBId: first Bangla comic dataset and its applications
Arpita Dutta, Samit Biswas and Amit Kumar Das

Domain adaptation for staff-region retrieval of music score images
Francisco J. Castellanos, Antonio Javier Gallego, Ichiro Fujinaga – open access

A holistic approach for image-to-graph: application to optical music recognition
Carlos Garrido-Munoz, Antonio Rios-Vila and Jorge Calvo-Zaragoza – open access

A survey of historical document image datasets
Konstantina Nikolaidou, Mathias Seuret, Hamam Mokayed, Marcus Liwicki – open access

Combination of explicit segmentation with Seq2Seq recognition for fine analysis of children handwriting
Omar Krichen, Simon Corbillé, Éric Anquetil, Nathalie Girard, Élisa Fromont & Pauline Nerdeux

A novel holistic unconstrained handwritten urdu recognition system using convolutional neural networks
Aejaz Farooq Ganai and Farida Khursheed

Conv-transformer architecture for unconstrained off-line Urdu handwriting recognition
Nauman Riaz, Haziq Arbab, Arooba Maqsood, Khuzaeymah Nasir, Adnan Ul-Hasan and Faisal Shafait

Benchmarking online sequence-to-sequence and character-based handwriting recognition from IMU-enhanced pens
Felix Ott, David Rügamer, Lucas Heublein, Tim Hamann, Jens Barth, Bernd Bischl and Christopher Mutschler – open access

Textline alignment on the image domain
Boraq Madi, Ahmad Droby and Jihad El-Sana

6) Job offers – 1 new

POSTDOC POSITION IN FEDERATED LEARNING AND DIFFERENTIAL PRIVACY – new

The Computer Vision Center (CVC), Barcelona, has a vacancy for a postdoc to work at the frontier between document intelligence and privacy preserving learning.

We are seeking a postdoc to join the Vision, Language and Reading group at the Computer Vision Center (CVC), in Barcelona, Spain.

The position is for 2.5 years. The position will remain open until a good candidate is found.

The position is linked to a European Project: the “European Lighthouse on Secure and Safe AI” (ELSA), funded by Horizon Europe and backed by the ELLIS network of excellence. The project covers research topics that include robustness, privacy and human agency and will develop use cases in areas such as autonomous driving, robotics, health and document intelligence.

The specific focus of this postdoc position will be on applying federated learning and differential privacy to document intelligence tasks. The candidate will co-supervise one PhD student.

PROJECT PI & HOSTING GROUP

The direct responsible for this post will Dr Dimosthenis Karatzas. He is the leader of the Vision, Language and Reading research group at the CVC. For more information visit: http://vlr.cvc.uab.es/

CANDIDATE ’S PROFILE

The candidate should have a background in machine learning, federated learning and differential privacy. The applicants are expected to be fluent in both oral and written communication in English. They should work well in a team while demonstrating initiative and independence. The candidate is expected to supervise PhD students.

THE COMPUTER VISION CENTER

The selected candidate will work at the Computer Vision Centre (CVC), Barcelona, a research institute comprising more than 130 researchers and support staff, dedicated to computer vision research and knowledge transfer. With a strong international projection and links to the industry, the Computer Vision Centre offers an exciting environment for scientific career development. The Computer Vision Centre has a plan for expansion of its permanent research staff base and has received the “HR Excellence in Research” award as a provider and supporter of a stimulating and favourable working environment.

SALARY AND CONDITIONS

Gross salary of 30-40k€/year depending on experience.

RESEARCH CONTACT

If you are interested in the position, please contact Dr Dimosthenis Karatzas for more information and applications (dimos@cvc.uab.es)

APPLICATION PROCESS

Apply by filling in the online form at:

http://www.cvc.uab.es/blog/2022/12/07/postdoc-federated-learning-and-differential-privacy/

WEB SITES

ELSA project

Computer Vision Center

Vision, Language and Reading group

[IAPR-TC10] Newsletter 152 – September 2022

Posted on 08/09/202220/09/2022 by Christophe Rigaud

Welcome to the September edition of the TC10 newsletter.

In this issue, you will find call for competition and journal-track papers for ICDAR 2023, a presentation of the New Technologies Show at ECCV. Also the two last IJDAR issues and two new job offers in Barcelona (Spain) and Rouen (France).

I wish you a pleasant reading,

Christophe Rigaud
IAPR-TC10 Communications Officer

Table of content:

1) Upcoming deadlines and events
2) ICDAR2023 call for competitions
3) ICDAR-IJDAR journal track
4) ECCV 2022 new technologies show
5) IJDAR article alert (vol. 25, issue 2&3)
6) Job offers – 2 new

Call for contributions: feel free to contribute to TC10 newsletters, by sending any relevant news, event, notice, open position, dataset or link to us on iapr.tc10[at]gmail.com

1) Upcoming deadlines and events

2022

Deadlines:
- October 28, competition proposition deadline ICDAR 2023
- October 31, 1st round paper submission deadline ICDAR-IJDAR Journal Track

Events:
- October 16-19, conference ICIP 2022, Bordeaux, France
- October 23-27, conference ECCV 2022, Tel Aviv, Israël
- December 4-7, conference ICFHR 2022, Hyderabad, India

2023 and later

Deadlines:
- January 15, paper submission deadline ICDAR 2023, San José, California, USA

Events:
- August 2023, conference ICDAR 2023, San José, California, USA
- September 2024, conference ICDAR 2024, Athens, Greece

2) ICDAR2023 Call for Competitions

The ICDAR2023 Organizing Committee invites proposals for competitions that aim at evaluating the performance of algorithms and methods related to areas of document analysis and recognition.

You are cordially invited to submit a proposal, that should contain the following information:

Contest title and abstract
A brief description of the competition, including what the particular task under evaluation is, why this competition is of interest to the ICDAR community, and the expected number of participants
An outline of the competition schedule
Description of the dataset to be used, and the evaluation process and metrics for submitted methods
The names, contact information, and brief CVs of the competition organizers, outlining previous experience in performance evaluation and/or organizing competitions

The following rules shall apply to the accepted Competitions:

The name of competition must be standardized by starting with “ICDAR 2023” e.g. “ICDAR 2023 Competition on …” or “ICDAR 2023 … Competition.”
Datasets used in the competitions must be made available after the end of the competitions. Specifically, the training data and ground truth must be publicly released and there must be a way to evaluate performance on a test set. This could take the form of an evaluation server, or the test data, ground truth, and evaluation script could be made publicly available. In principle, the organizers should submit the dataset to IAPR TC10 or TC11.
Evaluation methodologies and metrics used must be described in detail so that results can be replicated later. Evaluation scripts must be released afterwards.
Each competition has to be presented with a poster at a prominent place at the conference venue, and selected competitions will get the chance to be presented orally in the dedicated session mentioned above.
Competitions must have a sufficient number of participants to be able to draw meaningful conclusions.
Reports (full papers) on each competition will be reviewed and, if accepted (the competition ran according to plan, attracted a minimum level of participation and is appropriately described), will be published in the ICDAR 2023 conference proceedings.
Participants should not have access to the ground-truthed test dataset until the end of the competition. The evaluation should be done by the organizers.

Submission Guidelines & Inquiries

All proposals should be submitted by electronic mail to the Competition Chairs (Kenny Davila, Dimosthenis Karatzas, and Chris Tensmeyer) via:

competitions-chairs@icdar2023.org

We encourage competitions proposals with a solid plan to remain active and challenging for the community over and above ICDAR 2023

For any inquiries you may have regarding the competitions, please contact us via above email.

Important Dates

October 28, 2022: Competition Proposal Due

November 15, 2022: Competition Acceptance Notification

December 1, 2022: Individual Competition Websites are Live

Mar 20, 2023: Suggested deadline for competition participants

April 10, 2023: Initial Submission of Competition Reports Deadline

May 1, 2023: Camera-Ready Papers Due

July 1, 2023: Communicate Winners to Chairs

August 21-26 2023: Presentation or results at ICDAR Conference

3) ICDAR-IJDAR Journal Track

ICDAR-IJDAR Journal Track Call For Papers of the 17th International Conference on Document Analysis and Recognition August 21-26, 2023 – San José, California, USA
https://www.springer.com/journal/10032/updates/23440814

Following a tradition established at ICDAR 2019 and 2021, ICDAR 2023 will include a journal paper track that offers the rapid turnaround and dissemination times of a conference, while providing the paper length, scientific rigor, and careful review process of an archival journal.

The ICDAR-IJDAR journal track invites high-quality submissions presenting original research in Document Analysis and Recognition. Journal versions of previously published conference papers or survey papers will not be considered for this special issue; such submissions can be submitted as journal-only papers via the regular IJDAR process.

Accepted papers will be published in a special issue of IJDAR, and will receive an oral presentation slot at ICDAR 2023. Authors who submit their work to the journal track commit themselves to present their results at the ICDAR conference if their paper is accepted. IJDAR’s publisher, Springer-Nature, will make accepted papers freely available for four weeks around the conference. After this, accepted papers will be available from the archival journal.

Format: Journal track papers submissions should follow the format of IJDAR submissions and the standard guidelines for the journal.

Procedure and Deadlines: While we permit a second round of submissions for the ICDAR-IJDAR journal track for both new submissions and revised papers from the 1st round of submissions, it must be understood by all authors that the full journal review process will be required for all ICDAR-IJDAR track submissions — there is no “short cut.” Papers submitted late in the second round run a risk of the review process not completing in time. In such cases, authors should know that: (1) The submitted paper can still continue under review as an IJDAR paper. (2) The authors always have the option of preparing a shorter version of the paper to submit as a regular ICDAR conference paper.

Questions regarding the status of a paper that is under review for the IJDAR- ICDAR journal track should be directed to Ms. Katherine Moretti @Springer (katherine.moretti@springer.com). The guest editors have full authority to determine that the journal review process will not complete in time, and to decline submissions that arrive too late. In which case they will immediately notify the authors who can decide to leave the paper in the standard journal reviewing system or withdraw their manuscript from the journal.

Important Dates

October 31, 2022
1st round journal track paper submissions due

January 4, 2023
1st round journal track paper notifications

January 15, 2023
*Regular ICDAR proceedings paper submission deadline

March 15, 2023
2nd round journal track paper submissions due

May 15, 2023
2nd round journal track paper notifications

Special Issue Guest Editors
ICDAR 2023 PC Chairs
– Gernot A. Fink
– Rajiv Jain
– Koichi Kise (liaison to IJDAR)
– Richard Zanibbi

Inquiries
For additional information, please email one of the editors of this special issue or consult the FAQ that will be posted on the IJDAR website.

4) ECCV New Technologies Show

New Technologies Show @ECCV 2022

Do you think that your research idea can generate impact?

Pitch it at ECCV!

Research ideas are the fuel for innovation. ECCV 2022 wants to give the stage to researchers to present not only the scientific virtues of their work, but also their ideas about how their research can lead to meaningful innovation, highlighting possible applications enabled by their research.

ECCV 2022 will host a “New Technologies Show”, where researchers are invited to pitch their technologies to a different crowd. A jury comprising entrepreneurs and innovation experts will give feedback to all participating teams, while the most promising technologies will be offered further mentoring post-ECCV.

All potential participants to the New Technologies Show, will be offered a short (~1-hour) online seminar prior to ECCV, on how to create a successful pitch. Research teams are invited to express their interest and encouraged to attend the online seminar before finally confirming whether they desire to present their technology in the New Technologies Show.

The New Technologies Show session will take place on-site. All participants to the New Technologies Show will be invited to the industry track reception during the ECCV.

Please use the online form to express your interest to participate:

https://forms.gle/vy9fwxaXPj2jC9Ay6

Important Dates:

Expressions of interest: 10/9/2022

Online Course: 20/9/2022

Confirmation of participation: 1/10/2022

New Technologies Show session (on-site): 26/10/2022

Organisers:

Amir Markovitz (Amazon Research), Yair Kittenplon (Amazon Research), Shai Mazor (Amazon Research), Omar Zahr (Tandemlaunch), Dimosthenis Karatzas (Computer Vision Center), Chen Sagiv (SagivTech), Lihi Zelnik-manor (Technion)

For any inquiries, please contact: industrychairs@ecva.net

Supported By:

TandemLaunch Inc.

– TandemLaunch will provide an hour-long online group workshop on preparing an investment narrative based on their invention.

– All applicants will be considered for an invitation into the TandemLaunch Startup Foundry Investment program.

Awards:

– Winners will receive 2 hours of Technology and IP Strategy mentoring and 1 hour of CTO career mentoring with the CTO of TandemLaunch.

5) IJDAR article alert (vol. 25, issue 2&3)

Volume 25, Issue 2, June 2022
https://link.springer.com/journal/10032/volumes-and-issues/25-2

Segmentation for document layout analysis: not dead yet
Logan Markewich, Hao Zhang and Seok-Bum Ko

Feature learning and encoding for multi-script writer identification
Abdelillah Semma, Yaâcoub Hannad and Mohamed El Youssfi El Kettani

Robust text line detection in historical documents: learning and evaluation methods
Mélodie Boillet, Christopher Kermorvant and Thierry Paquet

Arbitrary-shaped scene text detection with keypoint-based shape representation
Shuxin Qin and Lin Chen

Personalizing image enhancement for critical visual tasks: improved legibility of papyri using color processing and visual illusions
Vlad Atanasiu and Isabelle Marthot-Santaniello – open access
Asma Naseer, Sarmad Hussain, Kashif Zafar, Ayesha Khan

Correction to: Personalizing image enhancement for critical visual tasks: improved legibility of papyri using color processing and visual illusions
Vlad Atanasiu and Isabelle Marthot-Santaniello

Volume 25, Issue 3, September 2022
https://link.springer.com/journal/10032/volumes-and-issues/25-3

Scene text detection via decoupled feature pyramid networks
Min Liang, Jie-Bo Hou and Jingyan Qin

CarveNet: a channel-wise attention-based network for irregular scene text recognition
Guibin Wu, Zheng Zhang and Yongping Xiong

Fusion of visual representations for multimodal information extraction from unstructured transactional documents
Berke Oral and Gülşen Eryiğit

6) Job offers – 2 new

Postdoctoral Research Position on Document Intelligence and Privacy Preserving Learning – new

The Computer Vision Center (CVC), Barcelona, has a vacancy for a postdoc to work at the frontier between document intelligence and privacy preserving learning.

The position is linked to the European Lighthouse on Secure and Safe AI (ELSA) project https://elsa-ai.eu/, funded by European Union’s Horizon Europe research and innovation programme.

The successful candidate is expected to contribute to the design and development of AI solutions for document understanding, and in particular on Document Visual Question Answering systems, employing privacy preserving techniques and infrastructures set up by the ELSA project.

The candidate is expected to have extensive experience in at least one of: document intelligence and/or private and robust collaborative learning (differential privacy, federated learning), and an appetite to learn.

All applications must include a full CV in English including contact details and a Cover Letter with a statement of interest in English.

Duration: between 30 and 36 months, depending on the start date

Salary: €40,000 annual gross, according to CVC labour categories

More information and application procedure:

https://euraxess.ec.europa.eu/jobs/815190

http://www.cvc.uab.es/blog/2022/07/19/postdoctoral-research-position/

This project has received funding from the European Union’s Horizon Europe research and innovation programme under grant agreement No 101070617. Views and opinions expressed are those of the author(s) only and do not necessarily reflect those of the European Union. Neither the European Union nor the granting authority can be held responsible for them.

Research Engineer Position on Information extraction, Text Recognition in Historical Document Collections – new

LITIS

LITIS (Laboratoire d’Informatique, Traitement de l’information et des Systèmes) is a research
laboratory associated to the University of Rouen Normandie, Le Havre Normandie Normandie,
and School of Engineering INSA Rouen Normandie. Research at LITIS is organized around 7
research teams which contribute to 3 main application domains: Access to Information,
Biomedical Information Processing, Ambient Intelligence. LITIS currently includes 90 faculty staff
members, 50 PhD students, 20 PostDoc and Research Engineers. The Machine Learning team of
LITIS is developing research in modeling unstructured data (signals, images, text, etc…) with
machine learning algorithms and statistical models. For more than two decades it has contributed
to the development of reading systems and document image analysis for various applications such
as postal automation, business document exchange, digital libraries, etc…

EXO-POPP project

Optical Extraction of Handwritten Named Entities for Marriage
Certificates for the Population of Paris (1880–1940)
Thanks to a collaboration between specialists in machine learning and historians, the EXO-POPP
project will develop a database of 300,000 marriage certificates from Paris and its suburbs
between 1880 and 1940. These marriage certificates provide a wealth of information about the
bride and groom, their parents, and their marriage witnesses, that will be analyzed from a host of
new angles made possible by the new dataset. These studies of marriage, divorce, kinship, and
social networks covering a span 60 years will also intersect with transversal issues such as gender,
class, and origin. The geolocation of data will provide a rare opportunity to work on places and
relocations within the city, and linkage with two other databases will make it possible to follow
people from birth to death.
Building such a database by hand would take at least 50,000 hours of work. But, thanks to the
recent developments in deep learning and machine learning, it is now possible to build huge
databases with automated reading systems including handwriting recognition and natural
language understanding. Indeed, because of these recent advances, optical printed named entity
recognition (OP-NER) is now performing very well. On the other hand, while handwriting
recognition by machine has become a reality, also thanks to deep learning, optical handwritten
named entity recognition (OH-NER) has not received much attention. OH-NER is expected to
achieve promising results on handwritten marriage certificates dating from 1880 to 1923. This
project research questions will focus on the best strategies for word disambiguation for
handwritten named entity recognition. We will explore end-to-end deep learning architectures for
OH-NER, writer adaptation of the recognition system, and named entity disambiguation by
exploiting the French mortality database (INSEE) and the French POPP database. An additional
benefit of this study is that a unique and very large dataset of handwritten material for named
entity recognition will be built.Laboratoire LITIS, EA 4108, Université de Rouen, 76 800 Saint-Etienne du Rouvray, FRANCE
Téléphone : (33) 2 32 95 50 13 Fax : (33) 2 32 95 50 22 Email : Thierry.Paquet@univ-rouen.fr

Missions

The research engineer will be in charge of the development of a processing pipeline dedicated
to optical printed named entity recognition (OP-NER). He will closely collaborate with a Ph.D.
student in charge of Handwritten Named Entity Recognition (OH-NER).
OP-NER is the project’s easiest task and will benefit from the latest results achieved by the LITIS
team on similar problems on financial yearbooks. Images are first processed to extract every text
information. This will be achieved with the DAN architecture designed by LITIS which is a deep-
learning-based OCR (https://arxiv.org/abs/2203.12273). The research engineer will be in charge of
this OCR task. A benchmark of DAN against available OCR software such as Tesseract and EasyOCR
will also be conducted. Then the textual transcriptions will be processed for named entity
extraction and recognition. Named entity recognition is a well-defined task in the natural language
processing community. In the EXO-POPP context however, we need to define each entity to be
extracted more precisely to make a clear distinction between the different people occurring in the
text. For example, we need to distinguish between wife and husband names, and similarly for the
parents of the husband and of the wife, and so on for the witnesses, children, etc. An estimation
of around 135 categories has been established. The TAG definition was made by LITIS as well as a
first training dataset. Manually tagging the transcriptions has been made possible through the
PIVAN web-based collaborative interface (https://litis-exopopp.univ-rouen.fr/collection/12 ). This
platform provides in one single web interface a document image viewer, viewing and editing of
OCR results and text tagging facilities for NER. PIVAN eases the annotation efforts of the H&SS
trainees and allows for building the large, annotated datasets required for machine learning
algorithms to run optimally. The research engineer will oversee datasets generation and curation
as per the requirement of the EXO-POPP NER task, including the handwritten datasets.
The named entity recognition task will be based on a state-of-the-art machine learning approach.
We have started some experimentations with the well-known FLAIR NER library
(https://github.com/flairNLP/flair). We plan to continue developing and tuning the EXO-POPP
named entity recognition module using this library. The research engineer will oversee this task
entirely.

Tasks

• The research engineer will be in charge of tuning PIVAN for the OCR task. A benchmark of
DAN with the available OCR technologies such as Tesseract and EasyOCR will also be
conducted.
• The research engineer will be in charge of datasets generation and curation as per the
requirement of the EXO-POPP NER task, including the handwritten datasets.
• The research engineer will be in charge of developing the NER module.Laboratoire LITIS, EA 4108, Université de Rouen, 76 800 Saint-Etienne du Rouvray, FRANCE
Téléphone : (33) 2 32 95 50 13 Fax : (33) 2 32 95 50 22 Email : Thierry.Paquet@univ-rouen.fr

Deliverables:

Transcription of the typescript corpus
Named entities extracted from the typescript corpus

Skills :

• General software development and engineering, Python
• Machine Learning, Computer vision, Natural Language Processing
• Ability to work in a team, curious and rigorous spirit
• Knowledge in web-based programming is a plus

Position to be filled :
Positions: 1 Research Engineer
Time commitment: Full-time
Duration of the contract: September 1st 2022 – 31st August 2023
Contact: Prof. Thierry Paquet, Thierry.Paquet@univ-rouen.fr
Indicative salary: €24 000 annual net salary, plus French social security benefits
Location: LITIS, Campus du Madrillet, Faculty of science, Saint Etienne du Rouvray, France

[IAPR-TC10] Newsletter 151 – May 2022

Posted on 13/05/202217/05/2022 by Christophe Rigaud

Welcome to the May edition of the TC10 newsletter.

In this issue, you will find the last call for participation for DAS 2022, ICFHR call for workshops, tutorials and competitions, a selection of workshop in conjunction of ECCV and ICPR conferences and a call for paper in a special issue of Pattern Recognition Letters journal: Advances and New challenges in Document Analysis, processing and Recognition at the Dematerialization Age. In addition, the in Memoriam of Professor Sargur N. Srihari from the IAPR newsletter.

I hope to see you soon in La Rochelle,

Christophe Rigaud
IAPR-TC10 Communications Officer

Table of content:

1) Upcoming deadlines and events
2) Last Call for Participation DAS 2022
3) Call for paper MANPU 2022 – deadline extended
4) Call for Workshops & Tutorials ICFHR 2022
5) Call for Competitions ICFHR 2022
6) Workshop on “Text in Everything” ECCV 2022
7) Workshop on Identity Document Analysis and Verification ICPR 2022
8) Workshop on Reproducible Research in Pattern Recognition ICPR 2022
9) Call for paper Pattern Recognition Letters
10) IJDAR article alert (vol. 25, issue 1)
11) IAPR Newsletter summary – April 2022
12) Job offers – 1 new

Call for contributions: feel free to contribute to TC10 newsletters, by sending any relevant news, event, notice, open position, dataset or link to us on iapr.tc10[at]gmail.com

1) Upcoming deadlines and events

2022

Deadlines:
- May 22, paper abstract submission deadline ICFHR 2022
- May 31, workshop & tutorial proposal deadline ICFHR 2022
- June 5, competition proposal deadline ICFHR 2022
- June 6, paper submission deadline MANPU 2022 – extended

Events:
- May 22-25, conference DAS 2022, La Rochelle, France
- June 1-3, conference ICPRAI 2022, Paris, France
- July 10-16, summer school ICVSS, Italy (Sicily)
- August 21, workshop MANPU 2022, Montréal, Québec (QC), Canada
- August 21-25, conference ICPR 2022, Montréal, Québec (QC), Canada
- October 16-19, conference ICIP 2022, Bordeaux, France
- October 23-27, conference ECCV 2022, Tel Aviv, Israël
- December 2022, conference ICFHR 2022, Hyderabad, India

2023 and later

Events:
- August 2023, conference ICDAR 2023, San José, California, USA
- September 2024, conference ICDAR 2024, Athens, Greece

2) 📢 DAS 2022 Last call for participation

15th IAPR International Workshop on Document Analysis Systems (DAS 2022)

May 22-25, La Rochelle, France
DAS 2022 participation will be hybrid (i.e., both in-person and online)

https://das2022.univ-lr.fr/

🎟️ Registration for DAS 2022 is open. Complete information about the scientific program, tutorials, and keynote talks is now available online, and summarized below.

ℹ️ About DAS

DAS 2022 is the 15th edition of the IAPR sponsored workshop focusing on system-level issues and approaches for document analysis and recognition. The workshop consists of invited keynote speeches, oral and poster paper presentations, tutorials, and working group discussions.

Accepted Papers. The accepted papers at DAS 2022 cover a wide range of topics, including Image Processing, OCR, HTR, Forensics, Historical Document Analysis, Natural Language Processing, Information Retrieval, Deep Learning, and Datasets and Evaluation.

A Quick Summary of Paper Submissions and Acceptances:

88 full-length papers were submitted for DAS 2022
- 31 were accepted for an oral presentation, and
- 21 for a poster presentation.
16 short papers were also submitted to DAS 2022, and accepted for poster presentation.

🏖️ Location

DAS 2022 will be hosted by La Rochelle University (France) on May 22-25, 2022. The city of La Rochelle, situated on the central west coast of France, possesses a rich historical fabric including the Saint-Nicolas tower and urban heritage. In the early 21st century, the city has consistently been ranked among France’s most liveable cities.

🎤 Keynotes and Tutorials

Keynote 1: “Automatic Question Answering & Generation in News Archives” by Professor Adam Jatowt (University of Innsbruck, Austria)
Keynote 2: “The kiss of the Prince”: Bringing Document Content to Life by Professor Andreas Dengel (German Research Center for Artificial Intelligence (DFKI) in Kaiserslautern, Germany)
Tutorial: “Unlocking the Potential of Unstructured Data in Finance Through Document Intelligence” by Himanshu Sharad Bhatt (Research Director at American Express AI Labs, India)

📑 Program, Registration and Participation (In-Person and Online)

Final scientific program is available at: https://das2022.univ-lr.fr/index.php/scientific-program-overview/

Please register at: https://das2022.univ-lr.fr/index.php/registration-2/

For discounted accommodation options, please see: https://das2022.univ-lr.fr/index.php/accommodation/

For information on traveling to La Rochelle, please see: https://das2022.univ-lr.fr/index.php/on-site-participation/

For remote/online participation information, please see: https://das2022.univ-lr.fr/index.php/online-participation/

3) Call for paper MANPU 2022 – deadline extended

5th International Workshop on coMics ANalysis, Processing and Understanding

August 21, 2022, Montreal, Canada
In conjunction with ICPR 2022
https://manpu2022.imlab.jp

Important dates

Title and abstract submission due: June 6, 2022

Paper submission due: ~~May 13~~ June 6, 2022

Notification of acceptance: ~~May 31~~ June 30, 2022

Camera-ready paper due: ~~June 6~~ September 2, 2022

Workshop: August 21, 2022

Scope and Topics

The scope of this workshop includes, but is not limited to,

Datasets

To evaluate the proposed works, participants will be able to use the following datasets that are publicly available. Researchers can request to download them at each website.

– eBDtheque consists of 100 images with ground truth for panels, speech balloons, tails, text lines, leading characters: http://ebdtheque.univ-lr.fr/

– Manga109 consists of over 20 thousand images of 109 volumes (21,142 images): http://www.manga109.org/en/

Paper Submissions

Submission and Review

Papers should be formatted with the style files/details available at Information for Authors of Springer Computer Science Proceedings. Only PDF files are accepted. A complete paper should be submitted in the proper format. Only Full paper (12-15 pages) is accepted.

See more information on the website : https://manpu2022.imlab.jp/

General Co-Chairs

Jean-Christophe Burie (France)

Motoi Iwata (Japan)

Miki Ueno (Japan)

Program Co-Chairs

Rita Hartel (Germany)

Ryosuke Yamanishi (Japan)

Tien-Tsin Wong (Hong Kong)

Advisory Board

Kiyoharu Aizawa (Japan)

Koichi Kise (Japan)

Jean-Marc Ogier (France)

Toshihiko Yamasaki (Japan)

More info: https://manpu2022.imlab.jp

4) Call for Workshops & Tutorials ICFHR 2022

The ICFHR Organizing Committee invites proposals for workshops/tutorials that will be held on December 7, 2022, in Hyderabad, India. Researchers interested in organizing workshops or tutorials at ICFHR 2022 are invited to submit a proposal. The broad area of the submitted proposals for the workshop and tutorials could be topics of interest to ICFHR or the document image understanding and reading systems (that fall under the broad scope of TC10/TC11 of IAPR). The tutorial can be organized on topics of recent trends and tools used in the community. The proposal shall include the following information in a single PDF file:

Technical Information

Workshop/Tutorial title.
Topics that will be covered, with descriptions of why they are relevant.

Organizers and Speakers

Organizers’ names, titles, and affiliations and brief CVs
Names and affiliation of invited speakers. For each speaker, please indicate if attendance is tentative or confirmed.

Logistics

Preference for half-day or full-day events.
Estimated numbers of orals, posters, and invited talks, with a rough program outline. (if applicable)
Expected number of paper submissions (if planned)
Tentative program committee (for workshops).
Anticipated target audience and expected number of attendees.
Special space or logistic requirements, if any.
For workshops, also specify: (i) Paper submission deadline, (ii) Notification to authors, (iii) Camera-ready deadline.

Submission:
All proposals should be submitted by filling up this Google Form on or before May 31, 2022: Submission Link

Important Dates

Workshop and Tutorial proposal due: May 31, 2022
Notification to the organizers: June 15, 2022

Contact:
For any inquiries you may have regarding the workshops, please contact the Workshop Chairs.
veronica.romero@uv.es , mishra@iitj.ac.in

More info: http://icfhr2022.org/call-for-wt.php

5) Call for Competitions ICFHR 2022

The ICFHR2022 organizing committee invites proposals for competitions that aim at evaluating the performance of algorithms and methods related to the field of handwriting recognition.

You are cordially invited to submit a proposal that should contain the following information:

A brief description of the competition, including what the particular task under evaluation is and why this competition is of interest to the ICFHR community.
A draft of the outline of the competition describing the competition schedule, the expected number of participants and its rationale, which data is planned to be used, how the submitted methods will be evaluated, and which performance evaluation measures will be used.
The names, contact information, and brief CVs of the competition organizers, outlining previous experience in performance evaluation and/or organizing competitions.
The competition may be related to (but not limited to) the following areas:
- Few-shot/zero-shot handwritten manuscript layout analysis/text recognition
- Open set/open-world handwritten manuscript layout analysis/text recognition
- Multilingual handwritten manuscript layout analysis/text recognition
- Small size models for handwritten manuscript layout analysis/text recognition
- Domain adaptation from printed to handwritten manuscript transcription
- Transfer learning in handwritten manuscripts from one language to another
- Degraded/low-quality handwritten manuscripts text recognition
- Manuscript classification
- Handwritten document image binarization
- Historical scientific manuscript recognition
- Image retrieval and text spotting for historical handwritten manuscripts
- Detection, recognition and/or spotting of handwritten mathematical formula
- Handwriting style analysis
- Multi-script writer identification and retrieval
- Online signature recognition and verification

The following rules shall apply to the accepted competitions:

Name of the competition must be standardized by starting with “ICFHR2022” (e.g. “ICFHR2022 Competition on …” or “ICFHR2022 … Competition.”).
Datasets used in the competitions must be made available through the TC10/TC11, online resources website after the end of the competitions (or equivalent arrangement by approval of the competition chairs).
Evaluation methodologies and metrics used must be described in detail so that results can be replicated later.
Participants should be encouraged to share their algorithms and code using one of the open-source licenses.
Competitions must have a sufficient number of participants (about 5, depending on the originality of the topic) to be able to draw meaningful conclusions.
Reports (full papers) on each competition will be reviewed and, if accepted (the competition runs according to plan and is appropriately described), will be published in the ICFHR2022 conference proceedings.
Competition results will be announced during a dedicated session of the ICFHR2022 conference.
Each competition has to be presented with a poster at a prominent place at the conference venue, selected competitions will get the chance to be presented orally in the dedicated session mentioned above.
Based on the analysis of the received proposals, the competition chairs will first synchronize with the competition organizers, and then submit a proposal to the program committee, in order to finalize the organization of the competitions.

Submission Guidelines

All proposals should be submitted by electronic mail to the competition chairs (Rohit Saluja and Maroua Mehri) via: rohit.saluja@research.iiit.ac.in maroua.mehri@gmail.com

Important Dates

Proposal due: June 05, 2022
Acceptance notification: June 12, 2022
Suggested deadline for competition participants: July 17, 2022
Initial submission of competition reports deadline: September 18, 2022
Camera-ready papers due: September 25, 2022

Inquiries

For any inquiries you may have regarding the competitions, please contact the ICFHR2022 competition Chairs (Rohit Saluja and Maroua Mehri) via: rohit.saluja@research.iiit.ac.in maroua.mehri@gmail.com

More info: http://icfhr2022.org/call-for-com.php

6) Workshop on “Text in Everything” – ECCV 2022

Tel Aviv, Israel, October 2022
In conjunction with ECCV 2022
https://sites.google.com/view/tie-eccv2022/home

Understanding written communication through vision is a key aspect of human civilization and should also be an important capacity of intelligent agents aspiring to function in man-made environments. Interpreting written information in our environment is essential in order to perform most everyday tasks like making a purchase, using public transportation, finding a place in the city, getting an appointment, or checking whether a store is open or not, to mention just a few. As such, the analysis of written communication in images and videos has recently gained an increased interest, as well as significant progress in a variety of text based vision tasks. While in earlier years the main focus of this discipline was on OCR and the ability to read business documents, today this field contains various applications that require going beyond just text recognition, onto additionally reasoning over multiple modalities such as the structure and layout of documents.

Recent advances in this field have been a result of a multi-disciplinary perspective spanning not only computer vision, but also natural language processing, document and layout understanding, knowledge representation and reasoning, data mining, information retrieval, and more. The goal of this workshop is to raise awareness about the aforementioned topics in the broader computer vision community, and gather vision, NLP and other researchers together to drive a new wave of progress by cross pollinating more ideas between text/documents and non-vision related fields.

The workshop will be a hybrid, full-day event comprising invited talks, oral and poster presentations of submitted papers and a special challenge on Out of Vocabulary scene text understanding.

Keynote speakers

Xiang Bai (Huazhong University)
Tal Hassner (Meta AI)
Aishwarya Agrawal (University of Montreal, DeepMind)
Sharon Fogel (AWS AI Labs)

Topics of Interest

The workshop welcomes original work on any text-dependent computer vision application, such as:

Scene text understanding
Scene text VQA
Image-text aware cross-modal retrieval
Image-text for fine-grained classification
Text in video
Document VQA
Document layout prediction
Table detection
Information extraction

Challenge on Out-of-Vocabulary Scene Text Understanding

A challenge on Out of Vocabulary Scene Text Understanding (OOV-ST) will be organised in the context of this workshop. The OOV-ST challenge aims to evaluate the ability of text extraction models to deal with out-of-vocabulary (OOV) words, that have NEVER been encountered in the training set of the most common Scene Text understanding datasets to date. The challenge is organised jointly by Amazon Research, Google Research, Meta AI, and the Computer Vision Center.

To participate to the OOV_ST Challenge, please join through the RRC Portal.

https://rrc.cvc.uab.es/?ch=19

Important dates

Paper Submission Deadline: August 1, 2022

Notification to Authors: August 15, 2022

Workshop Camera Ready Due: August 22, 2022

Workshop Date: October 2022

Organisers

Ron Litman, AWS AI Labs

Aviad Aberdam, AWS AI Labs

Shai Mazor, AWS AI Labs

Hadar Averbuch-Elor, Cornell University

Dimosthenis Karatzas, Computer Vision Center / Autonomous University of Barcelona

R. Manmatha, AWS AI Labs

7) Workshop on Identity Document Analysis and Verification – ICPR 2022

1^st Workshop on Identity Document Analysis and Verification
Montréal (Québec), in conjunction with ICPR 2022.
https://www.icpr-ariadnext.com/

WIDAV focuses on addressing theoretical and practical works related to the field of identity document analysis, both from static images and from videos. This workshop will be the opportunity to share the last advances in the field of ID document analysis among the community. It will mainly include works from the fields of machine learning, document analysis, and forensics.

Topics of Interest:

– Document localization in the wild
– Identity document classification
– Optical character recognition (OCR)
– Fraud analysis and detection

Important dates :

– Submission deadline    06 June 2022
– Acceptance notification 15 July 2022
– Camera ready version   01 August 2022
– WIDAV2022 Workshop   TBD

Submission, attendance and publications:

– WIDAV 2022 invites the submission of substantially original, previously unpublished work and also welcomes ICPR 2022 re-submissions.
– Attendance and presentations for this first edition will be fully remote
– Accepted papers will be published by IEEE and be available in IEEE Xplore.

To find more information, please visit the workshop website: https://www.icpr-ariadnext.com/

8) Workshop on Reproducible Research in Pattern Recognition – ICPR 2022

Fourth Workshop on Reproducible Research in Pattern Recognition
Satellite event of ICPR 2022
21 Aug 2022 Montréal (Canada)
https://rrpr2022.sciencesconf.org

This Call for Papers expects two kinds of contributions.

The track 1 on RR Frameworks is dedicated to the general topics of Reproducible Research in experimental Computer Science with clear links to Image Processing and Pattern Recognition. Papers describing experiences, frameworks or platforms are welcome. The contributions might also include discussions on software libraries, experiences highlighting how the works benefit from Reproducible Research.

In the track 2 on RR Results, authors are invited to describe their works in terms of Reproducible Research. For example, authors of papers already accepted to ICPR might propose a companion paper describing the quality of the reproducible aspects. In particular the papers of this track can focus mainly (but not limited) for instance on:

Algorithmic implementation details
Influence of parameter(s) for the result quality (criteria to optimize them).
Integration of source code in an other framework.
Known limitations (or difficult cases).
Future improvements.
Installation procedure.

For this track, the topics could overlap with the main topics of the ICPR tracks:

Geometry and Deep Learning (special track)
Discrete Geometry and Mathematical Morphology
Pattern Recognition and Machine Learning
Computer Vision and Robot Vision
Image, Speech, Signal, and Video Processing
Document Analysis, Biometrics, and Pattern Recognition Applications.
Biomedical Image Analysis and Applications

9) Call for paper Pattern Recognition Letters – special issue

Advances and New challenges in Document Analysis, processing and Recognition at the Dematerialization Age

https://www.journals.elsevier.com/pattern-recognition-letters/call-for-papers/advances-and-new-challenges-in-document-analysis-processing-and-recognition-at-the-dematerialization-age

Description of the issue

Document Analysis and Recognition (DAR) aims at the processing, extraction and recognition of information contained in documents and initially addressed to human comprehension. Almost all sectors (banks, public administrations, etc.) are living a digital transformation boosted by the current COVID pandemic emergency. Different technologies characterize this scenario: mobile devices, standard acquisition and processing tools, cloud computing, cybersecurity and privacy are just some examples. The document is now part of an integrated and extended system which not only considers it as a standalone element, but it is also linked to many different users and to other digital elements including other documents, metadata, digital contents, and database records.

This special issue is devoted to present and collect the most recent advances to process documents in the stand-alone modality as well as in an integrated and extended way.

Document indexing, Natural Language Processing, summarization and translation are crucial steps for digital document archiving and augmentation. Nevertheless, it is well known that digital document containing handwriting can convey very sensitive information about the writer: age, sex, emotion, identity and health status are just some examples. Under this light, the processing of these data is very relevant in many applications deserving specific investigation as well as privacy preserving.

Papers presenting reviews, alternative perspectives, new applications and methods in the field of DAR are welcome. Research contributions should have a significant advancement both in the state-of-the-art for document analysis as well as for machine learning and pattern recognition fields.

Topics of interest

Topics of interest to this special issue include, but are not limited to:

Document Layout Analysis evaluation, recognition, semantic information extraction and benchmarking
Camera-based document scanning, processing, segmentation, and recognition
Document Understanding and information extraction
Structured and unstructured documents processing
Natural language processing, information extraction and retrieval
Handwritten/printed document images, information extraction and retrieval
Document Indexing
Handwritten text recognition
Document typeface, script and writing analysis and recognition
Object detection and structure modeling: tables, forms, identification documents, drawn sketches, etc.
Biometrics: writer identification and signature verification
Soft biometrics and meta-data extraction and manipulation
Deep Learning vs Shallow Learning techniques in real and benchmarking scenarios
Real-time document analysis
Interoperability of systems (e.g. cross dataset evaluations, standards,etc.)
Document augmentation
Human-Document Interaction
Large digital archives

Deadlines

Submission period: 1-20 July-2022

Guest Editors

Dr. Donato Impedovo (ME) – University of Bari Aldo Moro (Italy)
Dr. Byron Leite Dantas Bezerra – University of Pernambuco (Brazil)
Dr. Alejandro H. Toselli – Northeastern University (USA)
Dr. Giuseppe Pirlo – University of Bari Aldo Moro (Italy)

10) IJDAR article alert (vol. 25, issue 1)

Volume 25, Issue 1, March 2022
https://link.springer.com/journal/10032/volumes-and-issues/25-1

TableSegNet: a fully convolutional network for table detection and segmentation in document images
Duc-Dung Nguyen

TG2: text-guided transformer GAN for restoring document readability and perceived quality
Oldřich Kodym, Michal Hradiš

MRZ code extraction from visa and passport documents using convolutional neural networks
Yichuan Liu, Hailey James, Otkrist Gupta, Dan Raviv

An end-to-end network for irregular printed Mongolian recognition
ShaoDong Cui, YiLa Su, Ren Qing dao er ji, YaTu Ji

A novel normal to tangent line (NTL) algorithm for scale invariant feature extraction for Urdu OCR
Asma Naseer, Sarmad Hussain, Kashif Zafar, Ayesha Khan

11) IAPR Newsletter summary – April 2022

The April 2022 Issue of the IAPR Newsletter is available at: https://iapr.org/docs/newsletter-2022-02.pdf

In this issue you will find:

• From the Editor’s Desk: Add your voice to theirs, Gender Visibility in the Pattern Recognition Community, a collection of Her Stories
• CALLS for PAPERS
• Calls from the IAPR Education Committee, Industrial Liaison Committee, and ExCo
• IAPR…The Next Generation: Vishal M. Patel, winner of the 2021 Young Biometrics Investigator Award at IJCB 2021
• From the ExCo: News plus Gender Visibility in the Pattern Recognition Community, Part 2
• In Memoriam: Sargur N. Srihari
• ICPR 2022 Registration, Workshops, and Challenges
• IAPR Technical Committee (TC) News: TC1, TC4, TC5, TC6, TC7, TC12, TC18 and TC19
• Meeting Reports: CIARP Porto, ACPR 2021,
DICTA 2021, CVIP 2021, WSB 2022, and ISPR 2022
• Bulletin Board: Pattern Recognition Letters Special Issues
• Meeting and Education Planner

Wishing you all happy reading!

Stay safe,
Jing Dong
IAPR Newsletter Editor
Associate Professor, National Laboratory of Pattern Recognition
Institute of Automation, Chinese Academy of Sciences, Beijing, China
jdong@nlpr.ia.ac.cn

12) Job offers – 1 new

Research Engineer/PostDoc Position (2.5 Years) – IRISA/INSA Rennes (France) – new

Syntactical Models for Historical Music Score Recognition

PDF version: https://www.irisa.fr/offres-emploi/2022-03/syntactical-models-historical-music-score-recognition

IRISA – Intuidoc

IRISA is a joint research center for Informatics, including Robotics and Image and Signal Processing. 850 people, 40 teams, explore the world of digital sciences to find applications in healthcare, ecology-environment, cyber-security, transportation, multimedia, and industry… INSA Rennes is one of the 8 trustees of IRISA.

The Intuidoc team (https://www.irisa.fr/intuidoc) conducts researches on the topic of document image recognition. Since many years, the team proposes a system, called DMOS-PI method, for document structure analysis of documents. This DMOS-PI method is used for document recognition, or field extraction in archive documents, handwritten contents damaged documents (musical scores, archives, newspapers, letters, electronic schema, …).

Collabscore project

Collabscore is a project founded by ANR (French Research National Agency), led by the CNAM. The goal is to study ancient scores provided by the BNF (Bibliothèque National de France) and Royaumont foundation. Collabscore is a multidisciplinary project. The first task consists in improving OMR (Optical Music Recognition) results using learning techniques. The second action will focus on methods for automatic alignment of the scored score with other multimodal sources. The last one will set up demonstrators based on notated scores at two of the project partners, representative, in various ways, of institutions in charge of musical heritage collections (BnF and Fondation Royaumont). Intuidoc team focuses on the first task of musical score recognition.

Position to be filled

Position: Post-doctoral fellow / Research Engineer
Time commitment: Full-time
Duration of the contract: up to 18 months, starting as soon a possible
Indicative salary: Up to €36 000 gross annual salary (according to experience), with social security benefits
Location: IRISA — Rennes, France

Missions

The engineer fellow will work on the conception of an OMR system. Based on previous works of our research team, the goal of this position is to enrich an existing system (DMOS-PI) to get a complete self-adaptative OMR system for historical orchestra scores. The tasks are mainly:

define a grammatical description of musical notation, using the existing DMOS-PI method;
integrate symbol recognizers developed in another part of the project;
integrate anomaly detection into the system.

Logical programming from grammars and languages is expected in this work.

Applicant Requirements

Master degree or PhD in computer science.
Experience in document recognition or statistical analysis is expected but not mandatory.
Skills in grammars and languages and/or logical programming are nice-to-have, as well as knowledge of music notation.

Candidates should contact via email: Bertrand Coüasnon (bertrand.couasnon@irisa.fr), Aurélie Lemaitre (aurelie.lemaitre@irisa.fr) and Yann Soullard (yann.soullard@irisa.fr).

Post-doctoral research position – L3i – La Rochelle, France

Title: Extraction of information in “Bande Dessinée” / Manga / Comics albums

The L3i laboratory has one open post-doc position in computer science, in the specific field of document image analysis and pattern recognition.

Duration: 24 months
Position available from: October 1st, 2021
Salary: approximately 2100 € / month (net)
Place: L3i lab, University of La Rochelle, France
Specialty: Computer Science/Image Processing/Document Analysis/Pattern Recognition/Deep Learning
Contact: Jean-Christophe BURIE (jcburie [at] univ-lr.fr)

Position Description

The L3i is a research lab of the University of La Rochelle. La Rochelle is a city in the south west of France on the Atlantic coast and is one of the most attractive and dynamic cities in France. The L3i works since several years on document analysis and has developed a well-known expertise in ‘Bande dessinée”, manga and comics analysis, indexing and understanding.

The work done by the post-doc will be part of the SAIL (Sequential Art Image Laboratory) a joint laboratory involving L3i and a private company. The objective is to create innovative tools to index and interact with digital comics. The work will be done in a team of 10 researchers and engineers.

The work entrusted to the recruited person will consist in developing original approaches for extracting relevant information in comics panels in order to understand its content. The team has already developed some methods to extract panels, speech balloons, text, characters (persons), faces. However, the large variability of representation of these elements requires to propose different approaches or strategies. According to the skills and knowledge of the candidate, he/she will be able to work to improve methods dedicated to one of these elements.

Other challenges may also be considered such as :

Detection and understanding of the scenery (sea, countryside, city, …)
Detection and understanding of the context of the scene (battle, discussion, people eating, …)
Object detection and recognition (bicycle, car, table, chair, …)
…

Traditional and/or deep learning-based strategies can be studied to achieve these objectives.

Qualifications

Candidates must have a completed PhD and a research experience in image processing and analysis, pattern recognition. Good knowledge and experience in deep learning are also recommended.

General Qualifications

• Good programming skills mastering at least one programming language like Python, Java, C/C++
• Good teamwork skills
• Good writing skills and proficiency in written and spoken English or French

Applications

Candidates should send a CV and a motivation letter to jcburie [at] univ-lr.fr.

1) Upcoming deadlines and events

2026

2) Open Call for Organizing DAR Events

3) IAPR TC10/TC11 Summer School on Document Analysis

4) ICDAR 2026 competition selection

ICDAR2026 Competition on Multimodal Reasoning over Documents in Multiple Domains (DocVQA2026)

ICDAR 2026 Competition on Writer and Pen Identification from Hand-Drawn Circles (CircleID)

5) ICDAR 2026 workshop selection

DAS2026 Document Analysis System

Topics of interest:

Submission Types:

Important Dates (NO EXTENSIONS, all deadlines are AoE time):

HIP: 8th International Workshop on Historical Document Imaging and Processing

WoRMS: 7th International Workshop on Reading Music Systems

Important Dates

6) ICPR 2026 workshop selection

7th International Workshop on coMics ANalysis, Processing and Understanding

Important dates

Scope and Topics

TrustDoc Trustworthy Document Understanding: Privacy, Unlearning, Robustness, and Explainability

Important Dates

Publication

Contact

7) Job offer (1 new)

Biomedical AI Scientists (multiple)

1) Upcoming deadlines and events

2025

2) Message from the new TC10 Chair

3) ICDAR 2025 competition list

4) Open Call for Organizing DAR Events

5) IJDAR article alert (vol. 27, issue 3)

6) Job offers – 1 new

Postdoctoral Researcher in Artificial Intelligence,AI Research LabUniversity of South Dakota

1) Upcoming deadlines and events

2024

2) IAPR/ICDAR Young Investigator Award – nomination

3) ICDAR 2024 Doctoral consortium

4) ICDAR 2024 MANPU workshop – news

The patterns of global comicsNeil Cohn (Tilburg University)

5) ICDAR 2024 DAS workshop

6) ICDAR 2025 Call for Paper

7) 2025 ICDAR-IJDAR Journal Track – Call for Paper

8) Open Call for Organizing DAR Events

9) IJDAR article alert (vol. 27, issue 2)

10) Job offers – repost

1) Upcoming deadlines and events

2024

2) MANPU@ICDAR 2024 Call for Paper

Why should we research on Comics?

Important Dates

Scope and Topics

Datasets

Paper Submissions

3) DAS@ICDAR 2024 Call for Paper

4) Call for Nominations for the 2024 IAPR Fellow Awards – repost

5) ICDAR 2024 Occluded RoadText Competition

Competition Website

Important Dates

Organizers

6) ICDAR 2024 Competition on Handwriting Recognition of Historical Ciphers

Call for participation

7) ICDAR 2024 Competition on Map Text Detection, Recognition, and Linking

Call for Participation

8) SSDA 2025: Call for proposal for the next summer school on document analysis – repost

9) ICPR 2024 call for paper – extended

10) International Computer Vision Summer School

11) Call for papers Special Issue: Heritage Preservation in the Digital Age

Aims and Scope

Important Dates

Guest Editors

Submission Guidelines

12) Open Call for Organizing DAR Events

13) IJDAR article alert (vol. 27, issue 1)

14) Job offers – 1 new

1) Upcoming deadlines and events

2023

2024 and later

2) Welcome to the new TC10 educational officer and dataset curator

3) Call for Nominations for the 2024 IAPR Fellow Awards

4) Call for Nominations for the IAPR/ICDAR Young Investigator Award

Postdoctoral Researcher in Artificial Intelligence,
AI Research Lab
University of South Dakota

The patterns of global comics
Neil Cohn (Tilburg University)