Zobrazeno 1 - 10
of 29
pro vyhledávání: '"James B. Wendt"'
Publikováno v:
Proceedings of the VLDB Endowment. 14:997-1005
Extracting structured information from templatic documents is an important problem with the potential to automate many real-world business workflows such as payment, procurement, and payroll. The core challenge is that such documents can be laid out
Publikováno v:
Proceedings of the VLDB Endowment. 12:1235-1248
In emails, information abounds. Whether it be a bill reminder, a hotel confirmation, or a shipping notification, our emails contain useful bits of information that enable a number of applications. Most of this email traffic is machine-generated, sent
Autor:
Marc Najork, Navneet Potti, Qi Zhao, James B. Wendt, Bodhisattwa Prasad Majumder, Sandeep Tata
Publikováno v:
ACL
We propose a novel approach using representation learning for tackling the problem of extracting structured information from form-like document images. We propose an extraction system that uses knowledge of the types of the target fields to generate
Autor:
Marc Najork, Nguyen Vo, Qi Zhao, Sandeep Tata, Furkan Kocayusufoglu, Ying Sheng, James B. Wendt
Publikováno v:
WWW
Recent studies show that an overwhelming majority of emails are machine-generated and sent by businesses to consumers. Many large email services are interested in extracting structured data from such emails to enable intelligent assistants. This allo
Publikováno v:
IEEE BigData
Machine generated business-to-consumer (B2C) emails such as receipts, newsletters, and promotions constitute a large portion of users’ inboxes today. These emails reflect the users’ interests and often are sequentially correlated, e.g., users int
Publikováno v:
KDD
Extracting structured data from emails can enable several assistive experiences, such as reminding the user when a bill payment is due, answering queries about the departure time of a booked flight, or proactively surfacing an emailed discount coupon
Publikováno v:
WWW
A vast majority of the emails received by people today are machine-generated by businesses communicating with consumers. While some emails originate as a result of a transaction (e.g., hotel or restaurant reservation confirmations, online purchase re
Autor:
Miodrag Potkonjak, James B. Wendt, Jeyavijayan Rajendran, Bryant Wysocki, Ramesh Karri, Garrett S. Rose, Nathan McDonald
Publikováno v:
Proceedings of the IEEE. 103:829-849
Information security has emerged as an important system and application metric. Classical security solutions use algorithmic mechanisms that address a small subset of emerging security requirements, often at high-energy and performance overhead. Furt
Publikováno v:
WWW (Companion Volume)
According to recent estimates, about 90% of consumer received emails are machine-generated. Such messages include shopping receipts, promotional campaigns, newsletters, booking confirmations, etc. Most such messages are created by populating a fixed
Autor:
Michael Bendersky, Sujith Ravi, James B. Wendt, Marc-Allen Cartright, Lluis Garcia-Pueyo, Amitabh Saikia, Jie Yang, Balint Miklos, Ivo Krka, Vanja Josifovski
Publikováno v:
WSDM
Machine-generated documents such as email or dynamic web pages are single instantiations of a pre-defined structural template. As such, they can be viewed as a hierarchy of template and document specific content. This hierarchical template representa