Reinforcement Learning for Controlled Text Generation - Naver Labs Europe
preloder
NAVER LABS Europe
    Published
    30 September 2019
    Location
    Meylan, France
    Job Type
    Start date
    Fall 2019 or early 2020
    Duration
    5-6 months

    Description

    NAVER LABS Europe is opening a research internship on Reinforcement Learning techniques for applications to controlled text generation.

    Under conditions where training data is limited, standard end-to-end training of seq2seq models may generalize poorly and produce inadequate results at test time. A possible remedy is to augment models with rewards that control the quality of the outputs. These rewards can address two complementary goals: (i) taking into account global characteristics of observed sequences that go beyond standard local teacher-forcing training techniques (observation bias problem), and (ii) moving the generation process towards desired properties of the output (e.g. favoring shorter sentences or performing style transfer).

    Supervisors: Marc Dymetman and Hady Elsahar.

    We are looking for a motivated intern to help us develop methods and algorithms for addressing this general problem, both in theory and in practice. Experiments will be conducted on selected text generation tasks (NLG, summarization or machine translation).

    The successful candidate should be enrolled in a graduate program, at the Master or (preferably) PhD level, with experience (ideally) in Deep Learning, Reinforcement Learning and Natural Language Processing.
    Publication of results in major conferences/journals will be strongly encouraged.

    Required skills

    Strong mathematical and programming skills as well as familiarity with one of the major current deep learning toolkits (PyTorch preferred but not compulsory) are a requirement.

    Application instructions

    Please note that applicants must be registered students at a university or other academic institution and that this establishment will need to sign an 'Internship Convention' with NAVER LABS Europe before the student is accepted.

    You can apply for this position online. Don't forget to upload your CV and cover letter before you submit. Incomplete applications will not be accepted.

    About NAVER LABS

    NAVER LABS is a world class team of self-motivated and highly engaged researchers, engineers and interface designers collaborating together to create next generation ambient intelligence technology and services that are rich with the organic understanding they have of users, their contexts and situations.

    Since 2013 LABS has led NAVER’s innovation in technology through products such as the AI-based translation app ‘Papago’, the omni-tasking web browser ‘Whale’, the virtual AI assistant ‘WAVE’, in-vehicle information entertainment system ‘AWAY’ and M1, the 3D indoor mapping robot.

    The team in Europe is multidisciplinary and extremely multicultural specializing in artificial intelligence, machine learning, computer vision, natural language processing, UX and ethnography. We collaborate with many partners in the European scientific community on R&D projects.

    NAVER LABS Europe is located in the south east of France in Grenoble. The notoriety of Grenoble comes from its exceptional natural environment and scientific ecosystem with 21,000 jobs in public and private research. It is home to 1 of the 4 French national institutes in AI called MIAI (Multidisciplinary Innovation in Ai) It has a large student community (over 62,000 students) and is a lively and cosmopolitan place, offering a host of leisure opportunities. Grenoble is close to both the Swiss and Italian borders and is the ideal place for skiing, hiking, climbing, hang gliding and all types of mountain sports.

    Apply
    Drop files here browse files ...
    Captcha
    Are you sure you want to delete this file?
    /