📝 Note for Teachers: This answer key provides sample correct answers and guidance for grading. Student responses may vary in wording but should demonstrate similar understanding. Use the rubric at the end of this document to assess student work holistically. Remember that the goal is to help students understand how NLP systems process language, not to achieve perfect technical accuracy.
📄 Text Passage (Reference)
Passage A (Grades 6-8):
Last Tuesday, Emma Rodriguez visited the new Artificial Intelligence Museum in San Francisco. She was amazed by the interactive exhibits showcasing how computers can understand human language. "This technology is incredible!" Emma exclaimed to her friend Marcus. The museum opened three months ago and has already welcomed over 50,000 visitors. One exhibit demonstrated how chatbots use Natural Language Processing to answer questions. Emma learned that AI systems break down sentences into smaller parts called tokens. She found it fascinating that computers need step-by-step instructions to understand what humans grasp naturally. Before leaving, Emma purchased a robotics kit for $45.99 at the gift shop. She can't wait to explore more about AI technology!
✅ Task Answers
Answers:
a) Total number of tokens: Approximately 104-110 tokens (count may vary based on how punctuation is counted)
- If counting contractions as single tokens (can't = 1): ~104 tokens
- If splitting contractions (can't = 2: "can" + "n't"): ~106 tokens
- If counting all punctuation marks separately: ~110 tokens
b) Tokens with punctuation: Approximately 8-10
- "Tuesday," "Francisco." "incredible!" "Marcus." "ago" (could have comma) "questions." "tokens." "naturally." "$45.99" "shop." "technology!"
c) Examples of challenging tokens:
- can't - Contraction: Could be split into "can" + "n't" or kept as one token
- $45.99 - Number with currency symbol and decimal point
- step-by-step - Hyphenated compound word
- 50,000 - Large number with comma separator
d) Why tokenization is difficult:
Sample answer: Tokenization is difficult for computers because language has many special cases. Contractions need to be split or kept together, numbers can include commas and decimals, hyphenated words might be one word or multiple, and punctuation can be part of a word (like in "$45.99") or separate. Computers need rules for every possible situation, unlike humans who understand context naturally.
Teaching Note: Accept any reasonable token count between 100-115. The exact number depends on methodology. The important concept is that students understand tokenization involves systematic word separation and that edge cases (contractions, numbers, hyphenation) create challenges.
Sample Answers (15 words):
| # |
Word |
POS |
Explanation |
| 1 | Tuesday | N (Noun) | Name of a day - proper noun |
| 2 | visited | V (Verb) | Action word - past tense of visit |
| 3 | new | Adj (Adjective) | Describes the museum |
| 4 | amazed | V (Verb) | Past tense verb showing action/state |
| 5 | interactive | Adj (Adjective) | Describes the exhibits |
| 6 | how | Adv (Adverb) | Modifies "understand" - in what way |
| 7 | incredible | Adj (Adjective) | Describes technology |
| 8 | exclaimed | V (Verb) | Action of speaking loudly |
| 9 | three | Adj (Adjective) | Number describing months |
| 10 | already | Adv (Adverb) | Modifies "welcomed" - when/time |
| 11 | demonstrated | V (Verb) | Action word - past tense |
| 12 | smaller | Adj (Adjective) | Comparative adjective describing parts |
| 13 | naturally | Adv (Adverb) | Modifies "grasp" - in what manner |
| 14 | purchased | V (Verb) | Action of buying - past tense |
| 15 | robotics | N (Noun) | Thing/concept - name of field |
Reflection Answers:
a) Difficult words to categorize:
Sample answer: "opened" was tricky because it could be a verb (The museum opened) or an adjective (the opened box). In this sentence it's a verb. Also, "three" could be a number, but in grammar it's functioning as an adjective describing "months."
b) Same word, different parts of speech:
Sample answer: Yes! The word "light" can be different POS:
- "The light is bright" (noun)
- "Light the candle" (verb)
- "This box is light" (adjective)
Teaching Note: Student word choices will vary - that's expected! Grade based on correct identification of the part of speech for their chosen words, not on matching this exact list. Common challenges: distinguishing between past tense verbs and past participles used as adjectives, identifying adverbs that don't end in "-ly," and recognizing numbers as adjectives.
Complete List of Named Entities:
| Entity |
Category |
Importance |
| Last Tuesday | DATE/TIME | Tells when the event occurred |
| Emma Rodriguez | PERSON | Main character of the story |
| Artificial Intelligence Museum | ORGANIZATION | Place where events happen |
| San Francisco | LOCATION | Geographic location of museum |
| Marcus | PERSON | Emma's friend mentioned in story |
| three months ago | DATE/TIME | When museum opened |
| 50,000 | QUANTITY | Number of visitors |
| Natural Language Processing | ORGANIZATION/CONCEPT | Key technology discussed |
| $45.99 | MONEY | Price of robotics kit |
| Emma (second mention) | PERSON | Continued reference to main character |
Analysis Answers:
a) Total entities: 10 distinct named entities (some names appear multiple times)
b) Category with most entities: DATE/TIME and PERSON (tied with 2-3 each, depending on if "Emma" mentions are counted separately)
c) Real-world uses of NER:
Sample answers:
- News organizations can automatically tag articles by people, places, and events mentioned
- Search engines use NER to understand what you're looking for ("restaurants in Boston")
- Calendar apps can extract dates and times from emails to create reminders
- Business intelligence tools track mentions of companies and competitors
d) Challenges in identifying entities:
Sample answer: Computers face several challenges: (1) Names can be common words (Apple is both a fruit and a company), (2) Context matters to determine entity type, (3) New names and entities are created all the time, (4) Different cultures have different naming patterns, (5) Abbreviations and nicknames need to be linked to full names.
Teaching Note: Some students may debate whether "Natural Language Processing" is an organization or a concept/technology. Accept either interpretation with reasonable justification. The key is recognizing it as a named entity that's important to the passage.
Sentiment Assessment:
a) Overall sentiment: ☑ Positive (or Very Positive)
b) Explanation:
Sample answer: The passage is clearly positive because Emma is described as "amazed," she calls the technology "incredible," she finds things "fascinating," and she "can't wait" to explore more. There are no negative statements or complaints. The entire tone is enthusiastic and excited about AI and the museum visit.
Emotion Words:
| POSITIVE Words/Phrases |
NEGATIVE Words/Phrases |
- amazed
- incredible
- exclaimed (with excitement)
- fascinating
- can't wait
- explore more
- new (in positive context)
- interactive
|
- None found in this passage
|
Analysis Answers:
a) Neutral statements (facts):
- "The museum opened three months ago" - factual date
- "Emma purchased a robotics kit for $45.99" - factual transaction
- "AI systems break down sentences into smaller parts called tokens" - technical explanation
- "has already welcomed over 50,000 visitors" - statistical fact
b) Sarcasm changing sentiment:
Sample answer: If read sarcastically, "incredible" could mean the opposite - that the technology is disappointing or unimpressive. For example, if Emma said "This technology is incredible" with an eye roll, it would actually be negative. This shows why sarcasm is so hard for AI - the same words have opposite meanings depending on tone and context.
c) Computer challenges in detecting sentiment:
Sample answer: Computers would struggle because: (1) They can't hear tone of voice, (2) They might not understand that "can't wait" is positive enthusiasm, not literally being unable to wait, (3) Context matters - "incredible" could be positive or sarcastic, (4) Cultural differences in how emotions are expressed, (5) Subtle emotional nuances that require life experience to understand.
d) Real-world sentiment analysis:
Sample answers:
- Companies analyze customer reviews to understand satisfaction levels
- Social media platforms monitor posts to detect harmful content or users in distress
- Political campaigns analyze public opinion about candidates
- Movie studios track reactions to trailers and films
Main Topic and Key Phrases:
a) Main topic:
Sample answers (students should write something similar):
- "Emma Rodriguez visits an AI museum and learns about natural language processing."
- "A student's visit to a museum teaches her about how computers understand language."
- "Emma's experience at the Artificial Intelligence Museum exploring NLP technology."
b) Five key phrases:
- Artificial Intelligence Museum
- Natural Language Processing (NLP)
- How computers understand human language
- Chatbots and AI systems
- Breaking sentences into tokens
c) 20-word summary:
Sample answer: "Emma visited an AI museum in San Francisco and learned how computers use natural language processing to understand human language."
(Exactly 20 words)
Semantic Understanding:
a) Who is "She"?
Emma Rodriguez
b) How did you know?
Sample answer: The previous sentences are all about Emma - she was amazed, she exclaimed, Emma learned. "She" comes right after talking about what Emma learned, so it must refer to Emma. Also, Emma is the only female person mentioned who would be finding things fascinating at the museum.
c) Why is this difficult for computers?
Sample answer: Computers have to track which nouns the pronoun "she" could refer back to. In this passage, "she" appears multiple times and could theoretically refer to any female person mentioned. Computers need complex rules to understand pronoun references (called "anaphora resolution") and must consider sentence structure, proximity, and context - things humans do automatically without thinking.
Teaching Note: The pronoun resolution question is crucial - it demonstrates semantic understanding. Good answers will explain how humans use context clues and implicit reasoning, while computers need explicit rules and algorithms to make the same connections.
Sample Reflection Answers:
1. Easiest task:
Sample answers might include:
- "Tokenization was easiest because I just had to count words and look for spaces"
- "Sentiment analysis was easiest because the passage was clearly happy and positive"
- "Named entity recognition was easiest because names and dates are obvious"
2. Most challenging task:
Sample answers might include:
- "POS tagging was hard because some words can be multiple parts of speech"
- "Semantic understanding was challenging because you have to track pronouns and references"
- "Key phrase extraction was difficult because everything seemed important"
3. Manual vs. computer analysis:
Sample answer: "When I analyzed manually, I used my understanding of context, grammar rules I learned, and common sense. A computer has to follow strict algorithms and rules for every step. I can 'just know' what a word means, but a computer has to check thousands of patterns and examples. Humans are flexible; computers are systematic."
4. Biggest NLP challenge:
Sample answers:
- "Understanding context is the biggest challenge because words change meaning based on situation"
- "Dealing with ambiguity - humans use common sense but computers need explicit instructions"
- "Sarcasm, idioms, and figurative language don't make literal sense"
- "Language constantly evolves with new slang and expressions"
5. NLP in daily life:
Sample answers:
- "A homework helper that can read my essay and suggest improvements would save me time"
- "Better autocorrect that understands context would reduce embarrassing text message errors"
- "Voice assistants that actually understand what I mean, not just the words I say"
- "Translation apps that work perfectly so I could talk to people in any language"
6. Designing an NLP application:
Sample answers (should demonstrate understanding of NLP concepts):
- "An app that reads news articles and creates simple summaries for kids to understand current events"
- "A chatbot that helps students who are feeling anxious by recognizing emotional words and responding supportively"
- "A tool that analyzes social media posts to detect cyberbullying before it gets worse"
- "An app that translates slang and idioms for English language learners"
Teaching Note: Reflection answers will be highly personal and varied. Look for evidence that students: (1) understand the core NLP concepts, (2) recognize challenges computers face that humans don't, (3) can connect NLP to real-world applications, and (4) demonstrate critical thinking about technology and its implications.
📊 Grading Rubric
Suggested Point Distribution (Total: 100 points)
| Task/Section |
Points |
Criteria |
| Task 1: Tokenization |
15 |
Reasonable token count (10 pts), identified challenging tokens (3 pts), explained difficulty (2 pts) |
| Task 2: POS Tagging |
20 |
Correctly identified 15 words and POS (12 pts), reasonable explanations (5 pts), reflection questions (3 pts) |
| Task 3: Named Entity Recognition |
20 |
Identified 8+ entities correctly (12 pts), proper categorization (5 pts), analysis questions (3 pts) |
| Task 4: Sentiment Analysis |
15 |
Correct overall sentiment (3 pts), identified emotion words (6 pts), thoughtful analysis answers (6 pts) |
| Task 5: Key Phrases & Summary |
15 |
Accurate main topic (3 pts), appropriate key phrases (5 pts), concise summary (4 pts), semantic understanding (3 pts) |
| Task 6: Reflection |
15 |
Thoughtful, detailed responses showing understanding of NLP concepts and challenges (2.5 pts per question) |
Overall Quality Assessment:
| Score Range |
Description |
| 90-100 (A) |
Exceptional understanding of NLP concepts. Accurately completed all tasks with minimal errors. Reflections demonstrate deep thinking about how computers process language vs. humans. Clear connections to real-world applications. |
| 80-89 (B) |
Strong understanding of NLP concepts. Most tasks completed accurately with few errors. Reflections show good thinking about computer language processing. Some connections to real-world applications. |
| 70-79 (C) |
Basic understanding of NLP concepts. Tasks completed with several errors or omissions. Reflections are present but may lack depth. Limited connections to real-world applications. |
| 60-69 (D) |
Partial understanding of NLP concepts. Many tasks incomplete or inaccurate. Reflections are minimal or superficial. Struggles to connect concepts to applications. |
| Below 60 (F) |
Limited understanding of NLP concepts. Most tasks incomplete or incorrect. Minimal effort in reflections. Does not demonstrate basic understanding of how computers process language. |
📌 Additional Grading Guidelines:
- Effort matters: Even if students make errors, reward genuine effort and thoughtful reasoning
- Accept variation: There may be multiple correct answers, especially for subjective tasks like sentiment analysis
- Focus on concepts: The goal is understanding NLP principles, not perfect technical execution
- Provide feedback: Use this as a teaching moment - write comments explaining why answers are correct or how they could be improved
- Differentiate grading: Consider students' starting knowledge levels and grade based on individual growth