Henishma
/

crisisMMD_task2

Model card Files Files and versions

crisisMMD_task2 / README.md

Henishma's picture

Update README.md

28917ee verified 4 months ago

|

history blame contribute delete

2.4 kB

	---
	datasets:
	- QCRI/CrisisMMD
	language:
	- en
	metrics:
	- accuracy
	- f1
	- recall
	- precision
	base_model:
	- google-bert/bert-base-uncased
	- microsoft/resnet-50
	---
	Source: CrisisMMD dataset (Alam et al., 2017)

	✅Original Labels (8 classes from annotations):

	Infrastructure and utility damage

	Vehicle damage

	Rescue, volunteering, or donation efforts

	Affected individuals

	Injured or dead people

	Missing or found people

	Other relevant information

	Not humanitarian

	✅Label Preprocessing (Class Merging):

	Vehicle damage merged into Infrastructure and utility damage

	Missing or found people merged into Affected individuals

	Not humanitarian retained as a separate class

	Removed very low-frequency categories (e.g., "Missing or found people" as a separate class)

	✅Final Label Set (5 classes total):

	Infrastructure and utility damage

	Rescue, volunteering, or donation efforts

	Affected individuals

	Injured or dead people

	Not humanitarian

	✅Multimodal Consistency:

	Selected only those posts where text and image annotations matched

	Resulted in a total of 8,219 consistent samples:

	Train set: 6,574 posts

	Test set: 1,644 posts

	✅ Preprocessing Done
	Text:

	Tokenized using BERT tokenizer (bert-base-uncased)

	Extracted input_ids and attention_mask

	Image:

	Processed using ResNet-50

	Extracted 2048-dimensional image features

	The preprocessed data was saved in PyTorch .pt format:

	train_human.pt and test_human.pt

	Each contains: input_ids, attention_mask, image_vector, and label

	✅ Model Architecture
	A custom multimodal classifier that combines BERT and ResNet-50 outputs:

	Component Details
	Text Encoder BERT base (bert-base-uncased) – outputs pooler_output (768-d)
	Image Encoder Pre-extracted ResNet-50 image features (2048-d)
	Fusion Concatenation → FC layers → Softmax over 5 classes
	Classifier Fully connected layers with BatchNorm, ReLU, Dropout

	✅ Training Setup
	Loss Function: CrossEntropyLoss

	Optimizer: AdamW

	Scheduler: StepLR (γ = 0.9)

	Epochs Tried: 1, 3, 5, 8, 10

	Batch Size: 16

	Runtime: ~2 minutes 20 seconds per epoch on Google Colab (T4 GPU)

	✅ Evaluation Metrics
	Accuracy

	Precision

	Recall

	F1 Score

	✅ Metrics(epoch 3 with highest accuracy)

	✅ Test Accuracy : 0.8820
	✅ Precision : 0.6854
	✅ Recall : 0.7176
	✅ F1 Score : 0.7005

	The new dataset created: https://huggingface.co/datasets/Henishma/crisisMMD_cleaned_task2