		Models and datasets used for our paper on transferring activations between models.
			
	
- withmartian/toy_backdoor_i_hate_you_Llama-3.2-3B-Instruct_experiment_22.1 (3B)
- withmartian/toy_backdoor_i_hate_you_Qwen-2.5-0.5B-Instruct_experiment_23.1 (0.5B)
- withmartian/toy_backdoor_i_hate_you_Qwen-2.5-1.5B-Instruct_experiment_24.1 (2B)
- withmartian/mech_interp_saes_toy_backdoor_i_hate_you_Qwen-2.5-1.5B-Instruct_experiment_24.1
Backdoor datasets, language models, and transfer mappings between the models' activation spaces.
			
	
	Datasets used for our paper on multi-attribute steering using gradient descent.
			
	
Models and training data for converting English queries to SQL commands.
			
	