Robert R George
a98a8ee93f
Update robots.txt to prevent crawling of domain blocks (#26470)

Co-authored-by: Claire <claire.github-309c@sitedethib.com>

2024-12-02 08:03:24 +00:00
						 
				 
			
				
					
						
							
							
Foritus
405f141fe0
Change: Block GPTBot (#26396)

2023-08-09 11:58:46 +02:00
						 
				 
			
				
					
						
							
							
ThibG
c4f2433300
Disallow robots from indexing /interact/ (#10666)

This does not provide any new information and may just triple the number of crawled pages.

2019-05-02 00:10:19 +02:00
						 
				 
			
				
					
						
							
							
nightpool
a5992e5883
Change robots.txt to exclude only media proxy URLs (#10038)

* Revert "Change robots.txt to exclude some URLs (#10037)"
  This reverts commit 80161f43510ad9316c60c9b50dd5c09c2dae4d54.
* Let's block media_proxy
  /media_proxy/ is a dynamic route used for requesting uncached media, so it's probably bad to let crawlers use it
* misleading comment

2019-02-14 03:11:47 +01:00
						 
				 
			
				
					
						
							
							
Eugen Rochko
80161f4351
Change robots.txt to exclude some URLs (#10037)

- Exclude static assets
- Exclude uploaded files
- Exclude alternate versions of the profile page
- Exclude media proxy URLs

2019-02-13 21:28:18 +01:00
						 
				 
			
				
					
						
							
							
Eugen Rochko
9c4856bdb1
Initial commit

2016-02-20 22:53:20 +01:00