Dropbox Tech Blog Note

Dropbox Tech Blog

The Dropbox Tech website is a platform where Dropbox shares its technology and engineering stories, insights, and innovations. The website features articles and blog posts written by Dropbox engineers and developers, covering a wide range of topics such as software development, data storage, security, and more. The website is divided into several sections, including a blog, a podcast, and a section for open-source projects. The blog features in-depth articles on various technical topics, including how Dropbox approaches certain technical challenges, new technologies and tools, and lessons learned from past projects. The podcast section features conversations with Dropbox engineers and other industry experts, discussing topics such as technology trends, innovation, and the future of software development. The open-source section showcases Dropbox's open-source projects, which are available on GitHub. These projects include tools and libraries developed by Dropbox engineers to solve specific problems or improve certain processes. Overall, the Dropbox Tech website is a valuable resource for developers, engineers, and anyone interested in technology and software development. It provides a unique glimpse into the inner workings of a successful tech company and offers insights and knowledge that can be applied to various areas of the tech industry.

Thread Of Notes

Insights from our executive roundtable on AI and engineering productivity

From Claude Code to Cursor, we're big adopters of AI coding tools at Dropbox. The early results have been promising, but there are still a lot of open questions about how to work with these tools most effectively and where they can have the most impact. To push this conversation forward, we hosted an executive roundtable at our San Francisco studio. Here's how it went.
CdXz5zHNQW_8elT0XPgU3.png

Is this a date? Using ML to identify date formats in file names

The article discusses Dropbox's implementation of a machine learning model designed to identify date formats in file names, enhancing file organization and retrieval. Effective file naming is crucial for teamwork, and Dropbox's automated naming conventions feature allows users to set rules for file names, ensuring consistency and efficiency. Initially, Dropbox attempted a rule-based approach for date identification but faced challenges due to the variety of date formats used by different individuals. This led to the development of a machine learning model that accurately recognizes dates within file names. The model underwent several stages, including data annotation, tokenization, and classification, utilizing techniques such as Inside-Outside-Beginning (IOB) tagging to label date components. The machine learning model, based on the transformer architecture (specifically DistilRoberta), showed a significant improvement over the previous rule-based system, increasing the number of renamed files by 40%. To optimize performance, Dropbox implemented techniques like model pruning and quantization, successfully reducing latency during inference. Following its rollout in August 2022, the feature gained popularity, with over a million files renamed shortly after launch. Future enhancements may include the extraction of additional entities beyond dates, leveraging advanced models for even greater accuracy in file naming conventions.