Skip to content

KeisukeImoto/ACalt4

Repository files navigation

AudioCaps Alternative Captions

We created alternative captions for AudioCaps, AudioCaps Alternative 4 Captions (ACalt4). While the files in this folder do not provide complete information about how we generate, they are for your reference for your future extended versions.

References

  • [AudioCaps] C. D. Kim, B. Kim, H. Lee, and G. Kim, “AudioCaps: Generating Captions for Audios in The Wild,” in NAACL-HLT, 2019.
  • [BLIP-2] J. Li, D. Li, S. Savarese, and S. C. H. Hoi, “BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models,” in ICML, 2023.
  • [AudioSet] J. F. Gemmeke, D. P. W. Ellis, D. Freedman, A. Jansen, W. Lawrence, R. C. Moore, M. Plakal, and M. Ritter, “Audio Set: An ontology and human-labeled dataset for audio events,” in ICASSP, 2017, pp. 776–780.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages