sfeed_web.1 1.1 KB

1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950
  1. .Dd July 27, 2021
  2. .Dt SFEED_WEB 1
  3. .Os
  4. .Sh NAME
  5. .Nm sfeed_web
  6. .Nd finds URLs to feeds from a HTML webpage
  7. .Sh SYNOPSIS
  8. .Nm
  9. .Op Ar baseurl
  10. .Sh DESCRIPTION
  11. .Nm
  12. reads the HTML data of the webpage from stdin and writes the found URLs to
  13. stdout.
  14. .Pp
  15. Such a link reference in HTML code looks like:
  16. .Bd -literal
  17. <link rel="alternate" href="atom.xml" type="application/atom+xml" />
  18. .Ed
  19. .Sh OPTIONS
  20. .Bl -tag -width 8n
  21. .It Ar baseurl
  22. Optional base URL to use for found feed URLs that are relative.
  23. .El
  24. .Sh OUTPUT FORMAT
  25. url<TAB>content-type<newline>
  26. .Bl -tag -width Ds
  27. .It URL
  28. Found relative or absolute URL.
  29. .Pp
  30. For relative URLs if a <base href="..." /> tag is found it will be used,
  31. otherwise if the
  32. .Ar baseurl
  33. option is specified then that is used, if neither are set then the relative URL
  34. is printed.
  35. .It content-type
  36. Usually application/atom+xml or application/rss+xml.
  37. .El
  38. .Sh EXIT STATUS
  39. .Ex -std
  40. .Sh EXAMPLES
  41. Get URLs from a website:
  42. .Bd -literal
  43. curl -s -L 'https://codemadness.org/' | sfeed_web 'https://codemadness.org/'
  44. .Ed
  45. .Sh SEE ALSO
  46. .Xr sfeed_update 1 ,
  47. .Xr sh 1
  48. .Sh AUTHORS
  49. .An Hiltjo Posthuma Aq Mt hiltjo@codemadness.org