1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950 |
- .Dd July 27, 2021
- .Dt SFEED_WEB 1
- .Os
- .Sh NAME
- .Nm sfeed_web
- .Nd finds URLs to feeds from a HTML webpage
- .Sh SYNOPSIS
- .Nm
- .Op Ar baseurl
- .Sh DESCRIPTION
- .Nm
- reads the HTML data of the webpage from stdin and writes the found URLs to
- stdout.
- .Pp
- Such a link reference in HTML code looks like:
- .Bd -literal
- <link rel="alternate" href="atom.xml" type="application/atom+xml" />
- .Ed
- .Sh OPTIONS
- .Bl -tag -width 8n
- .It Ar baseurl
- Optional base URL to use for found feed URLs that are relative.
- .El
- .Sh OUTPUT FORMAT
- url<TAB>content-type<newline>
- .Bl -tag -width Ds
- .It URL
- Found relative or absolute URL.
- .Pp
- For relative URLs if a <base href="..." /> tag is found it will be used,
- otherwise if the
- .Ar baseurl
- option is specified then that is used, if neither are set then the relative URL
- is printed.
- .It content-type
- Usually application/atom+xml or application/rss+xml.
- .El
- .Sh EXIT STATUS
- .Ex -std
- .Sh EXAMPLES
- Get URLs from a website:
- .Bd -literal
- curl -s -L 'https://codemadness.org/' | sfeed_web 'https://codemadness.org/'
- .Ed
- .Sh SEE ALSO
- .Xr sfeed_update 1 ,
- .Xr sh 1
- .Sh AUTHORS
- .An Hiltjo Posthuma Aq Mt hiltjo@codemadness.org
|