Swiss-Prot entry

ID   NHR34_CAEEL    STANDARD;      PRT;   605 AA.
AC   Q21006; Q95NU3; Q9GTH8;
DT   30-MAY-2000 (Rel. 39, Created)
DT   28-FEB-2003 (Rel. 41, Last sequence update)
DT   01-MAY-2005 (Rel. 47, Last annotation update)
DE   Nuclear hormone receptor family member nhr-34.
GN   Name=nhr-34; ORFNames=F58G6.5;
OS   Caenorhabditis elegans.
OC   Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; 
OC   Rhabditidae; Peloderinae; Caenorhabditis. 
OX   NCBI_TaxID=6239;
RN   [1]
RP   NUCLEOTIDE SEQUENCE (ISOFORM A).
RA   Bogan A., Maina C.V., Yamamoto K., Cohen F., Sluder A.E.;
RT   "Caenorhabditis elegans nuclear receptor sequences exhibit biophysical
RT   compatibility with the ligand-binding domain fold.";
RL   Submitted (MAY-2000) to the EMBL/GenBank/DDBJ databases.
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Bristol N2;
RX   MEDLINE=99069613; PubMed=9851916;
RG   The C. elegans sequencing consortium;
RT   "Genome sequence of the nematode C. elegans: a platform for
RT   investigating biology.";
RL   Science 282:2012-2018(1998).
RN   [3]
RP   SEQUENCE REVISION, AND ALTERNATIVE SPLICING.
RG   WormBase consortium;
RL   Submitted (SEP-2001) to the EMBL/GenBank/DDBJ databases.
CC   -!- FUNCTION: Orphan nuclear receptor.
CC   -!- SUBCELLULAR LOCATION: Nuclear (Potential).
CC   -!- ALTERNATIVE PRODUCTS:
CC       Event=Alternative splicing; Named isoforms=2;
CC       Name=b;
CC         IsoId=Q21006-1; Sequence=Displayed;
CC       Name=a;
CC         IsoId=Q21006-2; Sequence=VSP_003724, VSP_003725;
CC         Note=No experimental confirmation available;
CC   -!- SIMILARITY: Belongs to the nuclear hormone receptor family.
CC   -!- SIMILARITY: Contains 1 nuclear receptor DNA-binding domain.
CC   --------------------------------------------------------------------------
CC   This Swiss-Prot entry is copyright. It is produced through a collaboration
CC   between  the Swiss Institute of Bioinformatics  and the  EMBL outstation -
CC   the European Bioinformatics Institute.  There are no  restrictions on  its
CC   use as long as its content is in no way modified and this statement is not
CC   removed.
CC   --------------------------------------------------------------------------
DR   EMBL; AF273780; AAG15129.1; ALT_INIT.
DR   EMBL; Z68217; CAA92467.2; -.
DR   EMBL; Z68217; CAC70099.1; -.
DR   EMBL; Z68222; CAC70099.1; JOINED.
DR   EMBL; Z68222; CAC70146.1; -.
DR   EMBL; Z68217; CAC70146.1; JOINED.
DR   HSSP; P03372; 1HCP.
DR   WormBase; WBGene00003627; F58G6.5.
DR   WormPep; F58G6.5a; CE27764.
DR   WormPep; F58G6.5b; CE28938.
DR   InterPro; IPR000536; Hrmon_recept_lig.
DR   InterPro; IPR001723; Stdhrmn_receptor.
DR   InterPro; IPR008946; Str_ncl_receptor.
DR   InterPro; IPR001628; Znf_C4steroid.
DR   Pfam; PF00104; Hormone_recep; 1.
DR   Pfam; PF00105; zf-C4; 1.
DR   PRINTS; PR00398; STRDHORMONER.
DR   ProDom; PD000035; Znf_C4steroid; 1.
DR   SMART; SM00430; HOLI; 1.
DR   SMART; SM00399; ZnF_C4; 1.
DR   PROSITE; PS00031; NUCLEAR_REC_DBD_1; FALSE_NEG.
DR   PROSITE; PS51030; NUCLEAR_REC_DBD_2; 1.
KW   Alternative splicing; DNA-binding; Nuclear protein; Receptor;
KW   Transcription; Transcription regulation; Zinc-finger.
FT   DNA_BIND    122    197       Nuclear receptor-type.
FT   ZN_FING     125    145       C4-type.
FT   ZN_FING     161    185       C4-type.
FT   VARSPLIC      1    106       Missing (in isoform a).
FT                                /FTId=VSP_003724.
FT   VARSPLIC    107    152       FINPKTEDMSMDIPMGNVQSGLSFFSTSKISEDQNTIALKT
FT                                QKNKL -> MAESKFSILKGEEFTGLKCRVCGDSRAGRHYG
FT                                TIACNGCKGFFRRS (in isoform a).
FT                                /FTId=VSP_003725.
SQ   SEQUENCE   605 AA;  68356 MW;  4F10E1A28AF70267 CRC64;
     MMDDLHQQLH QNYNPNGSPT TSTTHRRQNS SSGLEPARKR PKLSPPVLTA MPLDLTDAVS
     PNESQACTSS LKGVVTAAGA AQMSVADRMT SSSVAPPLCI VPRSVEFINP KTEDMSMDIP
     MGNVQSGLSF FSTSKISEDQ NTIALKTQKN KLIWEQRDYV CRFGGKCLVV QEYRNRCRAC
     RLRKCFTVGM DARAVQSERD KHKKNPKDSN NEGSTSPQYP TASTPISIPS TSTSQTPTSS
     VNSYNFQNIP GIVSRSFSEN LIMRDNSVPV METSQSAALS HVPLVRYLID LEKATDNLID
     ENCDFMSMEF DQLCRVDVTI EAAFRQPGVV AKRTPPRWLA LERLTTLEDV HIAWCRSFVL
     CIDYAKIMKD YQELSPTDQF TLLRNRVISV NWLCHTYKTF KAGCDGVALV NGSWYPRDKE
     LQKQLDPGCN HYFRILSEHL MEDLVIPMRE MDMDEGEFVI LKALILFRAH RRLSEEGRAH
     IKRVRDKYIE ALYQHVQHQH RHFSSVQTSM RISKILLLLP SIEHLSQQED DNVQFLALFN
     LANLNGLPYE LHSSIKQHIP NGDDSDDTQV NEVTSNNDGP RSSESSHTPQ SVSTSQFLEF
     KPSLH
//