GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:00:07 Sequence gi568815584r:58309037_58511945 : 202909 bp : 38.16% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9506 9585 80 0 2 72 75 91 0.686 4.55 1.02 Intr + 9675 9769 95 2 2 71 56 69 0.444 -0.16 1.03 Intr + 14449 14581 133 1 1 45 111 206 0.529 18.33 1.04 Intr + 19201 19280 80 0 2 86 68 61 0.999 1.33 1.05 Intr + 20492 20568 77 2 2 50 98 104 0.998 5.84 1.06 Intr + 20967 21133 167 1 2 59 66 223 0.957 15.96 1.07 Intr + 35659 35731 73 0 1 66 53 99 0.718 2.26 1.08 Intr + 37375 37469 95 2 2 16 101 70 0.750 -0.14 1.09 Intr + 37984 38081 98 0 2 52 106 60 0.925 2.09 1.10 Intr + 38611 38842 232 1 1 49 96 309 0.999 24.65 1.11 Intr + 42037 42287 251 0 2 67 73 343 0.996 26.01 1.12 Intr + 44622 44819 198 0 0 12 115 216 0.960 14.24 1.13 Intr + 50096 50180 85 2 1 101 53 88 0.999 5.50 1.14 Intr + 51865 52066 202 0 1 67 42 200 0.588 11.24 1.15 Intr + 55134 56285 1152 1 0 86 106 1034 0.730 93.06 1.16 Intr + 56482 56586 105 2 0 110 77 116 0.999 11.97 1.17 Term + 56988 57244 257 1 2 89 36 250 0.787 14.66 1.18 PlyA + 57274 57279 6 1.05 2.00 Prom + 59379 59418 40 -4.25 2.01 Init + 86922 87057 136 2 1 35 79 306 0.999 22.75 2.02 Intr + 87262 87329 68 2 2 99 70 65 0.568 3.51 2.03 Intr + 93644 93725 82 1 1 33 94 45 0.009 -2.11 2.04 Intr + 98290 98432 143 2 2 84 99 43 0.037 4.15 2.05 Intr + 105789 105869 81 2 0 68 65 61 0.001 0.72 2.06 Intr + 121612 121681 70 0 1 66 93 69 0.406 3.04 2.07 Intr + 123352 123421 70 2 1 74 106 33 0.662 1.02 2.08 Intr + 133670 133844 175 2 1 77 71 180 0.676 14.22 2.09 Intr + 134918 135139 222 1 0 89 70 117 0.882 7.40 2.10 Intr + 140163 140259 97 2 1 35 100 32 0.551 -2.24 2.11 Intr + 141543 141710 168 1 0 9 75 197 0.919 9.50 2.12 Intr + 144314 144437 124 0 1 94 98 -11 0.668 -0.78 2.13 Intr + 147666 147774 109 0 1 15 103 96 0.421 3.17 2.14 Intr + 148723 148943 221 0 2 85 110 230 0.998 21.08 2.15 Intr + 149437 149509 73 1 1 83 91 46 0.989 2.89 2.16 Intr + 151950 152124 175 2 1 34 116 141 0.998 10.09 2.17 Intr + 158699 158886 188 0 2 76 85 140 0.973 10.99 2.18 Intr + 160390 160599 210 0 0 44 61 160 0.951 7.09 2.19 Intr + 161577 161687 111 2 0 83 71 70 0.953 4.46 2.20 Intr + 163163 163243 81 1 0 20 92 110 0.916 3.52 2.21 Intr + 165571 165761 191 0 2 69 115 149 0.998 13.16 2.22 Intr + 168087 168205 119 0 2 89 111 54 0.979 6.99 2.23 Intr + 173477 173676 200 0 2 95 72 165 0.570 13.75 2.24 Intr + 177971 178130 160 1 1 84 70 70 0.522 3.44 2.25 Intr + 178893 179073 181 1 1 51 -52 202 0.240 0.50 2.26 Intr + 179585 179838 254 2 2 60 89 191 0.699 12.55 2.27 Intr + 181128 181204 77 1 2 70 115 64 0.958 5.62 2.28 Intr + 183108 183239 132 2 0 102 65 49 0.933 3.92 2.29 Intr + 189747 189924 178 2 1 90 74 136 0.970 10.97 2.30 Intr + 199519 199673 155 2 2 83 81 109 0.147 8.57 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 70837 70382 456 1 0 45 34 202 0.836 6.43 S.002 Term - 101711 101535 177 2 0 87 32 123 0.950 3.30 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:58309037_58511945|GENSCAN_predicted_peptide_1|1126_aa XFDDGDERTLRRTSLCLKGERHFAESETLDQLPLTNPEHFGTPVIAKKTNRGRRSSLPVT EDEKEEESSEEEDEDKRRLNDELLGKVVSVVSATERTEWYPALVISPSCNDDITVKKDQC LVRSFIDSKFYSIARKDIKEVDILNLPESELSTKPGLQKASIFLKTRVVPDNWKMDISEI LESSSSDDEDGPAEENDEEKEKEAKKTEEEVPEEELDPEERDNFLQQLYKFMEDRGTPIN KPPVLGYKDLNLFKLFRLVYHQGGCDNIDSGAVWKQIYMDLGIPILNSAASYNVKTAYRK YLYGFEEYCRSANIQFRTVHHHEPKVKEEKKDLEESMEEALKLDQEMPLTEVKSEPEENI DSNSESEREEIELKSPRGRRRIARDVNSIKKEIEEEKTEDKLKDNDTENKDVDDDYETAE KKENELLLGRKNTPKQKEKKIKKQEDSDKDSDEEEEKSQEREETESKCDSEGEEDEEDME PCLTGTKVKVKYGRGKTQKIYEASIKSTEIDDGEVLYLVHYYGWNVRYDEWVKADRIIWP LDKGGPKKKQKKKAKNKEDSEKDEKRDEERQKSKRGRPPLKSTLSSNMPYGLSKTANSEG KSGTRSARSNIPDSSPLSNGMEDSCSSDSETEDALEKNLINEELSLKDELEKNENLNDDK LDEENPKISAHILKENDRTQMQPLETLKLEVGENEQIVQIFGNKMEKTEEVKKEAEKSPK GKGRRSKTKDLSLEIIKISSFGQNEAGSEPHIEAHSLELSSLDNKNFSSATEDEIDQCVK EKKLKRKILGQSSPEKKIRIENGMEMTNTVSQERTSDCIGSEGMKNLNFEQHFERENEGM PSLIAESNQCIQQLTSERFDSPAEETVNIPLKEDEDAMPLIGPETLVCHEVDLDDLDEKD KTSIEDVAVESSESNSLVSIPPALPPVVQHNFSVASPLTLSQDESRSVKSESDITIEVDS IAEESQEGLCERESANGFETNVASGTCSIIVQERESREKGKDFLGKSQKRPSDGNSGLMA KKQKRTPKRTSAAAKNEKNGTGQSSDSEDLPVLDNSSKCTPVKHLNVSKPQKLARSPARI SPHIKDGEKDKHREKHPNSSPRTYKWSFQLSKKLIENLIDEYCKVE >gi568815584r:58309037_58511945|GENSCAN_predicted_CDS_1|3381_bp ntgtttgatgatggtgatgagcgaacattgagacgtacctcactttgtctgaaaggagag agacattttgcagagagtgagacacttgaccagcttccattaacaaatccagagcatttt ggaactccagtaattgcaaagaagacgaacagaggaaggagatcttctcttcctgttact gaagatgaaaaggaagaagaaagcagtgaagaggaagatgaagacaagcgccgtctcaat gatgaattactaggaaaagttgtaagtgtggtgtctgcaacggagaggactgaatggtat cctgctttggtaatatctcccagctgtaatgatgacatcacagtgaaaaaggatcagtgt ttagttcgatcatttattgattctaaattttactctatagcaagaaaggacattaaggaa gtagacattctcaatctaccggaatctgagctctccactaaaccagggcttcagaaagca agcatcttcttaaaaactagagttgttcctgataattggaaaatggatataagtgaaatc cttgagtcatccagtagtgatgatgaagatggcccagctgaagaaaatgatgaagagaag gaaaaggaggccaaaaagacagaagaagaggtgcctgaggaagaacttgatcctgaagag agggacaacttcctccagcagctttataagtttatggaagacagaggtactccaatcaac aaaccacctgttttgggctataaagatctcaatctcttcaaactcttcagactggtttat catcagggtggatgtgacaatattgatagtggtgctgtatggaagcaaatttatatggac cttggcattcctattttgaattcagctgcttcctacaatgtaaaaactgcttatagaaag tatctctatggttttgaggagtactgccgttcggcaaatattcagttcagaactgttcat caccatgaaccaaaagtaaaagaggaaaaaaaagacttagaagaatcaatggaagaggct ctcaaattagatcaagaaatgcctttaacagaagtgaagagtgaacctgaggaaaatatc gattcaaacagtgaaagtgaaagagaagagatagaattaaaatctccgaggggacgaagg agaattgctcgagatgtaaattctattaaaaaggaaattgaagaagagaaaacagaagac aaattaaaagataatgatacagaaaataaggatgtagatgatgactatgaaactgcagag aaaaaagaaaatgagctactactggggagaaaaaatacaccaaagcaaaaagagaagaaa attaaaaaacaggaggattctgacaaagactcagatgaagaggaagagaaaagccaagag agggaagaaactgaaagcaaatgtgactctgaaggagaggaagatgaggaagacatggaa ccctgcctaacaggaaccaaagtgaaagtaaaatatggacgagggaagactcagaaaatt tatgaagccagtattaaaagcactgaaattgatgacggagaagttttatatttggtacat tactatggatggaatgtcaggtatgatgagtgggtgaaggctgacaggataatctggcct ttggacaaaggtggaccaaagaaaaaacagaagaaaaaagctaaaaataaagaagatagt gaaaaggacgaaaagagagatgaggagaggcagaagtcaaaacggggacgacctccttta aaatcaaccctctcatcaaacatgccgtatggcttatctaagacagcaaacagtgaagga aaatcaggtaccagaagtgctcgcagcaatataccagacagctcacctctgtcaaatgga atggaagactcttgttcatctgatagtgaaacagaagatgctttagaaaagaatttaata aatgaagaactttctcttaaagatgaactagaaaaaaatgaaaatttgaatgatgataag ctagatgaagaaaatccaaagatttctgcacatatattaaaagaaaatgataggactcaa atgcagcctttagaaaccctgaagttagaagttggagagaatgaacaaatagtacagatt tttgggaacaaaatggaaaaaacagaagaagttaagaaagaagccgaaaaatctccaaaa ggaaagggaagacgaagcaagacaaaagatctttctttagaaattataaagatttcatca tttggccagaatgaagcaggaagtgaacctcatatagaagctcatagtcttgaattgtct tcattagacaataaaaacttttcttctgctacagaagatgaaattgaccaatgtgtgaaa gaaaagaagttgaaacggaaaatactaggacaatcatcgccagagaaaaaaataagaatt gagaatggaatggaaatgacaaatactgtatctcaagaaaggaccagtgattgtattgga tctgagggaatgaaaaacttaaattttgaacagcactttgaaagagaaaatgaaggaatg ccatcattgatagcagagtcaaaccaatgcatccaacaactgactagtgaaagatttgat agtccagctgaagaaactgtaaatattccactaaaagaagatgaggatgcaatgcctctg atcgggcctgaaaccttggtttgccatgaagtagatttggatgatttggatgaaaaggat aagaccagcattgaggatgtagcagttgaaagctctgagtctaactctcttgtttctatt ccacctgccctacctcctgtagtccaacataacttttcagtagcttcaccacttactctt agtcaagatgagtctcgaagcgtaaaaagtgagagtgatataacgattgaagttgatagt attgctgaagaatctcaagaaggtctctgtgagagggaatcggcaaatggatttgaaact aatgttgcctctggtacctgtagtataattgtacaagagagagagagcagagagaagggt aaggactttctagggaaaagtcagaagaggccaagtgatggaaatagtggattaatggca aaaaagcaaaagcgtaccccaaagcgaacaagtgctgcagccaaaaatgaaaagaatgga acaggacaaagcagtgatagtgaagatcttcctgtcctagacaattcaagtaaatgtacc ccagtaaagcatcttaatgtatctaagccacagaaacttgcacgatctcctgcaagaata tccccgcacatcaaagatggagagaaagataaacacagagaaaaacatccgaattcatcc cctaggacatataaatggagctttcagctcagtaagaagcttatagaaaacttgatagat gaatactgtaaagtggaatag >gi568815584r:58309037_58511945|GENSCAN_predicted_peptide_2|1417_aa MPSVRSLLRLLAAAAACGAFAFLGYCIYLNRKRRGDPAFKRRLRDKRRAEPQKAEEQGTQ VQCFRSAQLWDPTKNKKLQELFLQEVRMGELWLSRGEHRMGIQHLGNALLVCEQPRELLK VFKHTLPPKVFEMLLHKIPLICQKSLPTPQRVSVETTQGAWLSTPIGSNEDFSKDVAVQV LPLDKIEENNKQKANDIFISQYTMGQKDALRTVLKQKAQSMPVFKEVKVHLLEDAGIEKD AVTQETRISPSGIDSATTVAAATAAAIATAAPLIKVQSDLEAKVNSVTELLSKLQETDKH LQRVTEQQTSIQRKQEKLHCHDHEKQMNVFMEQHIRHLEKLQQQQIDIQSLKIAHAFKVL YILQQMSGPYKTVVPSTLRGQAPLKEVEDTSFDKQKSPLETPAPRRFAPVPVSRDDELSK RENLLEEKENMEVSCHRGNVRLLEQILNNNDSLTRKSESSNTTSLTRSKIGWTPEKTNRF PSCEELETTKVTMQKSDDVLHDLGQKEKETNSMVQPKESLSMLKLPDLPQNSVKLQTTNT TRSVLKDAEKILRGVQNNKKVLEENLEAIIRAKDGAAMYSLINALSTNREMSEKIRIRKT VDEWIKTISAEIQGLLKATTVIQDEDYMLQVYGKPVYQGHRSTLKKGPYLRFNSPSPKSR PQRPKVIERVKGQTQSNSDTMPPAGVIVSKPHPVTVTTSIPPSSRKVETGVKKPNIAIVE MKSEKKDPPQLTVQRRPTNPVVQQLWRLRETWGHFSLDSSLSQVQTRSMCIERMSITSTV SPWRKLEDLDQGAKNPELKGKQHTVLPSVDIDSISNSSADVLSPLSSPKEASLPPVQTWI KTPEIMKVDEEEVKFPGTNFDEIIDVIQEEEKCDEIPDSEPILEFNRSVKADSTKYNGPP FPPVASTFQPTADILDKVIERKETLENSLIQWVEQEIMSRIISGLFPVQQQIAPSISVSV SETSEPLTSDIVEGTSSGALQLFVDAGVPVNSNVIKHFVNEALAETIAVMLGDREAKKQG PVATGVSGDASTNETYLPARVCTPLPTPQPTPPCSPSSPAKECVLVKTPDSSPCDSDHDM AFPVKEICAEKVTPTTTPPPAAAVFTPTLSDISIDKLKVSSPELPKPWGDGDLPLEEENP NSPQEELHPRAIVMSVAKDEEPESMDFPAQPPPPEPVPFMPFPAGTKAPSPSQMPGSDSS TLESTLSVTVTETETLDKPISEGEILFSCGQKLAPKILEDIGLYLTNLNDSLSSTLHDAV EMEDDPPSEGQVIRMSHKKFHADAILSFAKQNQESAVSQQAVYHSEDLENSVGELSEGQR PQLTAAAENILMGHSLYMQPPVTNTQSLDQQCDPKPLSRQFDTVSGSIYEDSCASHGPMS LGELELEPNSKLVLPTTLLTAQENDVNLPVAAEDFSQ >gi568815584r:58309037_58511945|GENSCAN_predicted_CDS_2|4251_bp atgccctccgtccgctccctcctccgcctcttggccgccgcggcggcctgtggcgccttc gccttcctgggctattgtatttacctcaaccggaagcggcgcggggaccccgcgttcaag cgccgcctgcgggacaaaagaagagcagagcctcaaaaggctgaggagcagggcacgcag gtgcagtgctttcgatccgcacagttgtgggatccaacgaagaataaaaagttgcaagaa cttttcttgcaagaggtacggatgggagaactttggttatctagaggagagcacagaatg gggattcaacacctcggcaatgcccttttagtgtgcgagcaaccacgggaacttctgaaa gttttcaaacacactctccctcccaaggtatttgagatgctgttgcacaaaattcccctt atttgccagaaaagcctccccactccccagcgagtgtcagtggaaaccacacagggagcc tggctttccaccccaattggcagtaatgaggatttttctaaagacgttgcagtgcaagtg ttgcctttggataaaatagaagagaacaacaagcaaaaagcaaatgacatcttcatttct cagtatacaatgggacagaaagatgctctaagaacagttttaaagcaaaaagctcaaagc atgcctgtttttaaggaagtaaaggtacatctgttagaagatgcaggcatagagaaggat gctgttactcaggagactagaatttcacccagtggaattgattcagctacaaccgtggct gcagcaactgctgctgccattgcaaccgcagctccgttgataaaggtgcagagtgatttg gaagcaaaagtcaattctgttacagaattacttagtaaattacaggagactgataaacac ctgcaacgtgttacagagcagcaaacaagcattcagaggaaacaagagaaattacattgt catgatcacgaaaagcaaatgaatgtgtttatggagcagcacataaggcatcttgaaaag ttacaacaacaacaaatagatattcagtcattaaagattgctcatgcttttaaggtgctt tacattttgcaacaaatgagtggtccttacaaaactgttgtgcccagcactttgagaggc caagcacctttaaaagaagttgaagatacgagttttgataaacagaaatctcctttggag acaccagcacctcgcagatttgctcctgtacctgtttcaagggatgatgaactatcaaag agggaaaatcttttggaagaaaaagaaaatatggaagtgtcgtgtcacagaggaaatgta agactattggaacaaattttgaataataatgattctttgacaagaaaaagtgaatcatca aacaccacctcactaactaggtcaaaaataggatggactcctgagaaaacaaacagattt ccttcctgtgaagagctagaaacaactaaagtgactatgcagaagtctgatgatgttctt catgaccttggccaaaaagagaaagaaacaaatagcatggtccagccaaaagaatctctg agtatgttgaagcttccagatcttccacagaattctgttaagcttcaaacaaccaataca acaagatctgtattgaaagatgctgagaagattttgagaggagtacaaaacaataaaaaa gtacttgaagaaaacctggaagctattattcgtgcaaaagatggagctgccatgtattcg cttatcaatgctttatctaccaacagagagatgtcagagaaaattaggatcagaaagaca gtggatgaatggattaaaactatttctgcagaaattcaggggcttttgaaagcaaccaca gtaatacaagatgaagattatatgttacaagtctatggaaagccagtttatcagggccat cgaagcactcttaaaaaaggaccatatctcagatttaattctccatctcctaagtccaga ccacagagaccaaaagtaatagaacgagttaaaggacaaacccaaagtaatagtgatacc atgccacctgctggagtgattgtcagcaagccacaccctgtaactgtgactacttctatt cctccatcatctcgaaaagtagaaactggagtaaagaaacctaacatagccattgtagaa atgaagtcagaaaaaaaggatcctcctcagcttactgtgcagcgtaggcccactaaccca gtggtgcagcagctatggaggcttcgggagacctggggacacttcagcctggacagctcc ctgagccaagtacaaacccggtcaatgtgtattgaacgaatgagcatcactagcacagtc agtccatggaggaaattagaggatctagatcagggggcaaagaatcctgaattaaaaggc aaacaacacacagtattacccagtgtagatattgacagcatttcaaatagtagtgctgat gtcctttcacctctgtctagccccaaagaagcatctcttcctcctgtgcaaacttggata aagactccagaaattatgaaggtagatgaagaagaggtgaagtttccaggaactaacttt gatgaaataatcgatgtcatacaggaagaagaaaaatgtgatgaaattccagactctgaa ccaattctggagtttaacagaagtgttaaagctgattctacaaaatataatggtcctcca tttccgccagttgcttctacttttcagcccactgctgatattctggataaagtaattgag agaaaagaaacactggaaaatagcttaattcaatgggtagagcaagaaataatgtcaaga attatctctgggctctttccagtccagcaacagattgcacctagtatcagtgtttcagtc agtgagacaagtgaaccactgacttctgacattgtggaaggaacaagcagtggcgccctc cagctttttgttgatgctggtgttcctgtgaactcaaatgtgattaaacattttgttaac gaagctcttgctgagaccattgctgtcatgctgggtgacagagaagcaaagaagcaaggt cctgttgctacaggtgtttctggggatgcttcaacaaatgaaacatatttgccggcaaga gtgtgcaccccactgcctaccccacagcctacgcctccttgctcaccttcatcacctgct aaggagtgtgttttggtaaagactccagattcttctccctgtgattcggatcatgatatg gcttttcctgtgaaagaaatatgtgctgaaaaagttacccctactactacacctcctcca gcggcggcagtttttaccccaactttgtcagatatttccattgataaattgaaggtatca agcccagagcttcccaagccatggggtgatggagacctgccactggaagaagagaaccct aactcacctcaagaagaacttcatccaagagctattgtaatgtctgtggctaaggatgaa gaaccagagagtatggatttccctgctcagcctccacctccagagccagttccctttatg ccatttcctgccggcaccaaggccccttccccctcacagatgccaggttctgattcatca acactggagagcacattgagtgttactgtcactgaaactgaaactttagataaacccatc tctgaaggagagattttatttagctgtggtcaaaaattggcccccaagattttagaagat ataggactgtacctgacaaaccttaatgatagcttatccagcactctgcatgatgccgtt gaaatggaggatgatcctcctagtgaagggcaagtgattaggatgtcccataaaaaattt catgcagatgcaattctttcttttgctaaacaaaaccaggagtcagcagtttcccagcaa gcagtctatcattcagaggacttggaaaacagtgtgggtgaacttagtgaaggacaaaga ccccagctaacagcggcagcagagaacatcttaatgggacattctctctatatgcagcca cctgtcactaatacacagtctttggatcaacaatgtgatcctaaaccattatctcggcaa tttgacacagtttcaggtagtatttatgaagattcatgtgctagtcatggtccaatgagt ttgggagaattggagttggagccaaattctaagctggttcttcccacaacacttctgaca gcacaagaaaatgatgttaatttaccagtagccgctgaagatttttcccag