GENSCAN 1.0    Date run: 8-Nov-116    Time: 16:01:06

Sequence gi568815575r:2834835_3058436 : 223602 bp : 43.81% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   4215   4254   40                              -1.96
 1.01 Init +   8315   8520  206  1  2   79  110   209 0.801  18.54
 1.02 Intr +  19146  19320  175  0  1   70  105   241 0.960  23.94
 1.03 Intr +  20159  20321  163  1  1   73  127   146 0.999  16.55
 1.04 Intr +  21664  21790  127  2  1   63   92    82 0.983   5.84
 1.05 Intr +  25009  25231  223  1  1   76   98   138 0.918  11.73
 1.06 Intr +  26688  26888  201  2  0   72   95    73 0.839   5.88
 1.07 Term +  46218  46379  162  2  0   92   36   324 0.829  25.54
 1.08 PlyA +  47426  47431    6                               1.05

 2.11 PlyA -  47471  47466    6                               1.05
 2.10 Term -  72798  72437  362  1  2  122   42   282 0.989  21.70
 2.09 Intr -  74008  73887  122  0  2   60   79    76 0.994   4.04
 2.08 Intr -  75145  74983  163  2  1   91   98   121 0.998  12.43
 2.07 Intr -  75959  75825  135  0  0   64  113    80 0.997   8.64
 2.06 Intr -  80858  80722  137  1  2   58   97    21 0.875   0.21
 2.05 Intr -  83393  82970  424  0  1   78  121   454 0.984  40.62
 2.04 Intr -  85889  85767  123  0  0  106   36   133 0.963  10.46
 2.03 Intr -  87190  87069  122  0  2   65   77    28 0.590  -0.46
 2.02 Intr -  90931  90782  150  0  0   98   77    90 0.652   8.18
 2.01 Init -  94441  94398   44  1  2   67   51   112 0.877   3.29
 2.00 Prom -  98173  98134   40                              -7.56

 3.14 PlyA -  99211  99206    6                               1.05
 3.13 Term - 100356  99998  359  1  2   73   36   146 0.840   2.57
 3.12 Intr - 102029 101908  122  1  2   91   79    76 0.960   7.14
 3.11 Intr - 103423 103261  163  2  1   88   92   190 0.992  18.43
 3.10 Intr - 108365 108231  135  0  0   93  116    44 0.992   8.24
 3.09 Intr - 111300 111164  137  2  2   93   97    86 0.931  10.21
 3.08 Intr - 114893 114470  424  0  1   98   82   213 0.908  14.62
 3.07 Intr - 118431 118309  123  1  0   34   76    84 0.165   2.36
 3.06 Intr - 120703 120582  122  0  2   94   82   100 0.246  10.14
 3.05 Intr - 124768 124538  231  0  0   37   76   102 0.274   0.79
 3.04 Intr - 133466 133315  152  2  2   65   93   120 0.740   9.16
 3.03 Intr - 138272 138204   69  2  0  114   65    13 0.296   0.98
 3.02 Intr - 139184 139077  108  2  0   63   61    48 0.356   0.08
 3.01 Init - 144513 144448   66  0  0   58  121    84 0.889   9.77
 3.00 Prom - 145545 145506   40                              -4.56

 4.00 Prom + 163080 163119   40                              -2.86
 4.01 Init + 165364 165386   23  0  2   89   93    11 0.644   0.98
 4.02 Intr + 175196 175317  122  2  2  101   94    42 0.912   6.24
 4.03 Intr + 178213 178338  126  2  0   54   52   184 0.402  12.05
 4.04 Intr + 180136 180559  424  2  1   69   63   226 0.962  11.12
 4.05 Intr + 183700 183836  137  1  2   82  106    18 0.992   3.21
 4.06 Intr + 189187 189321  135  2  0   82   55   254 0.885  22.04
 4.07 Intr + 192479 192641  163  0  1   81   80     7 0.879  -1.77
 4.08 Intr + 194413 194534  122  1  2   84   93   134 0.948  13.64
 4.09 Term + 198184 198551  368  2  2   56   53   145 0.817   2.57
 4.10 PlyA + 198932 198937    6                               1.05

 5.02 PlyA - 200044 200039    6                               1.05
 5.01 Term - 222608 222520   89  0  2  109   47    84 0.904   4.22

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:2834835_3058436|GENSCAN_predicted_peptide_1|418_aa
MAPSPLLSLRSVTLVFLLIFTVTDQAFVTLATNDIYCQGALVLGQSLRRHRLTRKLVVLI
TPQVSSLLRVILSKVFDEVIEVNLIDSADYIHLAFLKRPELGLTLTKLHCWTLTHYSKCV
FLDADTLVLSNVDELFDRGEFSAAPDPGWPDCFNSGVFVFQPSLHTHKLLLQHAMEHGSF
DGADQGLLNSFFRNWSTTDIHKHLPFIYNLSSNTMYTYSPAFKQFGSSAKVVHFLGSMKP
WNYKYNPQSGSVLEQGSASSSQHQAAFLHLWWTVYQNNVLPLYKSVQAGEARASPGHTLC
HSDVGGPCADSASGVGEPCENSTPSAGVPCANSPLGSNQPAQGLPEPTQIVDETLSLPEG
RRSEDVDLAVSVSQISIEEKVKELSPEEERRKWEEGRIDYMGKDAFARIQEKLDRFLQ

>gi568815575r:2834835_3058436|GENSCAN_predicted_CDS_1|1257_bp
atggccccgtcacccctgctgtccctcaggagtgtaactctggtgtttctgttgattttc
acagtgactgatcaggcttttgtcacactagccaccaatgacatctactgccagggcgcc
ctggtcctggggcagtcactgaggagacacaggctgacgaggaagctggtggtgttgatc
actcctcaggtgtccagcctgctcagggtcatcctctcgaaggtgttcgatgaagtcatt
gaagtgaatctaatcgatagtgccgactacatccacctggcctttctgaagagacctgag
ctcgggctcaccctcaccaagcttcactgttggactctcactcactacagcaagtgtgtc
ttcctggatgcagacactctggtgctgtccaatgtcgatgagctgtttgacaggggagag
ttttctgcggccccggaccccggatggccggattgcttcaatagcggggtgtttgtcttc
cagccttctctccacacgcataaactcctgctacagcacgccatggaacacggcagcttt
gacggggcagaccaaggcttactgaatagtttcttcaggaactggtcgaccacagacatc
cacaagcacctgccgttcatctataacttgagtagtaacacgatgtacacttacagccct
gccttcaagcaattcggttccagtgcaaaggtcgtccactttttggggtccatgaaacct
tggaactacaagtacaatccacagagtggctcggtgttggagcaaggctcagcgtccagc
agccagcaccaggcggcattccttcatctctggtggacggtctaccagaacaacgtgctg
cccctttataaaagcgtccaagcgggggaagcacgcgcgtctcctggtcacacactttgc
cacagtgatgtgggggggccgtgtgcggattcagcctctggtgttggagagccgtgtgaa
aattcaacacccagtgcgggcgtgccgtgtgcaaattcaccactgggttctaaccagcct
gctcagggccttccggagccgacccagatagtggatgagaccctgtccctacctgaagga
cgccgttcagaagatgtcgacctggccgtctctgtttcccagatctccatcgaagagaag
gtgaaggaattgagccccgaggaagagaggaggaagtgggaggaaggccgtatcgactac
atggggaaggacgcgtttgctcgcatccaggagaagctggaccggttcctgcagtaa

>gi568815575r:2834835_3058436|GENSCAN_predicted_peptide_2|593_aa
MRSAARRGRAAPAARDSLPVLLFLCLLLKTCEPKTANAFKPNILLIMADDLGTGDLGCYG
NNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSRAAFLTGRHSFRSGMDASNGYRALQWNA
GSGGLPENETTFARILQQHGYATGLIGKWHQGVNCASRGDHCHHPLNHGFDYFYGMPFTL
TNDCDPGRPPEVDAALRAQLWGYTQFLALGILTLAAGQTCGFFSVSARAVTGMAGVGCLF
FISWYSSFGFVRRWNCILMRNHDVTEQPMVLEKTASLMLKEAVSYIERHKHGPFLLFLSL
LHVHIPLVTTSAFLGKSQHGLYGDNVEEMDWLIGKVLNAIEDNGLKNSTFTYFTSDHGGH
LEARDGHSQLGGWNGIYKGGKGMGGWEGGIRVPGIFHWPGVLPAGRVIGEPTSLMDVFPT
VVQLVGGEVPQDRVIDGHSLVPLLQGAEARSAHEFLFHYCGQHLHAARWHQKDSGSVWKV
HYTTPQFHPEGAGACYGRGVCPCSGEGVTHHRPPLLFDLSRDPSEARPLTPDSEPLYHAV
IARVGAAVSEHRQTLSPVPQQFSMSNILWKPWLQPCCGHFPFCSCHEDGDGTP

>gi568815575r:2834835_3058436|GENSCAN_predicted_CDS_2|1782_bp
atgcgatccgccgcgcggaggggacgcgccgcgcccgccgccagggactctttgccggtg
ctactgtttttatgcttgcttctgaagacgtgtgaacctaaaactgcaaatgcctttaaa
ccaaatatcctactgatcatggcggatgatctaggcactggggatctcggttgctacggg
aacaatacactgagaacgccgaatattgaccagcttgcagaggaaggtgtgaggctcact
cagcacctggcggccgccccgctctgcaccccaagccgagctgcattcctcacagggaga
cattccttcagatcaggcatggacgccagcaatggataccgggcccttcagtggaacgca
ggctcaggtggactccctgagaacgaaaccacttttgcaagaatcttgcagcagcatggc
tatgcaaccggcctcataggaaaatggcaccagggtgtgaattgtgcatcccgcggggat
cactgccaccaccccctgaaccacggatttgactatttctacggcatgcccttcacgctc
acaaacgactgtgacccaggcaggccccccgaagtggacgccgccctgagggcgcagctc
tggggttacacccagttcctggcgctggggattctcaccctggctgccggccagacctgc
ggtttcttctctgtctccgcgagagcagtcaccggcatggccggcgtgggctgcctgttt
ttcatctcttggtactcctccttcgggtttgtgcgacgctggaactgtatcctgatgaga
aaccatgacgtcacggagcaacccatggttctggagaaaacagcgagtcttatgctaaag
gaagctgtttcctatattgaaagacacaagcatgggccatttctcctcttcctttctttg
ctgcatgtgcacattccccttgtgaccacgagtgcattcctggggaaaagtcagcatggc
ttatatggtgataatgtggaggagatggactggctcataggtaaggttcttaatgccatc
gaagacaatggtttaaagaactcaacattcacgtatttcacctctgaccatggaggacat
ttagaggcaagagatggacacagccagttagggggatggaacggaatttacaaaggtggg
aagggcatgggaggatgggaaggtgggatccgcgtgcccgggatcttccactggccgggg
gtgctcccggccggccgagtgattggagagcccacgagcctgatggacgtgttccctact
gtggtccagctggtgggtggcgaggtgccccaggacagggtgattgatggccacagcctg
gtacccttgctgcagggagctgaggcacgctcggcacatgagttcctgtttcattactgt
gggcagcatcttcacgcagcacgctggcaccagaaggacagtggaagcgtctggaaggtt
cattacacgaccccgcagttccaccccgagggagcgggggcctgctacggccgaggcgtc
tgcccatgctccggggagggcgtgacccatcacagaccccctttgctctttgacctctcc
agggacccctccgaggcacggcccctgacccccgactccgagcccctgtaccacgccgtg
atagcaagggtaggtgccgcggtgtcggagcatcggcagaccctgagtcctgtgccccag
cagttttccatgagcaacatcctgtggaagccgtggctgcagccgtgctgcggacatttc
ccgttctgttcatgccacgaggatggggatggcaccccctga

>gi568815575r:2834835_3058436|GENSCAN_predicted_peptide_3|736_aa
MEQKGEFKNDTLIYEFFHIVAKPHPLPIKSSTYTTFQSARVTRFFLDAEKECRCQEGQVE
AALIAMGGTCQGTQRLSTAGESESATHMREPAVQCRRDSQCVSVLAGRRRFFQGAASPLH
PASVALTQLHVRWASTATIGQVSGFPGKIRVSPTTNLGDSDYIFTASHWGLGHYGPAVHV
HSQGGIDERGSNDSGRLYVTNPCGNVPSWTPNIDRLAEDGVKLTQHISAASLCTPSRAAF
LTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCE
SASDHCHHPLHHGFDHFYGMPFSLMGDCARWELSEKRVNLEQKLNFLFQVLALVALTLVA
GKLTHLIPVSWMPVIWSALSAVLLLASSYFVGALIVHADCFLMRNHTITEQPMCFQRTTP
LILQEVASFLKRNKHGPFLLFVSFLHVHIPLITMENFLGKSLHGLYGDNVEEMDWMVGRI
LDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGGWEGGIRVPGIF
RWPGVLPAGRVIGEPTSLMDVFPTVVRLAGGEVPQDRVIDGQDLLPLLLGTAQHSDHEFL
MHYCERFLHAARWHQRDRGTMWKVHFVTPVFQPEGAGACYGRKVCPCFGEKVVHHDPPLL
FDLSRDPSETHILTPASEPVFYQVMERVQQAVWEHQRTLSPVPLQLDRLGNIWRPWLQPC
CGPFPLCWCLREDDPQ

>gi568815575r:2834835_3058436|GENSCAN_predicted_CDS_3|2211_bp
atggagcagaaaggagaattcaagaatgacacgctcatttatgaatttttccacattgtc
gccaagccacatccattgccaataaaatcttccacatacactacctttcaatctgctcgt
gtgaccagatttttcctggatgctgaaaaagaatgcaggtgtcaagaagggcaggtggaa
gctgcattgatagccatgggtggcacctgccaaggaactcaaaggttgtcaacagcaggg
gagagtgaatcagctacgcacatgcgtgagccagccgttcagtgccgccgagacagccag
tgcgtgtctgttttggcaggaagaagacgtttcttccaaggagcagcctccccgctacat
ccagccagtgtcgcccttacacagctgcatgtcaggtgggcatcaaccgctaccatcggc
caggtgtctggttttccagggaaaatcagggtttcccctactacaaaccttggagactca
gattacatctttactgcatcccactggggattgggacactatggccctgctgtccatgtg
cattcccagggtggcatcgatgaaagaggcagcaatgatagtggcaggctgtatgtcacc
aacccctgcggaaatgttccctcatggactccgaatattgaccgccttgcagaggacggc
gtgaagctgacccaacacatctctgccgcatctttgtgcaccccaagcagagccgccttc
ctcacgggcagataccctgtgcgatcagggatggtttccagcattggttaccgtgttctt
cagtggaccggagcatctggaggtcttccaacaaatgagacaacttttgcaaaaatactg
aaagagaaaggctatgccactggactcattggaaaatggcatctgggtctcaactgtgag
tcagccagtgatcattgccaccaccctctccatcatggctttgaccatttctacggaatg
cctttctccttgatgggtgattgcgcccgctgggaactctcagagaagcgtgtcaacctg
gaacaaaaactcaacttcctcttccaagtcctggccttggttgccctcacactggtagca
gggaagctcacacacctgatacccgtctcgtggatgccggtcatctggtcagccctttcg
gccgtcctcctcctcgcaagctcctattttgtgggtgctctgattgtccatgccgattgc
tttctgatgagaaaccacaccatcacggagcagcccatgtgcttccaaagaacgacaccc
cttattctgcaggaggttgcgtcctttctcaaaaggaataagcatgggcctttcctcctc
tttgtttcctttctacacgttcacatccctcttatcactatggagaacttcctcgggaag
agtctccacgggctgtatggggacaacgtagaggagatggactggatggtaggacggatc
cttgacactttggacgtggagggtttgagcaacagcaccctcatttattttacgtcggat
cacggcggttccctagagaatcaacttggaaacacccagtatggtggctggaatggaatt
tataaaggtgggaagggcatgggaggatgggaaggtgggatccgcgtgcccgggatcttc
cgctggcccggggtgctcccggccggccgagtgattggcgagcccacgagtctgatggac
gtgttccccaccgtggtccggctggcgggcggcgaggtgccccaggacagagtgattgac
ggccaagaccttctgcccttgctcctggggacagcccaacactcagaccacgagttcctg
atgcattattgtgagaggtttctgcacgcagccaggtggcatcaacgggacagaggaaca
atgtggaaagtccactttgtgacgcctgtgttccagccagagggagccggtgcctgctat
ggaagaaaggtctgcccgtgctttggggaaaaagtagtccaccacgatccacctttgctc
tttgacctctcaagagacccttctgagacccacatcctcacaccagcctcagagcccgtg
ttctatcaggtgatggaacgagtccagcaggcggtgtgggaacaccagcggacactcagc
ccagttcctctgcagctggacaggctgggcaacatctggagaccgtggctgcagccctgc
tgtggcccgttccccctctgctggtgccttagggaagatgacccacaataa

>gi568815575r:2834835_3058436|GENSCAN_predicted_peptide_4|539_aa
MKQLLYDCTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGRYPIRSGMVSAYNLNRAF
TWLGGSGGLPTNETTFAKLLQHRGYRTGLIGKWHLGLSCASRNDHCYHPLNHGFHYFYGV
PFGLLSDCQASKTPELHRWLRIKLWISTVALALVPFLLLIPKFARWFSVPWKVIFVFALL
AFLFFTSWYSSYGFTRRWNCILMRNHEIIQQPMKEEKVASLMLKEALAFIERYKREPFLL
FFSFLHVHTPLISKKKFVGRSKYGRYGDNVEEMDWMVGKILDALDQERLANHTLVYFTSD
NGGHLEPLDGAVQLGGWNGIYKGGKGMGGWEGGIRVPGIFRWPSVLEAGRVINEPTSLMD
IYPTLSYIGGGILSQDRVIDGQNLMPLLEGRASHSDHEFLFHYCGVYLHTVRWHQKDCAT
VWKAHYVTPKFYPEGTGACYGSGICSCSGDVTYHDPPLLFDISRDPSEALPLNPDNEPLF
DSVIKKMEAAIREHRRTLTPVPQQFSVFNTIWKPWLQPCCGTFPFCGCDKEDDILPMAP

>gi568815575r:2834835_3058436|GENSCAN_predicted_CDS_4|1620_bp
atgaagcaattgctgtatgactgcacacctaatattgaccgcctggcaagtgaaggagtg
aggcttacccagcatctcgcagctgcttccatgtgcaccccaagtcgggctgccttcctg
accggccggtaccccatcagatcagggatggtgtctgcctacaacctgaaccgtgccttc
acgtggcttggtgggtcaggtggtcttcccaccaatgaaacgacttttgccaagctgctg
cagcaccgtggctaccgcacgggactcataggcaaatggcacctgggtttgagctgcgcc
tctcggaatgatcactgttaccacccgctcaaccatggttttcactacttttacggggtg
ccttttggacttttaagcgactgccaggcatccaagacaccagaactgcaccgctggctc
aggatcaaactgtggatctccacggtagcccttgccctggttccttttctgcttctcatt
cccaagttcgcccgctggttctcagtgccatggaaggtcatctttgtctttgctctcctc
gcctttctgtttttcacttcctggtactctagttatggatttactcgacgttggaattgc
atccttatgaggaaccatgaaattatccagcagccaatgaaagaggagaaagtagcttcc
ctcatgctgaaggaggcacttgctttcattgaaaggtacaaaagggaaccttttctcctc
tttttttccttcctgcacgtacatactccactcatctccaaaaagaagtttgttgggcgc
agtaaatatggcaggtatggggacaatgtagaagaaatggattggatggtgggtaaaatc
ctggatgccctggaccaggagcgcctggccaaccacaccttggtgtacttcacctctgac
aacgggggccacctggagcccctggacggggctgttcagctgggtggctggaacgggatc
tacaaaggtggcaaaggaatgggaggatgggaaggaggtatccgtgtgccagggatattc
cggtggccgtcagtcttggaggctgggagagtgatcaatgagcccaccagcttaatggac
atctatccgacgctgtcttatataggcggagggatcttgtcccaggacagagtgattgac
ggccagaacctaatgcccctgctggaaggaagggcgtcccactccgaccacgagttcctc
ttccactactgtggggtctatctgcacacggtcaggtggcatcagaaggactgtgcaact
gtgtggaaagctcattatgtgactcctaaattctaccctgaaggaacaggtgcctgctat
gggagtggaatatgttcatgttcgggggatgtaacctaccacgacccaccactcctcttt
gacatctcaagagacccttcagaagcccttccactgaaccctgacaatgagccattattt
gactccgtgatcaaaaagatggaggcagccataagagagcatcgtaggacactaacacct
gtcccacagcagttctctgtgttcaacacaatttggaaaccatggctgcagccttgctgt
gggaccttccccttctgtgggtgtgacaaggaagatgacatccttcccatggctccctga

>gi568815575r:2834835_3058436|GENSCAN_predicted_peptide_5|29_aa
XLWDEEGSINHLTERESEPPTLPGAEESF

>gi568815575r:2834835_3058436|GENSCAN_predicted_CDS_5|90_bp
naattatgggatgaagagggctccataaaccatctcaccgaaagagaatcagaaccacct
actcttccaggagctgaagagagcttttaa