GENSCAN 1.0 Date run: 6-Nov-116 Time: 11:14:52 Sequence gi568815575r:2807274_3029275 : 222002 bp : 44.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 912 947 36 1 0 126 88 5 0.492 2.53 1.02 Intr + 4063 4179 117 2 0 104 115 44 0.965 9.14 1.03 Term + 5237 5361 125 0 2 105 39 30 0.552 -1.65 1.04 PlyA + 6716 6721 6 1.05 2.05 PlyA - 6768 6763 6 1.05 2.04 Term - 12637 12522 116 2 2 50 39 110 0.906 1.03 2.03 Intr - 16761 16737 25 0 1 93 115 26 0.870 3.40 2.02 Intr - 22570 22310 261 1 0 108 49 185 0.947 14.28 2.01 Init - 22917 22861 57 0 0 96 23 52 0.380 -1.02 2.00 Prom - 24647 24608 40 -7.96 3.00 Prom + 31776 31815 40 -1.96 3.01 Init + 35876 36081 206 1 2 79 110 209 0.853 18.54 3.02 Intr + 46707 46881 175 0 1 70 105 241 0.960 23.94 3.03 Intr + 47720 47882 163 1 1 73 127 146 0.999 16.55 3.04 Intr + 49225 49351 127 2 1 63 92 82 0.983 5.84 3.05 Intr + 52570 52792 223 1 1 76 98 138 0.918 11.73 3.06 Intr + 54249 54449 201 2 0 72 95 73 0.839 5.88 3.07 Term + 73779 73940 162 2 0 92 36 324 0.829 25.54 3.08 PlyA + 74987 74992 6 1.05 4.11 PlyA - 75032 75027 6 1.05 4.10 Term - 100359 99998 362 1 2 122 42 282 0.989 21.70 4.09 Intr - 101569 101448 122 0 2 60 79 76 0.994 4.04 4.08 Intr - 102706 102544 163 2 1 91 98 121 0.998 12.43 4.07 Intr - 103520 103386 135 0 0 64 113 80 0.997 8.64 4.06 Intr - 108419 108283 137 1 2 58 97 21 0.875 0.21 4.05 Intr - 110954 110531 424 0 1 78 121 454 0.984 40.62 4.04 Intr - 113450 113328 123 0 0 106 36 133 0.963 10.46 4.03 Intr - 114751 114630 122 0 2 65 77 28 0.590 -0.46 4.02 Intr - 118492 118343 150 0 0 98 77 90 0.652 8.18 4.01 Init - 122002 121959 44 1 2 67 51 112 0.877 3.29 4.00 Prom - 125734 125695 40 -7.56 5.14 PlyA - 126772 126767 6 1.05 5.13 Term - 127917 127559 359 1 2 73 36 146 0.840 2.57 5.12 Intr - 129590 129469 122 1 2 91 79 76 0.960 7.14 5.11 Intr - 130984 130822 163 2 1 88 92 190 0.992 18.43 5.10 Intr - 135926 135792 135 0 0 93 116 44 0.992 8.24 5.09 Intr - 138861 138725 137 2 2 93 97 86 0.931 10.21 5.08 Intr - 142454 142031 424 0 1 98 82 213 0.908 14.62 5.07 Intr - 145992 145870 123 1 0 34 76 84 0.165 2.36 5.06 Intr - 148264 148143 122 0 2 94 82 100 0.246 10.14 5.05 Intr - 152329 152099 231 0 0 37 76 102 0.274 0.79 5.04 Intr - 161027 160876 152 2 2 65 93 120 0.740 9.16 5.03 Intr - 165833 165765 69 2 0 114 65 13 0.296 0.98 5.02 Intr - 166745 166638 108 2 0 63 61 48 0.356 0.08 5.01 Init - 172074 172009 66 0 0 58 121 84 0.889 9.77 5.00 Prom - 173106 173067 40 -4.56 6.00 Prom + 190641 190680 40 -2.86 6.01 Init + 192925 192947 23 0 2 89 93 11 0.644 0.98 6.02 Intr + 202757 202878 122 2 2 101 94 42 0.912 6.24 6.03 Intr + 205774 205899 126 2 0 54 52 184 0.402 12.05 6.04 Intr + 207697 208120 424 2 1 69 63 226 0.962 11.12 6.05 Intr + 211261 211397 137 1 2 82 106 18 0.992 3.21 6.06 Intr + 216748 216882 135 2 0 82 55 254 0.121 22.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 216748 216896 149 2 2 82 44 251 0.869 18.16 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:2807274_3029275|GENSCAN_predicted_peptide_1|92_aa XGDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFKLNNRRNCFRTHGNQKGVPIP TVFALGFAFGLKNPKTFPCAPSAGEWAVSTVS >gi568815575r:2807274_3029275|GENSCAN_predicted_CDS_1|279_bp ngtggagatcaccattcaacgtatggcaatccagaaggcaatatggtagcaaaaatcgtg tctcccatcgtatccgtggtggtggtgacactgctgggagcagcagccagttatttcaaa ctaaacaataggagaaattgtttcaggacccatggaaaccagaaaggggtgcccattcct acggtgtttgctttaggatttgcatttggcctcaaaaatcctaaaaccttcccatgcgct ccctctgctggtgaatgggcagtatcaaccgttagctaa >gi568815575r:2807274_3029275|GENSCAN_predicted_peptide_2|152_aa MVSGAAGSGRGPADLRPGASPATPLRLNTHAPSNDNAPGTPPDLSGVSSCPAAPRTCLEF WASSSSTPAGAPRSPLSTPWTTPALPDASICTSSELGLPLPAPPGQEAMESLAAALYEVH NWAQIKVVVEAFRILKEAEKALNDTPKTNFYV >gi568815575r:2807274_3029275|GENSCAN_predicted_CDS_2|459_bp atggtcagtggcgccgcgggcagtgggcgcggacctgcagacctgcgcccgggagcatca cctgcgaccccgctgcgtcttaacacccatgcgcccagcaacgacaacgcccccgggaca ccacccgacctcagcggtgtcagcagctgccccgccgccccgcgcacctgcttggagttc tgggcatcctcttcctccaccccagctggcgccccccggtcacccctctccactccctgg accacccctgcgctcccggacgccagtatctgcacctcctccgagcttgggctgcctctg cccgcgccccccggccaggaagccatggagagcttggcagcagctttgtatgaagttcat aactgggctcagatcaaagttgtggtggaagcctttagaattctgaaggaagcagaaaag gcattgaatgacacccccaagaccaacttttatgtttaa >gi568815575r:2807274_3029275|GENSCAN_predicted_peptide_3|418_aa MAPSPLLSLRSVTLVFLLIFTVTDQAFVTLATNDIYCQGALVLGQSLRRHRLTRKLVVLI TPQVSSLLRVILSKVFDEVIEVNLIDSADYIHLAFLKRPELGLTLTKLHCWTLTHYSKCV FLDADTLVLSNVDELFDRGEFSAAPDPGWPDCFNSGVFVFQPSLHTHKLLLQHAMEHGSF DGADQGLLNSFFRNWSTTDIHKHLPFIYNLSSNTMYTYSPAFKQFGSSAKVVHFLGSMKP WNYKYNPQSGSVLEQGSASSSQHQAAFLHLWWTVYQNNVLPLYKSVQAGEARASPGHTLC HSDVGGPCADSASGVGEPCENSTPSAGVPCANSPLGSNQPAQGLPEPTQIVDETLSLPEG RRSEDVDLAVSVSQISIEEKVKELSPEEERRKWEEGRIDYMGKDAFARIQEKLDRFLQ >gi568815575r:2807274_3029275|GENSCAN_predicted_CDS_3|1257_bp atggccccgtcacccctgctgtccctcaggagtgtaactctggtgtttctgttgattttc acagtgactgatcaggcttttgtcacactagccaccaatgacatctactgccagggcgcc ctggtcctggggcagtcactgaggagacacaggctgacgaggaagctggtggtgttgatc actcctcaggtgtccagcctgctcagggtcatcctctcgaaggtgttcgatgaagtcatt gaagtgaatctaatcgatagtgccgactacatccacctggcctttctgaagagacctgag ctcgggctcaccctcaccaagcttcactgttggactctcactcactacagcaagtgtgtc ttcctggatgcagacactctggtgctgtccaatgtcgatgagctgtttgacaggggagag ttttctgcggccccggaccccggatggccggattgcttcaatagcggggtgtttgtcttc cagccttctctccacacgcataaactcctgctacagcacgccatggaacacggcagcttt gacggggcagaccaaggcttactgaatagtttcttcaggaactggtcgaccacagacatc cacaagcacctgccgttcatctataacttgagtagtaacacgatgtacacttacagccct gccttcaagcaattcggttccagtgcaaaggtcgtccactttttggggtccatgaaacct tggaactacaagtacaatccacagagtggctcggtgttggagcaaggctcagcgtccagc agccagcaccaggcggcattccttcatctctggtggacggtctaccagaacaacgtgctg cccctttataaaagcgtccaagcgggggaagcacgcgcgtctcctggtcacacactttgc cacagtgatgtgggggggccgtgtgcggattcagcctctggtgttggagagccgtgtgaa aattcaacacccagtgcgggcgtgccgtgtgcaaattcaccactgggttctaaccagcct gctcagggccttccggagccgacccagatagtggatgagaccctgtccctacctgaagga cgccgttcagaagatgtcgacctggccgtctctgtttcccagatctccatcgaagagaag gtgaaggaattgagccccgaggaagagaggaggaagtgggaggaaggccgtatcgactac atggggaaggacgcgtttgctcgcatccaggagaagctggaccggttcctgcagtaa >gi568815575r:2807274_3029275|GENSCAN_predicted_peptide_4|593_aa MRSAARRGRAAPAARDSLPVLLFLCLLLKTCEPKTANAFKPNILLIMADDLGTGDLGCYG NNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSRAAFLTGRHSFRSGMDASNGYRALQWNA GSGGLPENETTFARILQQHGYATGLIGKWHQGVNCASRGDHCHHPLNHGFDYFYGMPFTL TNDCDPGRPPEVDAALRAQLWGYTQFLALGILTLAAGQTCGFFSVSARAVTGMAGVGCLF FISWYSSFGFVRRWNCILMRNHDVTEQPMVLEKTASLMLKEAVSYIERHKHGPFLLFLSL LHVHIPLVTTSAFLGKSQHGLYGDNVEEMDWLIGKVLNAIEDNGLKNSTFTYFTSDHGGH LEARDGHSQLGGWNGIYKGGKGMGGWEGGIRVPGIFHWPGVLPAGRVIGEPTSLMDVFPT VVQLVGGEVPQDRVIDGHSLVPLLQGAEARSAHEFLFHYCGQHLHAARWHQKDSGSVWKV HYTTPQFHPEGAGACYGRGVCPCSGEGVTHHRPPLLFDLSRDPSEARPLTPDSEPLYHAV IARVGAAVSEHRQTLSPVPQQFSMSNILWKPWLQPCCGHFPFCSCHEDGDGTP >gi568815575r:2807274_3029275|GENSCAN_predicted_CDS_4|1782_bp atgcgatccgccgcgcggaggggacgcgccgcgcccgccgccagggactctttgccggtg ctactgtttttatgcttgcttctgaagacgtgtgaacctaaaactgcaaatgcctttaaa ccaaatatcctactgatcatggcggatgatctaggcactggggatctcggttgctacggg aacaatacactgagaacgccgaatattgaccagcttgcagaggaaggtgtgaggctcact cagcacctggcggccgccccgctctgcaccccaagccgagctgcattcctcacagggaga cattccttcagatcaggcatggacgccagcaatggataccgggcccttcagtggaacgca ggctcaggtggactccctgagaacgaaaccacttttgcaagaatcttgcagcagcatggc tatgcaaccggcctcataggaaaatggcaccagggtgtgaattgtgcatcccgcggggat cactgccaccaccccctgaaccacggatttgactatttctacggcatgcccttcacgctc acaaacgactgtgacccaggcaggccccccgaagtggacgccgccctgagggcgcagctc tggggttacacccagttcctggcgctggggattctcaccctggctgccggccagacctgc ggtttcttctctgtctccgcgagagcagtcaccggcatggccggcgtgggctgcctgttt ttcatctcttggtactcctccttcgggtttgtgcgacgctggaactgtatcctgatgaga aaccatgacgtcacggagcaacccatggttctggagaaaacagcgagtcttatgctaaag gaagctgtttcctatattgaaagacacaagcatgggccatttctcctcttcctttctttg ctgcatgtgcacattccccttgtgaccacgagtgcattcctggggaaaagtcagcatggc ttatatggtgataatgtggaggagatggactggctcataggtaaggttcttaatgccatc gaagacaatggtttaaagaactcaacattcacgtatttcacctctgaccatggaggacat ttagaggcaagagatggacacagccagttagggggatggaacggaatttacaaaggtggg aagggcatgggaggatgggaaggtgggatccgcgtgcccgggatcttccactggccgggg gtgctcccggccggccgagtgattggagagcccacgagcctgatggacgtgttccctact gtggtccagctggtgggtggcgaggtgccccaggacagggtgattgatggccacagcctg gtacccttgctgcagggagctgaggcacgctcggcacatgagttcctgtttcattactgt gggcagcatcttcacgcagcacgctggcaccagaaggacagtggaagcgtctggaaggtt cattacacgaccccgcagttccaccccgagggagcgggggcctgctacggccgaggcgtc tgcccatgctccggggagggcgtgacccatcacagaccccctttgctctttgacctctcc agggacccctccgaggcacggcccctgacccccgactccgagcccctgtaccacgccgtg atagcaagggtaggtgccgcggtgtcggagcatcggcagaccctgagtcctgtgccccag cagttttccatgagcaacatcctgtggaagccgtggctgcagccgtgctgcggacatttc ccgttctgttcatgccacgaggatggggatggcaccccctga >gi568815575r:2807274_3029275|GENSCAN_predicted_peptide_5|736_aa MEQKGEFKNDTLIYEFFHIVAKPHPLPIKSSTYTTFQSARVTRFFLDAEKECRCQEGQVE AALIAMGGTCQGTQRLSTAGESESATHMREPAVQCRRDSQCVSVLAGRRRFFQGAASPLH PASVALTQLHVRWASTATIGQVSGFPGKIRVSPTTNLGDSDYIFTASHWGLGHYGPAVHV HSQGGIDERGSNDSGRLYVTNPCGNVPSWTPNIDRLAEDGVKLTQHISAASLCTPSRAAF LTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCE SASDHCHHPLHHGFDHFYGMPFSLMGDCARWELSEKRVNLEQKLNFLFQVLALVALTLVA GKLTHLIPVSWMPVIWSALSAVLLLASSYFVGALIVHADCFLMRNHTITEQPMCFQRTTP LILQEVASFLKRNKHGPFLLFVSFLHVHIPLITMENFLGKSLHGLYGDNVEEMDWMVGRI LDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGGWEGGIRVPGIF RWPGVLPAGRVIGEPTSLMDVFPTVVRLAGGEVPQDRVIDGQDLLPLLLGTAQHSDHEFL MHYCERFLHAARWHQRDRGTMWKVHFVTPVFQPEGAGACYGRKVCPCFGEKVVHHDPPLL FDLSRDPSETHILTPASEPVFYQVMERVQQAVWEHQRTLSPVPLQLDRLGNIWRPWLQPC CGPFPLCWCLREDDPQ >gi568815575r:2807274_3029275|GENSCAN_predicted_CDS_5|2211_bp atggagcagaaaggagaattcaagaatgacacgctcatttatgaatttttccacattgtc gccaagccacatccattgccaataaaatcttccacatacactacctttcaatctgctcgt gtgaccagatttttcctggatgctgaaaaagaatgcaggtgtcaagaagggcaggtggaa gctgcattgatagccatgggtggcacctgccaaggaactcaaaggttgtcaacagcaggg gagagtgaatcagctacgcacatgcgtgagccagccgttcagtgccgccgagacagccag tgcgtgtctgttttggcaggaagaagacgtttcttccaaggagcagcctccccgctacat ccagccagtgtcgcccttacacagctgcatgtcaggtgggcatcaaccgctaccatcggc caggtgtctggttttccagggaaaatcagggtttcccctactacaaaccttggagactca gattacatctttactgcatcccactggggattgggacactatggccctgctgtccatgtg cattcccagggtggcatcgatgaaagaggcagcaatgatagtggcaggctgtatgtcacc aacccctgcggaaatgttccctcatggactccgaatattgaccgccttgcagaggacggc gtgaagctgacccaacacatctctgccgcatctttgtgcaccccaagcagagccgccttc ctcacgggcagataccctgtgcgatcagggatggtttccagcattggttaccgtgttctt cagtggaccggagcatctggaggtcttccaacaaatgagacaacttttgcaaaaatactg aaagagaaaggctatgccactggactcattggaaaatggcatctgggtctcaactgtgag tcagccagtgatcattgccaccaccctctccatcatggctttgaccatttctacggaatg cctttctccttgatgggtgattgcgcccgctgggaactctcagagaagcgtgtcaacctg gaacaaaaactcaacttcctcttccaagtcctggccttggttgccctcacactggtagca gggaagctcacacacctgatacccgtctcgtggatgccggtcatctggtcagccctttcg gccgtcctcctcctcgcaagctcctattttgtgggtgctctgattgtccatgccgattgc tttctgatgagaaaccacaccatcacggagcagcccatgtgcttccaaagaacgacaccc cttattctgcaggaggttgcgtcctttctcaaaaggaataagcatgggcctttcctcctc tttgtttcctttctacacgttcacatccctcttatcactatggagaacttcctcgggaag agtctccacgggctgtatggggacaacgtagaggagatggactggatggtaggacggatc cttgacactttggacgtggagggtttgagcaacagcaccctcatttattttacgtcggat cacggcggttccctagagaatcaacttggaaacacccagtatggtggctggaatggaatt tataaaggtgggaagggcatgggaggatgggaaggtgggatccgcgtgcccgggatcttc cgctggcccggggtgctcccggccggccgagtgattggcgagcccacgagtctgatggac gtgttccccaccgtggtccggctggcgggcggcgaggtgccccaggacagagtgattgac ggccaagaccttctgcccttgctcctggggacagcccaacactcagaccacgagttcctg atgcattattgtgagaggtttctgcacgcagccaggtggcatcaacgggacagaggaaca atgtggaaagtccactttgtgacgcctgtgttccagccagagggagccggtgcctgctat ggaagaaaggtctgcccgtgctttggggaaaaagtagtccaccacgatccacctttgctc tttgacctctcaagagacccttctgagacccacatcctcacaccagcctcagagcccgtg ttctatcaggtgatggaacgagtccagcaggcggtgtgggaacaccagcggacactcagc ccagttcctctgcagctggacaggctgggcaacatctggagaccgtggctgcagccctgc tgtggcccgttccccctctgctggtgccttagggaagatgacccacaataa >gi568815575r:2807274_3029275|GENSCAN_predicted_peptide_6|323_aa MKQLLYDCTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGRYPIRSGMVSAYNLNRAF TWLGGSGGLPTNETTFAKLLQHRGYRTGLIGKWHLGLSCASRNDHCYHPLNHGFHYFYGV PFGLLSDCQASKTPELHRWLRIKLWISTVALALVPFLLLIPKFARWFSVPWKVIFVFALL AFLFFTSWYSSYGFTRRWNCILMRNHEIIQQPMKEEKVASLMLKEALAFIERYKREPFLL FFSFLHVHTPLISKKKFVGRSKYGRYGDNVEEMDWMVGKILDALDQERLANHTLVYFTSD NGGHLEPLDGAVQLGGWNGIYKX >gi568815575r:2807274_3029275|GENSCAN_predicted_CDS_6|969_bp atgaagcaattgctgtatgactgcacacctaatattgaccgcctggcaagtgaaggagtg aggcttacccagcatctcgcagctgcttccatgtgcaccccaagtcgggctgccttcctg accggccggtaccccatcagatcagggatggtgtctgcctacaacctgaaccgtgccttc acgtggcttggtgggtcaggtggtcttcccaccaatgaaacgacttttgccaagctgctg cagcaccgtggctaccgcacgggactcataggcaaatggcacctgggtttgagctgcgcc tctcggaatgatcactgttaccacccgctcaaccatggttttcactacttttacggggtg ccttttggacttttaagcgactgccaggcatccaagacaccagaactgcaccgctggctc aggatcaaactgtggatctccacggtagcccttgccctggttccttttctgcttctcatt cccaagttcgcccgctggttctcagtgccatggaaggtcatctttgtctttgctctcctc gcctttctgtttttcacttcctggtactctagttatggatttactcgacgttggaattgc atccttatgaggaaccatgaaattatccagcagccaatgaaagaggagaaagtagcttcc ctcatgctgaaggaggcacttgctttcattgaaaggtacaaaagggaaccttttctcctc tttttttccttcctgcacgtacatactccactcatctccaaaaagaagtttgttgggcgc agtaaatatggcaggtatggggacaatgtagaagaaatggattggatggtgggtaaaatc ctggatgccctggaccaggagcgcctggccaaccacaccttggtgtacttcacctctgac aacgggggccacctggagcccctggacggggctgttcagctgggtggctggaacgggatc tacaaagnn