GENSCAN 1.0 Date run: 3-Nov-116 Time: 22:21:26 Sequence gi568815582r:64847606_65104869 : 257264 bp : 38.91% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 336 238 99 0 0 64 115 48 0.099 4.49 1.07 Intr - 2098 1790 309 1 0 56 54 145 0.011 3.48 1.06 Intr - 17721 17524 198 0 0 53 46 146 0.069 5.53 1.05 Intr - 34797 34695 103 2 1 31 77 55 0.063 -2.04 1.04 Intr - 36957 36820 138 2 0 56 51 156 0.363 7.36 1.03 Intr - 53323 53259 65 1 2 81 74 56 0.050 0.10 1.02 Intr - 63204 63040 165 0 0 65 93 102 0.589 7.64 1.01 Init - 67372 67367 6 1 0 58 93 17 0.377 -0.87 1.00 Prom - 69927 69888 40 -5.15 2.04 PlyA - 71558 71553 6 1.05 2.03 Term - 74453 73988 466 1 1 -60 49 1245 0.561 99.30 2.02 Intr - 96446 96368 79 0 1 103 41 26 0.011 -2.91 2.01 Init - 96931 96826 106 1 1 71 53 71 0.123 2.33 2.00 Prom - 97418 97379 40 -7.65 3.17 PlyA - 97639 97634 6 1.05 3.16 Term - 100494 99998 497 1 2 106 42 583 0.983 49.24 3.15 Intr - 103413 103162 252 1 0 114 82 307 0.989 29.18 3.14 Intr - 103813 103538 276 2 0 47 71 117 0.019 2.47 3.13 Intr - 124091 123974 118 2 1 96 68 121 0.492 10.02 3.12 Intr - 124459 124326 134 2 2 118 89 204 0.999 22.94 3.11 Intr - 125435 125299 137 1 2 61 94 53 0.982 2.49 3.10 Intr - 134696 134443 254 2 2 51 87 392 0.993 30.71 3.09 Intr - 140739 140552 188 1 2 68 113 263 0.995 25.29 3.08 Intr - 144330 144163 168 1 0 81 89 144 0.992 12.80 3.07 Intr - 145429 145310 120 2 0 40 84 155 0.992 9.85 3.06 Intr - 151251 150957 295 0 1 94 65 378 0.854 31.76 3.05 Intr - 153861 153832 30 0 0 83 121 -2 0.498 0.01 3.04 Intr - 157205 157037 169 1 1 39 86 133 0.307 7.23 3.03 Intr - 157436 157366 71 2 2 107 30 98 0.150 2.96 3.02 Intr - 173625 173590 36 0 0 115 62 33 0.001 0.94 3.01 Init - 189905 189603 303 2 0 34 119 186 0.651 13.62 3.00 Prom - 192338 192299 40 -7.65 4.04 PlyA - 192661 192656 6 1.05 4.03 Term - 195774 195676 99 0 0 136 36 56 0.326 2.35 4.02 Intr - 206323 206199 125 2 2 104 115 49 0.712 8.58 4.01 Init - 210113 209936 178 2 1 52 64 107 0.528 4.18 4.00 Prom - 211409 211370 40 -5.85 5.08 PlyA - 211497 211492 6 -0.45 5.07 Term - 211978 211825 154 0 1 49 43 224 0.847 10.41 5.06 Intr - 216555 216408 148 1 1 71 27 82 0.548 -1.23 5.05 Intr - 219164 219071 94 2 1 73 81 86 0.018 5.02 5.04 Intr - 221985 221881 105 0 0 35 63 94 0.010 1.09 5.03 Intr - 238553 238460 94 1 1 43 90 112 0.050 5.95 5.02 Intr - 240650 240518 133 0 1 2 50 161 0.034 2.48 5.01 Intr - 248677 248531 147 2 0 19 72 116 0.356 2.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 171201 171335 135 2 0 39 48 141 0.914 2.24 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:64847606_65104869|GENSCAN_predicted_peptide_1|361_aa MLNLRRSVYNNKNRRLSVVALERLRNHLNICKAVREPDAAKCLILVSAFFCLRILPLRRS LAATVPMQMTELNVPSLWGPQVQCLAQTIAQVITSEIDCPKFHVKYSKGFLTHGKFLCGR PNIKSPRSNNHLPVSVTYCSYAIHAVQAPLMGSIMSSAMKWKGFLSAVSPVTNGTSCVKS WGVEGSSANSYLQRNSRVREMIEVGPELSQTGTSPVLGGGTFENRISAIVIVKVWRIFHA QIIFSNPVVEKNVTIHFNTKTFMTFLRNMFGVLGEHNITGTVFTKKAPESGTPCMTFYRG SNCSIKGCCQEHLSIMITTNKNLYSLVMTFIGKAQRTNLLTLEMTCLTDERVKIPDFIGQ K >gi568815582r:64847606_65104869|GENSCAN_predicted_CDS_1|1083_bp atgctgaatctcagacgttctgtttataacaataagaaccgaaggctctcagtggtagct ttagaaagactcagaaatcatctaaatatatgtaaggcagtaagggagcctgacgcagcc aaatgtctgatccttgtctccgcgtttttttgcctgagaatcctgccgctgagaagaagc cttgctgctactgtgccaatgcagatgacagagctgaatgtcccatcactgtgggggcca caagtccagtgccttgctcaaaccattgcacaggtcattacttctgaaattgactgtcct aaattccacgtgaagtattctaaaggattcctgacccatgggaaattcttatgtggaagg cctaatattaagagccccaggtctaacaaccaccttcctgtgagtgtcacctactgctcc tatgctatccatgcagtgcaggcgcctctcatgggttccatcatgtcatctgcaatgaaa tggaaagggttcctttcagctgtcagcccagtgacaaatgggaccagctgtgtcaaaagc tggggagtggagggcagttcagcaaattcctatttacagagaaacagcagagtcagagag atgatagaagtgggccctgaactgagtcaaacagggacaagcccagttttgggtggagga acatttgagaataggatctcggccattgtaatagtgaaggtatggagaatttttcatgct caaatcattttttccaatccagtggttgaaaagaatgttacaattcatttcaacacaaaa acttttatgacatttttgagaaatatgtttggagtcctgggggaacacaacatcactggg actgtcttcactaaaaaagccccagaaagcgggacaccttgcatgactttctacagaggc agcaactgcagcatcaaaggatgctgccaagaacatttatcaattatgattaccactaat aaaaacctctacagtctagtgatgaccttcatagggaaagcacaaaggactaatctactc acattggagatgacttgtcttactgatgagagagtgaagatccctgattttattggtcag aag >gi568815582r:64847606_65104869|GENSCAN_predicted_peptide_2|216_aa MGVNGGWAVGVSEQKRGKIYLAGSVEDRLGAYKPRVHDAPATLAIFRFLIQKKTRSFSME SRRRKRRKRRKRKEEEKKKKRKKKKKKKKKKKKKKKKKKKKKEKEKEKEKEKEKEEEKEG EGEEEEEEEEEEEEEEEEEGEEEEGKEEEGKEEEKKKKRKKKKKKEKEKEKEEEKEGEGE EEEEEEEEEEEEEEEEGEEEEGKEEEGKEEEEEEEE >gi568815582r:64847606_65104869|GENSCAN_predicted_CDS_2|651_bp atgggggtgaatgggggctgggcagttggagtctctgagcagaagaggggcaaaatttat ttggcaggcagtgtggaggacagattaggagcatataaacccagagtgcacgatgctcca gccacactggccatctttcggttcctgatacaaaaaaaaacacgttccttttccatggaa agcagaaggagaaagagaaggaaaaggagaaagagaaaggaggaggagaagaagaagaag aggaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag aagaaggagaaggagaaggagaaggagaaggagaaggagaaggaggaggagaaagaagga gaaggagaagaagaagaagaggaagaagaagaggaagaagaagaagaggaagaagaaggg gaggaggaggaggggaaggaggaggaggggaaggaggaggagaagaagaagaagaggaag aagaagaagaagaaggagaaggagaaggagaaggaggaggagaaagaaggagaaggagaa gaagaagaagaggaagaagaagaggaagaagaagaagaggaagaagaaggggaggaggag gaggggaaggaggaggaggggaaggaggaggaggaagaagaagaggagtag >gi568815582r:64847606_65104869|GENSCAN_predicted_peptide_3|1015_aa MTVTKEGSLNRSRGDGDGKVQVYTVPNIQVFPNLGLCRQRFTVTDGVQVTARILQHVLEC HLPNTLSSAMRKSGWWGILSPVLSAKTEAESLNDFLDATQLVSEITLDYERALKLTGDVG SPDVMSSTSRDIPSQESHAFAPERRGHLRPSFHGHHEKGKEGQVLQRSKRGWVWNQFFVI EEYTGPDPVLVGRVCSRVNRRTKLHSDIDSGDGNIKYILSGEGAGTIFVIDDKSGNIHAT KTLDREERAQYTLMAQAVDRDTNRPLEPPSEFIVKVQDINDNPPEFLHETYHANVPERSN VGTSVIQVTASDADDPTYGNSAKLVYSILEGQPYFSVEAQTGIIRTALPNMDREAKEEYH VVIQAKDMGGHMGGLSGTTKVTITLTDVNDNPPKFPQSVYQMSVSEAAVPGEEVGRVKAK DPDIGENGLVTYNIVDGDGMESFEITTDYETQEGVIKLKKPVDFETKRAYSLKVEAANVH IDPKFISNGPFKDTVTVKISVEDADEPPMFLAPSYIHEVQENAAAGTVVGRVHAKDPDAA NSPIRYSIDRHTDLDRFFTINPEDGFIKTTKPLDREETAWLNITVFAAEIHNRHQEAKVP VAIRVLDVNDNAPKFAAPYEGFICESDQTKPLSNQPIVTISADDKDDTANGPRFIFSLPP EIIHNPNFTVRDNRGRSALGHVSNKLQIILSRRNDMFNRVAEGVGGNPFTGKPATSFSAL LLPILQWKFPTKSKEPVSEIYSKMQIACFVIMPNGCGFRRYLDERTNNTAGVYARRGGFS RQKQDLYLLPIVISDGGIPPMSSTNTLTIKVCGCDVNGALLSCNAEAYILNAGLSTGALI AILACIVILLVIVVLFVTLRRQKKEPLIVFEEEDVRENIITYDDEGGGEEDTEAFDIATL QNPDGINGFIPRKDIKPEYQYMPRPGLRPAPNSVDVDDFINTRIQEADNDPTAPPYDSIQ IYGYEGRGSVAGSLSSLESATTDSDLDYDYLQNWGPRFKKLADLYGSKDTFDDDS >gi568815582r:64847606_65104869|GENSCAN_predicted_CDS_3|3048_bp atgactgtcacaaaggaaggctccctaaacaggagtcggggtgatggggatggcaaggtg caagtctacacagttccaaatatccaggtgtttccaaacttggggctgtgtagacagagg ttcacagtgactgatggagtccaggtcacagctagaattttacaacatgtgctggagtgc catcttcctaacactctgtcctctgcaatgaggaaatcggggtggtggggaattttgtct ccagttctcagtgcaaaaacagaggctgaaagtctgaatgattttcttgatgccacacag ctggtatcagaaattactttggattatgagagagctttgaagctaactggggacgtgggc agccctgacgtgatgagctcaaccagcagagacattccatcccaagagagccatgccttt gccccagagcggcgggggcacctgcggccctccttccatgggcaccatgagaagggcaag gaggggcaggtgctacagcgctccaagcgtggctgggtctggaaccagttcttcgtgata gaggagtacaccgggcctgaccccgtgcttgtgggcagggtgtgcagcagggtaaataga agaacaaagcttcattcagatattgactctggtgatgggaacattaaatacattctctca ggggaaggagctggaaccatttttgtgattgatgacaaatcagggaacattcatgccacc aagacgttggatcgagaagagagagcccagtacacgttgatggctcaggcggtggacagg gacaccaatcggccactggagccaccgtcggaattcattgtcaaggtccaggacattaat gacaaccctccggagttcctgcacgagacctatcatgccaacgtgcctgagaggtccaat gtgggaacgtcagtaatccaggtgacagcttcagatgcagatgaccccacttatggaaat agcgccaagttagtgtacagtatcctcgaaggacaaccctatttttcggtggaagcacag acaggtatcatcagaacagccctacccaacatggacagggaggccaaggaggagtaccac gtggtgatccaggccaaggacatgggtggacatatgggcggactctcagggacaaccaaa gtgacgatcacactgaccgatgtcaatgacaacccaccaaagtttccgcagagcgtatac cagatgtctgtgtcagaagcagccgtccctggggaggaagtaggaagagtgaaagctaaa gatccagacattggagaaaatggcttagtcacatacaatattgttgatggagatggtatg gaatcgtttgaaatcacaacggactatgaaacacaggagggggtgataaagctgaaaaag cctgtagattttgaaaccaaaagagcctatagcttgaaggtagaggcagccaacgtgcac atcgacccgaagtttatcagcaatggccctttcaaggacactgtgaccgtcaagatctca gtagaagatgctgatgagccccctatgttcttggccccaagttacatccacgaagtccaa gaaaatgcagctgctggcaccgtggttgggagagtgcatgccaaagaccctgatgctgcc aacagcccgataaggtattccatcgatcgtcacactgacctcgacagatttttcactatt aatccagaggatggttttattaaaactacaaaacctctggatagagaggaaacagcctgg ctcaacatcactgtctttgcagcagaaatccacaatcggcatcaggaagccaaagtccca gtggccattagggtccttgatgtcaacgataatgctcccaagtttgctgccccttatgaa ggtttcatctgtgagagtgatcagaccaagccactttccaaccagccaattgttacaatt agtgcagatgacaaggatgacacggccaatggaccaagatttatcttcagcctaccccct gaaatcattcacaatccaaatttcacagtcagagacaaccgaggcagaagtgcgctgggt catgtgagcaacaaacttcaaatcattctaagcaggaggaatgacatgtttaaccgggtg gctgagggtgttgggggaaatcccttcactggcaagccggccacatcattttctgctttg ctcttgcccattcttcaatggaaattccctaccaaatccaaagagcctgtttctgagatt tattccaaaatgcagatagcttgtttcgtgatcatgccaaacggctgtggcttccgcaga tacttagatgaaagaacaaataacacagcaggcgtgtacgcccggcgtggagggttcagt cggcagaagcaggacttgtaccttctgcccatagtgatcagcgatggcggcatcccgccc atgagtagcaccaacaccctcaccatcaaagtctgcgggtgcgacgtgaacggggcactg ctctcctgcaacgcagaggcctacattctgaacgccggcctgagcacaggcgccctgatc gccatcctcgcctgcatcgtcattctcctggtcattgtagtattgtttgtgaccctgaga aggcaaaagaaagaaccactcattgtctttgaggaagaagatgtccgtgagaacatcatt acttatgatgatgaagggggtggggaagaagacacagaagcctttgatattgccaccctc cagaatcctgatggtatcaatggatttatcccccgcaaagacatcaaacctgagtatcag tacatgcctagacctgggctccggccagcgcccaacagcgtggatgtcgatgacttcatc aacacgagaatacaggaggcagacaatgaccccacggctcctccttatgactccattcaa atctacggttatgaaggcaggggctcagtggccgggtccctgagctccctagagtcggcc accacagattcagacttggactatgattatctacagaactggggacctcgttttaagaaa ctagcagatttgtatggttccaaagacacttttgatgacgattcttaa >gi568815582r:64847606_65104869|GENSCAN_predicted_peptide_4|133_aa MLLPSLTTLSILMESLREGKWKKEGYGSARTEPSTQSELSKRELPQSQLHPPLTVFLDFE ITVFSAQRPCDIPSCCHLLSDQSDGWSVLQKLAASIQWVKKVLFQRVPPSKHPHAKLHLR VSVPDNPTCDKLY >gi568815582r:64847606_65104869|GENSCAN_predicted_CDS_4|402_bp atgctgctcccttcattgacaactctttccattttaatggagagcctaagagagggaaag tggaaaaaggaaggatatggctctgcacgtactgagccttccacacaatcagagcttagc aaaagggagctgccacagtcacagctgcatcctccactgactgtcttccttgattttgaa atcactgtgttttcagctcagcggccctgtgacattccttcgtgttgtcatttgttgagt gaccaatcagatgggtggagtgtgttacagaaattggcagcaagtatccaatgggtgaag aaggtgttattccaaagggtaccccctagtaaacatccccatgctaaactccatctcaga gtcagcgtcccagataatccaacctgtgacaaactctattaa >gi568815582r:64847606_65104869|GENSCAN_predicted_peptide_5|291_aa XLEAGKSKIKALADSVSDEGETRILVILHWKYYKEVVATWTYACDKMLRVLEDRNFKIKV PADSVSGEGPFQIDGTFCVFMWLKGQGGSLKLRLEYQGLKAPEDIFESMPLANDAFETGN GSTRKSSFSWLYKKEDAQVSVFVPKLTLDVASSGVYVAKEGLEEQADSWENGFLKQQNVQ THKNRMTDKGGYKVLWAFSEDHLDSETWLCLHITLRAKGDPSLAAKVFCTCSVLLKMATS WPGQCRIQRLPEPPDRPLAVRYCAVITMVTMDSLYARPSALWKMVKVVTLH >gi568815582r:64847606_65104869|GENSCAN_predicted_CDS_5|876_bp nttctggaggctggaaagtccaagatcaaggcgctagcagattctgtgtctgatgaggga gagacaagaatacttgtaatacttcactggaagtattacaaagaagttgtagctacatgg acgtatgcttgcgataaaatgttaagagttctggaggataggaacttcaagatcaaggtg ccagctgattcggtgtctggtgagggcccgttccaaatagacggcaccttctgtgtcttc atgtggctgaaggggcaagggggctccctgaagcttcgtttagaatatcaaggactgaaa gctccagaagatatttttgaatcaatgccattggcaaatgatgcctttgaaacagggaat ggttctaccaggaagagtagtttttcctggctgtacaaaaaagaggatgctcaggtatcg gtatttgtacccaagctcactttagatgtggccagctcaggtgtctatgttgcaaaagag ggactggaggagcaggcagacagttgggaaaatggtttcttaaaacaacaaaatgttcaa acacataagaacagaatgactgataaaggtggttataaggtcctctgggccttctctgag gaccatcttgactctgaaacctggctgtgcctacacatcaccctgagagccaaaggtgac ccctccctggccgccaaggtcttctgcacatgcagtgtcctcttgaagatggccacatca tggcctggacaatgccggattcagcggctgcctgagcctcctgaccgtccattggccgtc aggtattgtgcagtaattaccatggtgacaatggactctctctacgcccgtccatcggcg ctttggaaaatggtgaaggttgtcactttgcattag