GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:56:34

Sequence gi568815590f:85365271_85580786 : 215516 bp : 38.21% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -    437    432    6                               1.05
 1.01 Sngl -   3871   3485  387  1  0   58   35   178 0.580   5.46
 1.00 Prom -   4015   3976   40                              -3.45

 2.00 Prom +   5522   5561   40                              -5.65
 2.01 Init +   6886   7098  213  0  0   74   48   125 0.746   5.89
 2.02 Intr +   7836   7954  119  2  2   37   55   134 0.799   3.34
 2.03 Intr +  17684  17805  122  2  2   85   54    93 0.482   4.82
 2.04 Term +  20012  20070   59  0  2  119   37    65 0.567   1.37
 2.05 PlyA +  20526  20531    6                               1.05

 3.02 PlyA -  21685  21680    6                               1.05
 3.01 Sngl -  55838  55038  801  2  0   42   46   423 0.941  29.08
 3.00 Prom -  57284  57245   40                              -6.15

 4.02 PlyA -  57453  57448    6                               1.05
 4.01 Sngl -  58249  57950  300  1  0   51   42   244 0.839  11.64
 4.00 Prom -  64238  64199   40                              -3.85

 5.00 Prom +  66110  66149   40                              -5.85
 5.01 Init +  73640  73673   34  1  1  116  113    44 0.782   9.79
 5.02 Intr +  74442  74639  198  1  0   79  108   160 0.955  15.50
 5.03 Intr +  76803  76921  119  1  2   48   94   175 0.999  13.36
 5.04 Intr +  78764  78856   93  1  0   84   98    65 0.987   6.34
 5.05 Intr +  79886  79948   63  1  0   82   92    57 0.919   3.50
 5.06 Intr +  80872  81027  156  0  0   86   99   120 0.989  12.19
 5.07 Intr +  82764  82862   99  2  0  123   69    75 0.953   8.49
 5.08 Term +  82909  83001   93  0  0  102   42    92 0.981   2.85
 5.09 PlyA +  83634  83639    6                               1.05

 6.00 Prom +  88765  88804   40                              -1.45
 6.01 Init +  90270  90479  210  2  0   86   67   222 0.891  18.73
 6.02 Term +  96800  96901  102  1  0  102   38    73 0.750   1.00
 6.03 PlyA +  96981  96986    6                               1.05

 7.00 Prom +  97720  97759   40                             -11.44
 7.01 Init +  98268  98469  202  2  1   95    2   150 0.980   6.26
 7.02 Intr +  98650  98845  196  2  1  104  113   335 0.991  35.15
 7.03 Intr +  99463  99545   83  1  2   43   72    47 0.875  -3.04
 7.04 Intr + 100002 100199  198  1  0   96   82   338 0.982  32.40
 7.05 Intr + 108423 108541  119  1  2   49   63    98 0.997   2.66
 7.06 Intr + 109054 109146   93  0  0   79   96    44 0.913   3.54
 7.07 Intr + 110528 110590   63  1  0   84   94    72 0.976   5.40
 7.08 Intr + 111850 112005  156  0  0   83   72   145 0.997  11.69
 7.09 Term + 115400 115519  120  1  0  112   44    97 0.994   5.19
 7.10 PlyA + 115662 115667    6                               1.05

 8.04 PlyA - 116239 116234    6                               1.05
 8.03 Term - 122403 122030  374  1  2   45   38   186 0.049   3.17
 8.02 Intr - 126667 126581   87  2  0  100   64    59 0.017   3.72
 8.01 Init - 133410 133329   82  0  1   80   14   116 0.053   4.59
 8.00 Prom - 138942 138903   40                              -5.45

 9.04 PlyA - 139703 139698    6                               1.05
 9.03 Term - 142270 142100  171  1  0  135   48    49 0.515   2.44
 9.02 Intr - 146783 146741   43  1  1   57   99    32 0.191  -1.48
 9.01 Init - 150646 150582   65  1  2   82  103    55 0.541   7.07
 9.00 Prom - 165249 165210   40                              -3.75

10.00 Prom + 166741 166780   40                              -2.35
10.01 Init + 167428 167479   52  0  1   63   72    40 0.770   1.27
10.02 Term + 170485 170690  206  2  2   72   49   131 0.784   4.15
10.03 PlyA + 170757 170762    6                               1.05

11.05 PlyA - 170879 170874    6                               1.05
11.04 Term - 173680 173501  180  1  0   70   51   146 0.822   5.73
11.03 Intr - 184521 184348  174  0  0  112   41    41 0.003   1.01
11.02 Intr - 201849 201710  140  1  2   12   61   116 0.007   0.56
11.01 Init - 213108 213027   82  0  1   64  100    72 0.553   7.18

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 127522 127734  213  0  0   85   70   182 0.823  14.90
S.002 Init + 181695 181872  178  2  1   70   91   113 0.858   9.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:85365271_85580786|GENSCAN_predicted_peptide_1|128_aa
MMRSLTEVGEILRGKNPEIRVQGWERNGYFLGNVPRCPDLPLSYSLQTQTAREAFFEPPW
KKPMTPKSFPRSSIQRLGTASSRKSDLVSGKEDLLESYKIDVTVTSLEPPGSDAQCLNSL
RNEALPPR

>gi568815590f:85365271_85580786|GENSCAN_predicted_CDS_1|387_bp
atgatgcgcagtttaactgaggtgggggagattctcagagggaagaacccagagatcagg
gttcaaggctgggagagaaatggctacttccttggaaatgttcctaggtgtcctgatctg
ccactgagctacagccttcaaactcaaactgcaagggaagcttttttcgagcctccctgg
aaaaaacccatgaccccaaaatccttccctcgaagctctatccagaggcttggtactgct
tcaagtagaaaatctgatttggtttctgggaaggaggacctgttggaaagttacaagatt
gacgtgacagtgacaagtctggaacctcctggctctgatgcacagtgcttgaactccctt
agaaatgaagcattgcccccaagatag

>gi568815590f:85365271_85580786|GENSCAN_predicted_peptide_2|170_aa
MDDGPYAMVELKFRREHREDPGFWEALFKKECTLYSSDGYYETKRKPTRQKYSRKQVLAM
MWRNENPCAFLVIVKEEKEIRASVGVATQTVKVTATVQDGCFVKTEKTLNGNVDTERLGN
HRQAQQHQLNCSSPTGSSYIPSCAGVGLEPMSPLPRMLEKEFIELDRELD

>gi568815590f:85365271_85580786|GENSCAN_predicted_CDS_2|513_bp
atggacgatggaccatatgccatggtagaactaaagttcagacgcgagcacagagaagac
ccaggtttttgggaggccctctttaaaaaagaatgcactttgtactcatcagacggctac
tatgaaacaaaaagaaaaccaaccagacaaaaatacagcagaaaacaagtgttggcgatg
atgtggagaaatgagaacccttgtgcatttctggtaattgtgaaggaggaaaaagaaatt
cgtgctagtgttggtgttgcaactcaaactgtaaaagttacagctacagtgcaagatggg
tgtttcgttaagacggaaaagacattaaatgggaatgtggatacggagagacttggcaac
caccggcaggcacagcagcaccagctaaattgcagctcaccaacaggcagcagttacatc
cctagttgtgctggtgtgggtctggagccaatgagcccccttccacgaatgctagagaaa
gaatttattgaattagatagagagttggactag

>gi568815590f:85365271_85580786|GENSCAN_predicted_peptide_3|266_aa
MIISIDAEKTFDKIQQPFVLKTLNKLSIDRTYLKIIRATYDKPTANIILNGQKLEAFPLQ
TGTRQGCPLSPLLFDIVLEVLARAIGQEKEIKGIQLGKEEVKLSLFADDMIEYLENPVAS
AQNLLKLISNCSKVSGYKINVQKSQAFLCATNRQTESQIMSELPFTIASKRIEYLGIQLT
GDVKDLFKEDYKPLLHEIKEDENKWKNIPCSRIGRINIVKIAILPKVIYRFNAIPIKLPM
TFFTELEKLLQNSHGTKNEPALPRRS

>gi568815590f:85365271_85580786|GENSCAN_predicted_CDS_3|801_bp
atgattatctcaatagatgcagaaaagacctttgacaaaattcaacagcccttcgttcta
aaaaccctcaataagttaagtattgataggacgtatctcaaaataatacgagctacttat
gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgcaa
actggcacaagacagggatgccctctctcaccactcctatttgacatagtgttggaagtt
ctggccagggcaatcgggcaggagaaagaaataaagggtattcaattagggaaagaggaa
gtcaaattgtccctgtttgcagatgacatgattgaatatttagaaaaccccgtcgcctca
gcccaaaatctcctcaagctgataagcaactgcagcaaagtctcaggatacaaaatcaat
gtgcaaaaatcacaagcattcttatgcgccactaacagacaaacagagagccaaatcatg
agtgaactcccattcacaattgcttcaaagagaatagaatacctaggaatccaacttaca
ggtgatgtgaaggacctcttcaaggaggactacaaaccactgctccatgaaataaaagag
gacgaaaacaaatggaagaacattccatgctcacggataggaagaatcaatatcgtgaaa
atagccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg
actttcttcacagagttggaaaaactccttcaaaattcacatggaaccaaaaacgagccc
gcattgccaagacgatcctaa

>gi568815590f:85365271_85580786|GENSCAN_predicted_peptide_4|99_aa
MGLWKRPNLRLIGVPESDRENGTKLENTLQDIIQENFPNLARQANIQIQEIQRIPQRYSS
RRATPRHIIVRFTKVEMKEKTLRAAREKGQVTHKGSPSD

>gi568815590f:85365271_85580786|GENSCAN_predicted_CDS_4|300_bp
atgggactatggaaaagaccaaatctgcgtctgattggtgtacctgaaagtgacagggag
aatggaaccaagctggaaaacactctgcaggatattatccaggaaaacttccccaatcta
gcaaggcaggcgaacattcaaattcaggaaatacagagaataccacaaagatactcctcg
agaagagcaactccaagacacataattgtcagattcaccaaagtggaaatgaaggaaaaa
acgttaagggcagccagagagaaaggtcaagttacccacaaaggaagcccatcagactaa

>gi568815590f:85365271_85580786|GENSCAN_predicted_peptide_5|284_aa
MAKEWGYASHNGPDHWHELFPNAKGENQSPVELHTKDIRHDPSLQPWSVSYDGGSAKTIL
NNGKTCRVVFDDTYDRSMLRGGPLPGPYRLRQFHLHWGSSDDHGSEHTVDGVKYAAELHL
VHWNPKYNTFKEALKQRDGIAVIGIFLKIGHENGEFQIFLDALDKIKTKGKEAPFTKFDP
SCLFPACRDYWTYQGSFTTPPCEECIVWLLLKEPMTVSSDQMAKLRSLLSSAENEPPVPL
VSNWRPPQPINNRVERKPTIGELGSLPPSGAPYSKSISFFHTEQ

>gi568815590f:85365271_85580786|GENSCAN_predicted_CDS_5|855_bp
atggccaaggagtggggctacgccagtcacaacggtcctgaccactggcatgaacttttc
ccaaatgccaagggggaaaaccagtcgcccgttgagctgcatactaaagacatcaggcat
gacccttctctgcagccatggtctgtgtcttatgatggtggctctgccaagaccatcctg
aataatgggaagacctgccgagttgtatttgatgatacttatgataggtcaatgctgaga
gggggtcctctccctggaccctaccgacttcgccagtttcatcttcactggggctcttcg
gatgatcatggctctgagcacaccgtggatggagtcaagtatgcagcggagcttcatttg
gttcactggaacccgaagtataacacttttaaagaagccctgaagcagcgcgatgggatc
gctgtgattggcatttttctgaagataggacatgagaatggcgagttccagattttcctt
gatgcattggacaagattaagacaaagggcaaggaggcgcccttcacaaagtttgaccca
tcctgcctgttcccggcatgccgggactactggacctaccagggctcattcaccacgccg
ccctgcgaggaatgcattgtgtggctgctgctgaaggagcccatgaccgtgagctctgac
cagatggccaagctgcggagcctcctctccagtgctgagaacgagcccccagtgcctctt
gtgagcaactggcgacctccacagcctatcaataacagggtggaaaggaaacctaccatt
ggagagcttggttccttgcctccttctggtgctccttactccaagtctatttcatttttc
cacactgagcaatga

>gi568815590f:85365271_85580786|GENSCAN_predicted_peptide_6|103_aa
MAVNTEGKGLSGTSTVPNTLGMKYMSNSDISAIGKKEQIAGLRVQATCKTRLWVTWVFGS
AMDSRRFLNDNPGKPLYIHKLCTQFLAPINAISPGPSEESHSN

>gi568815590f:85365271_85580786|GENSCAN_predicted_CDS_6|312_bp
atggctgtgaatactgaaggaaagggcttatcgggcacaagcacagtgcccaacacactg
ggcatgaaatacatgtcgaacagtgatatcagtgccataggaaagaaggaacagatagca
ggactgagggtacaggcgacctgcaaaacacggctctgggtcacctgggtctttgggtca
gcaatggactccagacgctttttgaatgataatcccgggaagccactctacattcacaaa
ctttgcactcaattcttggcacctatcaacgccatatctccagggccaagtgaagagtct
catagcaactaa

>gi568815590f:85365271_85580786|GENSCAN_predicted_peptide_7|409_aa
MGTGDERRRGSGSRAVPAAARLRSREGDGVRVEAPRPGLDGGAQEPPQPPHAQSSELRCQ
PRTPGPCRPPEPPPPGRPRARSWREPIKAGAGATRGHTVQAPKPPPPDRCRFLPCPDRQR
DHVPSLGVRQTQRSGSSEPAEQRSMRLAHLAHLWRVWGFPGPEHWHKDFPIAKGERQSPV
DIDTHTAKYDPSLKPLSVSYDQATSLRILNNGHAFNVEFDDSQDKAVLKGGPLDGTYRLI
QFHFHWGSLDGQGSEHTVDKKKYAAELHLVHWNTKYGDFGKAVQQPDGLAVLGIFLKVGS
AKPGLQKVVDVLDSIKTKGKSADFTNFDPRGLLPESLDYWTYPGSLTTPPLLECVTWIVL
KEPISVSSEQVLKFRKLNFNGEGEPEELMVDNWRPAQPLKNRQIKASFK

>gi568815590f:85365271_85580786|GENSCAN_predicted_CDS_7|1230_bp
atggggacaggggacgaaaggcgccgggggtccgggtcccgagcagtccccgccgccgcc
agactccgcagccgggagggggatggggtgcgcgtagaggctccgcggcccgggttggac
ggaggagcccaggagccaccgcagccgccgcacgcccagagctccgagcttcgctgccag
cccaggacaccggggccctgccgtccacccgagccccctcccccgggccgcccccgagca
cgaagttggcgggagcctataaaagctggtgccggcgcgacccgcggacacacagtgcag
gcgcccaagccgccgccgccagatcggtgccgattcctgccctgccccgaccgccagcgc
gaccatgtcccatcactgggggtacggcaaacacaacgatctggttcttcggagccagcg
gagcagaggagcatgcgtctggcgcacctagcgcatctttggagggtgtggggcttccca
ggacctgagcactggcataaggacttccccattgccaagggagagcgccagtcccctgtt
gacatcgacactcatacagccaagtatgacccttccctgaagcccctgtctgtttcctat
gatcaagcaacttccctgaggatcctcaacaatggtcatgctttcaacgtggagtttgat
gactctcaggacaaagcagtgctcaagggaggacccctggatggcacttacagattgatt
cagtttcactttcactggggttcacttgatggacaaggttcagagcatactgtggataaa
aagaaatatgctgcagaacttcacttggttcactggaacaccaaatatggggattttggg
aaagctgtgcagcaacctgatggactggccgttctaggtatttttttgaaggttggcagc
gctaaaccgggccttcagaaagttgttgatgtgctggattccattaaaacaaagggcaag
agtgctgacttcactaacttcgatcctcgtggcctccttcctgaatccttggattactgg
acctacccaggctcactgaccacccctcctcttctggaatgtgtgacctggattgtgctc
aaggaacccatcagcgtcagcagcgagcaggtgttgaaattccgtaaacttaacttcaat
ggggagggtgaacccgaagaactgatggtggacaactggcgcccagctcagccactgaag
aacaggcaaatcaaagcttccttcaaataa

>gi568815590f:85365271_85580786|GENSCAN_predicted_peptide_8|180_aa
MMPGPQAVLSVKAKWGLRCGSTRETMTWRPVVWGKFSALVTGCLEIDSVLLGGHGGANII
LNEEKLKAFHLRTGTRQGCPLSPLLLNLVLEVLARAIRQEKEIKGILIGEGEVKLSLFAD
DMIVYLKIPKDTSKKPLELIKEFRKVSGCKINIHKSVTLLYTNSNQAENQIEISQPLHNS

>gi568815590f:85365271_85580786|GENSCAN_predicted_CDS_8|543_bp
atgatgcctggaccccaagctgtcctcagtgtgaaggccaaatggggcctgcgctgtggg
agcactcgagagacaatgacgtggaggcctgtggtctggggcaagttctcagccctggtc
actggctgcctggaaatagactcggtgctgctggggggtcacggtggggccaacattata
ctgaatgaggaaaagttgaaagctttccacctgagaactggaacaagacaaggatgccca
ctttcaccacttctgttaaacttggtactggaagtcctcgccagagcaatcagacaagag
aaagaaataaagggcatcctaatcggtgaaggggaagtcaaactgtcactgtttgctgat
gatatgatcgtatacctaaaaatccctaaagacacctccaaaaagccactagaactgata
aaagaattcaggaaagtttcaggatgcaagattaatatacacaaatcagtaactctgcta
tacaccaacagcaaccaagctgagaatcaaattgagatatctcaacctcttcacaatagc
taa

>gi568815590f:85365271_85580786|GENSCAN_predicted_peptide_9|92_aa
MATTLTELIAQQRFESAMVENSFYVSSTILGTGNPEAGCCHSLLDVRQANFGENLVYSLN
DNRPSQKLNYPCKTKERPQSYKDERVLNSTTM

>gi568815590f:85365271_85580786|GENSCAN_predicted_CDS_9|279_bp
atggctactaccctcacagagctgattgcccagcaaaggtttgaaagtgctatggtggag
aacagcttctatgtgtcaagcactatactaggcactgggaatccagaggctggctgttgt
cactcactcctggacgtacgccaagctaactttggggaaaatttagtttatagtttaaat
gataataggccttcccaaaaactaaactacccttgtaaaactaaagaaaggccacaaagt
tacaaggatgagagggtcctgaactctactacaatgtag

>gi568815590f:85365271_85580786|GENSCAN_predicted_peptide_10|85_aa
MTPKYSSGEKQYFELSDDLNVFYYGSRLKGLNGKRMDGCTAALVDAECPLGDTISNGPSV
LFQCPDHPPVASNPQTSFFLIEDLS

>gi568815590f:85365271_85580786|GENSCAN_predicted_CDS_10|258_bp
atgactcctaaatattcttctggggaaaagcaatattttgaactgagcgacgatttaaat
gttttctactatggaagcaggctgaagggcttgaatgggaagaggatggatggttgcacg
gctgccctggtagatgctgaatgtcctctgggtgataccattagtaatggtccatctgtt
ctcttccagtgccctgatcaccctcctgtggccagtaatcctcaaacctcattttttctt
attgaggatttaagctga

>gi568815590f:85365271_85580786|GENSCAN_predicted_peptide_11|191_aa
MVVSEETILKKILKEGPLKKTRVFKGQARVQEHVNVSGELACLGQLKSFCKRAEETSERD
SNLTKGFRYKGLDSGMAPHSCSKSIHPRRGPRQKCGCHLGSPVFVPNINLVNKVLSQICP
LLTMPTLTPPIQLLIRNPEVFQDLTASESMKIKTDTPEDPWQLFLKGFSGLSLLKWGSEE
PEFWNDYSRRL

>gi568815590f:85365271_85580786|GENSCAN_predicted_CDS_11|576_bp
atggttgtctctgaagagacaatcttaaagaagatccttaaagaaggtcccttaaagaag
accagggtctttaagggacaagccagggttcaagagcacgtgaatgtgtctggggaactt
gcctgcctgggccagctaaaatctttttgcaagagggctgaagaaacttccgagagagac
tccaatcttaccaaaggattcagatacaagggacttgactcaggaatggctccacattca
tgcagcaaatccatccatccacgcaggggcccaagacagaaatgtggatgccatcttggt
tcccctgtttttgttcccaacatcaacttggtaaataaggttctcagtcaaatctgccca
cttctcaccatgcctaccctgactcctccgattcagttactgatcagaaatcctgaggtg
ttccaagatctaacagcttccgaaagcatgaaaattaagactgacactccggaggatcct
tggcagctcttcctcaaaggcttctctgggctttcgcttcttaagtggggaagtgaagaa
ccagaattctggaatgattattcacgaaggctctga