GENSCAN 1.0    Date run: 3-Nov-116    Time: 20:28:29

Sequence gi568815586r:74943108_75307983 : 364876 bp : 34.76% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -     77     72    6                               1.05
 1.02 Term -  42205  41609  597  1  0  -25   41   260 0.735   3.54
 1.01 Init -  43135  43046   90  1  0   67   85   122 0.789  10.34
 1.00 Prom -  51418  51379   40                              -2.15

 2.03 PlyA -  51655  51650    6                               1.05
 2.02 Term -  60279  59183 1097  1  2   67   54   386 0.330  24.35
 2.01 Init -  66732  66567  166  0  1   35   98    79 0.198   3.44
 2.00 Prom -  69101  69062   40                              -4.95

 3.04 PlyA -  70763  70758    6                               1.05
 3.03 Term -  82441  82320  122  2  2   58   37   138 0.613   3.36
 3.02 Intr -  85682  85553  130  2  1   83   40    84 0.014   2.45
 3.01 Init -  92811  92719   93  0  0   68   97    40 0.011   3.33
 3.00 Prom -  98438  98399   40                              -3.75

 4.07 PlyA -  99754  99749    6                               1.05
 4.06 Term - 100134  99998  137  1  2   74   40    79 0.433  -1.10
 4.05 Intr - 105159 105046  114  1  0   31  102    60 0.517   1.20
 4.04 Intr - 108210 107283  928  0  1   95  110   409 0.958  33.66
 4.03 Intr - 147747 147538  210  0  0   66   68   142 0.067   8.19
 4.02 Intr - 164837 164722  116  0  2   52   98    21 0.138  -1.25
 4.01 Init - 165354 165321   34  0  1   70  115    30 0.222   3.99
 4.00 Prom - 175326 175287   40                              -4.55

 5.00 Prom + 176985 177024   40                              -6.75
 5.01 Init + 181389 181446   58  2  1   56  109    93 0.783   9.72
 5.02 Intr + 219852 219942   91  1  1   28   84   116 0.014   3.33
 5.03 Intr + 241425 241474   50  0  2  104   84     9 0.000  -0.49
 5.04 Term + 254602 254744  143  2  2   69   48   112 0.028   2.41
 5.05 PlyA + 255617 255622    6                               1.05

 6.05 PlyA - 255902 255897    6                               1.05
 6.04 Term - 257948 257869   80  0  2   56   48    75 0.025  -2.75
 6.03 Intr - 258113 258076   38  1  2  128   71    47 0.150   3.99
 6.02 Intr - 263747 263566  182  2  2   50   75    45 0.171  -2.86
 6.01 Init - 264876 264190  687  0  0   81   86   806 0.993  74.82
 6.00 Prom - 265689 265650   40                             -15.32

 7.04 PlyA - 265712 265707    6                              -3.64
 7.03 Term - 266450 266101  350  0  2   45   45   293 0.472  14.36
 7.02 Intr - 274930 274847   84  2  0   51   88    55 0.397   0.67
 7.01 Init - 275867 275735  133  2  1   78   47    96 0.523   4.85
 7.00 Prom - 275990 275951   40                              -3.65

 8.03 PlyA - 277152 277147    6                               1.05
 8.02 Term - 289303 288938  366  1  0   -3   44   277 0.036   7.82
 8.01 Init - 315854 315711  144  2  0   79   79   124 0.886  10.77
 8.00 Prom - 330528 330489   40                              -6.45

 9.08 PlyA - 331634 331629    6                               1.05
 9.07 Term - 335958 335783  176  1  2   62   34   136 0.337   2.54
 9.06 Intr - 339240 339144   97  0  1   57   91   -10 0.275  -5.14
 9.05 Intr - 341973 341854  120  0  0   72   88   107 0.981   8.87
 9.04 Intr - 346668 346514  155  1  2   78   95   101 0.998   8.67
 9.03 Intr - 348713 348637   77  1  2   52   95    53 0.724   0.64
 9.02 Intr - 350260 350142  119  1  2   79  108    37 0.654   3.14
 9.01 Init - 355334 355269   66  2  0   85   92    10 0.574   2.22
 9.00 Prom - 357024 356985   40                              -4.45

10.03 PlyA - 358326 358321    6                               1.05
10.02 Term - 362689 362415  275  2  2   49   46   206 0.253   7.15
10.01 Init - 362899 362773  127  1  1   71   53   215 0.246  14.67

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 240824 240740   85  2  1   46  105    87 0.907   7.23
S.002 Term - 249191 249111   81  2  0   61   42   140 0.918   3.41

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:74943108_75307983|GENSCAN_predicted_peptide_1|228_aa
MAVAACIERQLPRCQLQCGGTGSTARSRELLNTHRDTLAVDKCYPLQVSSELFYCSIKLL
FALLTLHLSAFLIFPGHWTRTQDPPNGRAEHALCNTNRAETHPLLTMLWVTRRREELQSF
REPRSRSSPSQDCDTLFGALWFLVPSSFWVPLHSLVPSVEAACGMSGPAVASQGAGSCAG
DWSCPPHCSQQLCAVAGPDACLLTHPFQLHLPLAGIGSRLVESCDRRM
>gi568815586r:74943108_75307983|GENSCAN_predicted_CDS_1|687_bp
atggcagtggcagcctgtatagagcggcagctgccacgatgccagctgcagtgtggaggc
acaggcagcactgcacgctccagggagctgctgaacactcatcgggacaccctggctgtg
gacaaatgctacccacttcaagtctcctctgagctgttctattgctcaataaagctcctc
ttcgccttgctaaccctccacttgtcagcattcctcatttttcctgggcactggacaaga
actcaggacccaccaaatggcagggctgaacatgccctttgtaacacaaacagggctgaa
acacaccccttgctcaccatgttgtgggtgacaagaaggagagaagagctgcagtccttc
agggagcccagatctaggagttccccaagccaggactgtgacaccctctttggggctctg
tggttcctggtgccttcaagcttctgggtgccactgcattccctagtgccatctgtggaa
gctgcttgtgggatgtctggtccagctgtagcctcgcaaggagctggctcttgtgctgga
gactggagctgcccaccccactgcagccagcagctatgtgcagtggccggacctgatgct
tgcttgctcacacatcccttccagctccacttgcccttggcaggcataggatccaggctg
gtagaatcctgtgacagaaggatgtaa
>gi568815586r:74943108_75307983|GENSCAN_predicted_peptide_2|420_aa
MGESSWSRVEPEMIPRQAQIQGRAQPLLQVQWSCRCLVARAHTHPRETLGRKSIKVLEVL
ARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINV
QKSQALLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTREVKDLFKENYKPLPSEIKED
TNKWKNIPSSWVGRINIVKMAILPKVIYKFNAIPIKLPITSFTELEKTTLKIIWNQKRAH
ITKSILSQKNKARGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRKQPSEIMPHIYNYL
IFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIK
TLEENLGITIQDIGMGKDFMSKTPKAMATKAKIERWDLIKLKSFCTAKEITIRVNRQPTE
>gi568815586r:74943108_75307983|GENSCAN_predicted_CDS_2|1263_bp
atgggagagagctcctggtcaagggtggagcctgaaatgattccccgtcaggcccaaatc
cagggacgtgcccaacctctattgcaagtacagtggtcctgccgctgccttgtagctcgt
gcacacacacatccacgagagactctgggtaggaaaagtattaaagtgttggaagttctg
gccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtc
aaattatccctgtttgcagatgacatgattgtatatctagaaaaccccattgtctcagcc
caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgta
caaaaatcacaagcactcttatacaccaataacagacaaacagagagccaaatcatgagt
gaactcccattcacaattgcttcaaagagaataaaatacttaggaatccaacttacaagg
gaagtgaaggacctcttcaaggaaaactacaaaccactgcccagtgaaataaaagaggat
acaaacaaatggaagaacattccatcctcatgggtaggaagaatcaatatcgtgaaaatg
gccatactgcccaaggtaatttataaattcaatgccatccccatcaagctaccaataact
tccttcacagaattggaaaaaactactttaaagatcatatggaaccaaaaaagagcccac
atcaccaagtccatcctaagccaaaagaacaaagccagaggcatcacactacctgacttc
aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagat
atagatcaatggaacagaaaacagccctcagaaataatgccgcatatctacaactatctg
atctttgacaaacctgagaaaaacaagcaatggggaaaggattcactatttaataaatgg
tgctgggaaaactggctggccatatgtagaaagctgaaactggatcccttccttacacct
tatacaaaaattaattcaaggtggattaaagacttaaacgttagacctaaaaccataaaa
accctagaagaaaacctaggcattaccattcaggatataggcatgggcaaggacttcatg
tctaaaacaccaaaagcaatggcaacaaaagccaaaattgagagatgggatctaattaaa
ctaaagagcttctgcacagcaaaagaaattaccatcagagtgaacaggcaacctacagaa
tga
>gi568815586r:74943108_75307983|GENSCAN_predicted_peptide_3|114_aa
MAGNGALLETSVDLCCWQIRHSAMAIPRLTLIFHFTMVSLAIKGTSLNALHDVLPISIAV
RYHTGYWQDSGEMAAMTELVPKARNSHSSSPLVPPPPVASTSYWVLLRFVDKHT
>gi568815586r:74943108_75307983|GENSCAN_predicted_CDS_3|345_bp
atggcaggaaatggtgccttattggagaccagtgttgatctctgctgctggcagattaga
cattcagcaatggccatacccaggctgaccttgatctttcattttaccatggtttctctg
gcaataaagggcactagtttaaatgctctacatgatgttcttcctatatcaatagctgtt
cgttatcacactgggtactggcaggattctggagagatggctgcaatgaccgagttagtt
cccaaagccagaaattcgcattcatcatctcccctggtgccccctccccctgtagcaagc
acaagttattgggtcctgttacgctttgtagataagcatacttaa
>gi568815586r:74943108_75307983|GENSCAN_predicted_peptide_4|512_aa
MAGSSPNLAMQVFYNVNIEFESKLDVYAGFYLLLSESKSVWRAPNFHMHQASLRMQLDTW
RLESNDDDDDGNEDDDNDDINNNLLHALIIISKLFLTSNILSFTLYIEEVRVTKTWIYIQ
FIAFASLFFILVSITTFCLETHEAFNIVKNKTEPVINGTSVVLQYEIETDPALTYVEGVC
VVWFTFEFLVRIVFSPNKLEFIKNLLNIIDFVAILPFYLEVGLSGLSSKAAKDVLGFLRV
VRFVRILRIFKLTRHFVGLRVLGHTLRASTNEFLLLIIFLALGVLIFATMIYYAERVGAQ
PNDPSASEHTQFKNIPIGFWWAVVTMTTLGYGDMYPQTWSGMLVGALCALAGVLTIAMPV
PVIVNNFGMYYSLAMAKQKLPRKRKKHIPPAPQASSPTFCKTELNMACNSTQSDTCLGKD
NRLLEHNRSERLPIRRSSTRDKNRRGETCFLLTTGDYTCASDGGIRKGYEKSRSLNNIAG
LAGNALRLSPVTSPYNSPCPLRRSRSPIPSIL
>gi568815586r:74943108_75307983|GENSCAN_predicted_CDS_4|1539_bp
atggcagggtcttctcctaatctggcaatgcaagtcttttataatgtgaatatagaattt
gaaagtaaattagatgtatatgcaggattctacttgctactttcagaaagtaagagtgtc
tggagagctccaaattttcatatgcatcaggcatctttgagaatgcagttagacacttgg
aggctagagtcaaatgatgatgatgatgatggtaatgaggatgatgataatgatgatatt
aacaacaatttgctacatgctttaattataatttctaaactattcctgacatcaaatatt
ctttccttcactctatatatcgaggaagtgagggtcacaaagacatggatttatattcag
tttattgcttttgcttctttattcttcatcctggtttcaattacaactttttgcctggaa
acacatgaagctttcaatattgttaaaaacaagacagaaccagtcatcaatggcacaagt
gttgttctacagtatgaaattgaaacggatcctgccttgacgtatgtagaaggagtgtgt
gtggtgtggtttacttttgaatttttagtccgtattgttttttcacccaataaacttgaa
ttcatcaaaaatctcttgaatatcattgactttgtggccatcctacctttctacttagag
gtgggactcagtgggctgtcatccaaagctgctaaagatgtgcttggcttcctcagggtg
gtaaggtttgtgaggatcctgagaattttcaagctcacccgccattttgtaggtctgagg
gtgcttggacatactcttcgagctagtactaatgaatttttgctgctgataattttcctg
gctctaggagttttgatatttgctaccatgatctactatgccgagagagtgggagctcaa
cctaacgacccttcagctagtgagcacacacagttcaaaaacattcccattgggttctgg
tgggctgtagtgaccatgactaccctgggttatggggatatgtacccccaaacatggtca
ggcatgctggtgggagccctgtgtgctctggctggagtgctgacaatagccatgccagtg
cctgtcattgtcaataattttggaatgtactactccttggcaatggcaaagcagaaactt
ccaaggaaaagaaagaagcacatccctcctgctcctcaggcaagctcacctactttttgc
aagacagaattaaatatggcctgcaatagtacacagagtgacacatgtctgggcaaagac
aatcgacttctggaacataacagatcagaaaggctccccatcagacgctctagtaccaga
gacaaaaacagaagaggggaaacatgtttcctactgacgacaggtgattacacgtgtgct
tctgatggagggatcaggaaaggatatgaaaaatcccgaagcttaaacaacatagcgggc
ttggcaggcaatgctctgaggctctctccagtaacatcaccctacaactctccttgtcct
ctgaggcgctctcgatctcccatcccatctatcttgtaa
>gi568815586r:74943108_75307983|GENSCAN_predicted_peptide_5|113_aa
MGSGYPSQMYDSAEISHGTVAPTKQTYLVDRRSGSADPDLSLLVQTSLTIKERENFLEVI
NLIDRAVCLVKGECTNPDNNMLLYHPISVDKVEETQKTIVIKKPSVEVQSQVL
>gi568815586r:74943108_75307983|GENSCAN_predicted_CDS_5|342_bp
atgggctctggatatccctcgcagatgtatgattctgcagaaataagccatggcacagtt
gcccccactaagcagacctatttggttgaccgtagaagtggatctgcagatccagatctg
tctctcctcgtgcagacaagcctgaccatcaaggaaagggagaacttcctggaagtcatt
aacttaattgacagagctgtttgtttagttaaaggagaatgtacaaatcctgacaacaac
atgctactttatcatccaatcagtgtggacaaagtagaggaaacacaaaagactatagtg
ataaagaaacccagcgttgaagtacaaagtcaagtactctga
>gi568815586r:74943108_75307983|GENSCAN_predicted_peptide_6|328_aa
MGKIENNERVILNVGGTRHETYRSTLKTLPGTRLALLASSEPPGDCLTTAGDKLQPSPPP
LSPPPRAPPLSPGPGGCFEGGAGNCSSRGGRASDHPGGGREFFFDRHPGVFAYVLNYYRT
GKLHCPADVCGPLFEEELAFWGIDETDVEPCCWMTYRQHRDAEEALDIFETPDLIGGDPG
DDEDLAAKRLGIEDAAGLGGPDGKSGRWRRLQPRMWALFEDPYSSRAARSRGSNWGLNLR
TEEGEMECIYVSGEVEGGKVCLEVYVVKRISRTLFGPVTFQGRRILGLFPCWVATAVVTI
ETGLRFSAGVYPTTCKGNRLDNLSRIFT
>gi568815586r:74943108_75307983|GENSCAN_predicted_CDS_6|987_bp
atgggcaagatcgagaacaacgagagggtgatcctcaatgtcgggggcacccggcacgaa
acctaccgcagcaccctcaagaccctgcctggaacacgcctggcccttcttgcctcctcc
gagcccccaggcgactgcttgaccacggcgggcgacaagctgcagccgtcgccgcctcca
ctgtcgccgccgccgagagcgcccccgctgtcccccgggccaggcggctgcttcgagggc
ggcgcgggcaactgcagttcccgcggcggcagggccagcgaccatcccggtggcggccgc
gagttcttcttcgaccggcacccgggcgtcttcgcctatgtgctcaattactaccgcacc
ggcaagctgcactgccccgcagacgtgtgcgggccgctcttcgaggaggagctggccttc
tggggcatcgacgagaccgacgtggagccctgctgctggatgacctaccggcagcaccgc
gacgccgaggaggcgctggacatcttcgagacccccgacctcattggcggcgaccccggc
gacgacgaggacctggcggccaagaggctgggcatcgaggacgcggcggggctcgggggc
cccgacggcaaatctggccgctggaggaggctgcagccccgcatgtgggccctcttcgaa
gacccctactcgtccagagccgccaggagccgtggctccaattgggggttgaatctgagg
acagaggaaggggaaatggaatgtatctatgtgtctggggaggtggagggaggaaaggtt
tgcctggaggtttatgttgtgaagagaatctccagaacgttgtttgggcctgtcactttc
caagggagacgaatactaggattgtttccctgttgggtagctactgctgtggttaccata
gaaacaggtctaagattcagtgcaggtgtatatccaaccacctgcaagggtaaccgatta
gataatctcagtcgcattttcacttga
>gi568815586r:74943108_75307983|GENSCAN_predicted_peptide_7|188_aa
MEYYAAIKKDEFMSFEGTWMQLETIILSRLSQGQKTKHCMLSLIGAIKGARAYAQYPVHG
VQLILEHQGSLGADLPAPGGVGQCRLDPPRAPWFAQPCAMVTLGAEVGDWQIRRSQNEGG
ERGRRPAGAFSPTPAPREAAQPPESGKRQQVVSRRRRAAHLGAKTQSRCPENLVHSETKA
EPLFSRSR
>gi568815586r:74943108_75307983|GENSCAN_predicted_CDS_7|567_bp
atggaatactatgcagccataaagaaggatgagttcatgtcctttgaagggacatggatg
cagctggaaaccatcattctcagcagactatcacaaggacagaaaaccaaacactgcatg
ttgtcactcataggagcaataaagggtgctagagcatatgcacaatatccagtccacgga
gtacaacttattttagaacatcaaggaagccttggagccgatctgcccgctcccggaggg
gtcgggcagtgccggctggacccgccccgagctccatggtttgcccaaccctgcgcgatg
gtgactctgggcgcggaggttggcgactggcaaatccgcagatcacagaatgaaggcggg
gagcgcggccggcggccggcgggggctttctcccccaccccagcgcccagggaagcggct
caaccacctgaatccggaaaacgccaacaagtagtttctcgtcggagaagggcggctcac
ctgggcgccaagactcagtcccgctgcccagagaacctcgtccactcggaaaccaaagca
gaaccacttttctctcggtctcgttaa
>gi568815586r:74943108_75307983|GENSCAN_predicted_peptide_8|169_aa
MGSYVKRTIRELELLETKNTVNKKKFASAPVGNGMAYSCQETSSMEEKLCTSLDSATTSF
GLSSNTRTICQETQEKPKAPGTSVQVMDPEVVHDPTLAPSSMVQEQSCLWNYLGDMTICD
LRGRTAGTGLTMDYEAALCLASDPSQLLSGSSPVHPGTQQETSGNSPTV
>gi568815586r:74943108_75307983|GENSCAN_predicted_CDS_8|510_bp
atggggtcttatgtaaaaaggacaataagagaactagagcttttggaaactaaaaataca
gtgaacaagaagaagtttgcttcagcccctgttggcaatggcatggcctacagctgccag
gagaccagtagtatggaagagaagctatgtacctctttggattcagctacaacttcattt
ggccttagttctaacaccagaaccatctgccaagaaacacaggaaaagccaaaagcacct
gggacttcagttcaggttatggaccctgaagtagtgcatgatccaactctagccccttca
agcatggtccaggagcagtcttgcctgtggaattatttgggagacatgactatctgtgac
ctcagaggtagaactgctggcactggtttgaccatggattatgaagcagctttgtgtctt
gcctctgatccctcccagctgctatctgggagcagtcctgttcacccagggacccagcaa
gagacatctggtaacagtcccactgtctag
>gi568815586r:74943108_75307983|GENSCAN_predicted_peptide_9|269_aa
MGCLQMKLYLRTLKSSSKLCHWGATLTFLSSDHLSLPESIKENTLLKLRITNIDQIALDS
LKTASMEQEDDIIIQETNDRLVFKAIQDVLKEKLHKRGVRILTGLGKYFQQLDKEGNGLL
DKADFKQALKVFHLEVSEKDFESAWLILNDNGNGKVDYGEFKRGIIGEMNEYRKSYVRKA
FMKLDFNKSGSVPIINIRKCYCAKKHSQVISGHSTEEEIKSSFLETLKVACSKSDEVSYG
EFEDYYEGLSIGIVDDEDFVNILRTPWGI
>gi568815586r:74943108_75307983|GENSCAN_predicted_CDS_9|810_bp
atgggttgtctccagatgaagttgtatcttcgtactttgaaaagttcttcaaagctgtgt
cattggggtgcaaccttgacatttttgagttctgatcatctcagccttccagaaagcatc
aaagaaaacacattacttaaactccgaatcacaaatattgatcaaatagctttggattct
ctcaaaactgcttctatggaacaggaggatgatataatcattcaagaaaccaatgatagg
ctggtcttcaaagcaattcaagatgtgctaaaagaaaaactacataaaagaggtgttcgt
attttgactggattgggaaaatattttcaacagttggacaaggaaggaaatggactttta
gataaggcagattttaagcaagctctaaaagtgtttcacttagaagtgtctgaaaaggat
tttgagtctgcatggctaattctgaatgacaatggcaatggcaaggttgattatggagaa
ttcaaacgtggtattattggtgaaatgaatgaatacaggaaatcatatgttcgaaaggcc
tttatgaaactggatttcaacaaaagtggcagtgtgcctattataaacataagaaaatgt
tactgtgcaaagaagcattctcaagtaatttcaggccattcaacagaggaagaaatcaaa
tcatcctttctagaaacattaaaagttgcctgcagcaagtctgatgaagtgtcatatggt
gaatttgaagattactatgaaggtttaagtataggaatagtagatgatgaagactttgtt
aacatcttacgtactccatgggggatttag
>gi568815586r:74943108_75307983|GENSCAN_predicted_peptide_10|133_aa
MSRRPVRTLLLLRATLSMLLGLQRFHQRRLHGVEVGQPQVQPAADDILHHVWKPDDHLLL
VLDPHVPHELVVRLLQQLCRARLGSARGLVGAALLRGLLLSLPVARGLGQWSGLEEGSAA
LPGSRTEERLPEP
>gi568815586r:74943108_75307983|GENSCAN_predicted_CDS_10|402_bp
atgagcaggaggcctgtcaggaccctgctgttgctcagagccaccctgtccatgctcctc
gggctccagcgctttcaccagcgcagacttcatggtgtggaggttggacagcctcaggtg
cagcctgctgccgacgacatccttcaccatgtctggaaaccagatgatcatcttcttctg
gtccttgacccgcatgtaccacacgagctcgtcgtgcgccttctgcagcagctgtgccgg
gccagactgggcagtgctaggggcctggtgggcgccgcccttctccgcggcctgctgctg
agccttccggtcgctcggggcctggggcagtggtcgggtctggaggagggctctgcggct
ctgccagggtcgcggacggaggaacgcctaccggaaccctag