GENSCAN 1.0	Date run: 4-Nov-116	Time: 20:32:16

Sequence gi568815597r:162274051_162476790 : 202740 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   8894   8941   48  1  0   81   94    18 0.818   2.66
 1.02 Intr +  13294  13386   93  0  0   95  115   112 0.763  14.76
 1.03 Intr +  26598  26656   59  2  2   87  110    72 0.435   6.98
 1.04 Intr +  58967  59075  109  2  1  100   98   107 0.941  13.19
 1.05 Intr +  69785  69926  142  1  1  117   96   166 0.973  20.23
 1.06 Intr +  80539  80562   24  2  0   98   77    30 0.580   0.90
 1.07 Intr +  81137  81303  167  0  2  124  102   139 0.995  18.58
 1.08 Intr +  82910  83086  177  1  0  138   17   156 0.869  13.62
 1.09 Intr +  91354  91519  166  0  1  141   74   222 0.999  25.63
 1.10 Term +  93002  93417  416  0  2  113   43   721 0.997  65.52
 1.11 PlyA +  94879  94884    6                               1.05

 2.05 PlyA -  96239  96234    6                               1.05
 2.04 Term - 100566  99998  569  1  2  108   44   674 0.652  59.58
 2.03 Intr - 100874 100683  192  0  0   68   28    86 0.324   0.06
 2.02 Intr - 101353 101240  114  2  0   76   94    35 0.832   3.32
 2.01 Init - 102740 102638  103  2  1   59   97    31 0.765   0.36
 2.00 Prom - 103731 103692   40                              -3.86

 3.00 Prom + 105221 105260   40                              -5.66
 3.01 Init + 106672 106681   10  0  1   78   65    21 0.469  -1.68
 3.02 Intr + 107841 108168  328  1  1  108  105   218 0.885  20.26
 3.03 Term + 109132 109633  502  1  1   73   43   229 0.902  10.85
 3.04 PlyA + 110729 110734    6                              -1.75

 4.07 PlyA - 112690 112685    6                              -0.45
 4.06 Term - 112864 112816   49  0  1  105   55    10 0.211  -3.82
 4.05 Intr - 113466 113336  131  0  2   98   99    49 0.224   6.49
 4.04 Intr - 116715 116518  198  0  0    5   23   155 0.091   0.25
 4.03 Intr - 125037 124957   81  0  0  121   -8    68 0.314   0.33
 4.02 Intr - 128752 128689   64  0  1   96  110    40 0.719   5.72
 4.01 Init - 137966 137833  134  2  2   54   80    97 0.620   5.12
 4.00 Prom - 141488 141449   40                              -3.56

 5.00 Prom + 154470 154509   40                              -4.06
 5.01 Init + 159368 159572  205  1  1   97   44   180 0.059  11.81
 5.02 Intr + 171589 171831  243  2  0   68   70   230 0.418  16.67
 5.03 Intr + 172699 172811  113  2  2  150   47    12 0.904   3.40
 5.04 Term + 175216 175479  264  0  0   64   37   275 0.916  15.51
 5.05 PlyA + 175563 175568    6                               1.05

 6.00 Prom + 176561 176600   40                              -4.96
 6.01 Sngl + 177464 178909 1446  1  0   74   43   545 0.850  42.34
 6.02 PlyA + 178961 178966    6                               1.05

 7.00 Prom + 179175 179214   40                             -10.55
 7.01 Sngl + 179331 179978  648  2  0   58   47   275 0.735  16.58
 7.02 PlyA + 180022 180027    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 159368 159697  330  1  0   97   44   201 0.870  10.92
S.002 Init + 170593 170641   49  0  1   99  113     9 0.910   3.87

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:162274051_162476790|GENSCAN_predicted_peptide_1|466_aa
MNFMVKNEGDLLPMKQYEFKAKNIKKKKVSIMVSVDGVKVILKKKKKKKEWTWDESKMLV
MQDPIYRIFYVSHDSQDLKIFSYIARDGASNIFRCNVFKSKKKSQAMRIVRTVGQAFEVC
HKLSLQHTQQNADGQEDGESERNSNSSGDPGVFLASAVGRQLTGAERASTATAEETDIDA
VEVPLPGNDVLEFSRGVTDLDAVGKEGGSHTGSKVSHPQEPMLTASPRMLLPSSSSKPPG
LGTETPLSTHHQMQLLQQLLQQQQQQTQVAVAQVHLLKDQLAAEAAARLEAQARVHQLLL
QNKDMLQHISLLVKQVQELELKLSGQNAMGSQDSLLEITFRSGALPVLCDPTTPKPEDLH
SPPLGAGLADFAHPAGSPLGRRDCLVKLECFRFLPPEDTPPPAQGEALLGGLELIKFRES
GIASEYESNTDESEERDSWSQEELPRLLNVLQRQELGDGLDDEIAV
>gi568815597r:162274051_162476790|GENSCAN_predicted_CDS_1|1401_bp
atgaattttatggttaagaatgagggagacctgcttcctatgaagcagtatgagtttaaa
gccaagaacatcaagaagaagaaagtgagcattatggtttcagtggatggagtgaaagtg
attctgaagaagaagaaaaagaaaaaggaatggacgtgggatgagagcaagatgctggtg
atgcaggaccccatctacaggatcttctatgtctctcatgattcccaagacttgaagatc
ttcagctatatcgctcgagatggtgccagcaatatcttcaggtgtaacgtctttaaatcc
aagaagaagagccaagctatgagaatcgttcggacggtggggcaggcctttgaggtctgc
cacaagctgagcctgcagcacacgcagcagaatgcagatggccaggaagatggagagagc
gagaggaacagcaacagctcaggagacccaggcgtcttcctggcttctgctgttggccgc
cagctcactggagccgagagggcctccacggccactgcagaggagactgacatcgatgcg
gtggaggtcccacttccagggaatgatgtcctggaattcagccgaggtgtgactgatcta
gatgctgtagggaaggaaggaggctctcacacaggctccaaggtttcgcacccccaggag
cccatgctgacagcctcacccaggatgctgctcccttcttcttcctcgaagcctccaggc
ctgggcacagagacaccgctgtccactcaccaccagatgcagctcctccagcagctcctc
cagcagcagcagcagcagacacaagtggctgtggcccaggtacacttgctgaaggaccag
ttggctgctgaggctgcggcgcggctggaggcccaggctcgcgtgcatcagcttttgctg
cagaacaaggacatgctccagcacatctccctgctggtcaagcaggtgcaagagctggaa
ctgaagctgtcaggacagaacgccatgggctcccaggacagcttgctggagatcaccttc
cgctccggagccctgcccgtgctctgtgaccccacgacccctaagccagaggacctgcat
tcgccgccgctgggcgcgggcttggctgactttgcccaccctgcgggcagccccttaggt
aggcgcgactgcttggtgaagctggagtgctttcgctttcttccgcccgaggacaccccg
cccccagcgcagggcgaggcgctcctgggcggtctggagctcatcaagttccgagagtca
ggcatcgcctcggagtacgagtccaacacggacgagagcgaggagcgcgactcgtggtcc
caggaggagctgccgcgcctgctgaatgtcctgcagaggcaggaactgggcgacggcctg
gatgatgagatcgccgtgtag
>gi568815597r:162274051_162476790|GENSCAN_predicted_peptide_2|325_aa
MENFSLLSISGPPISSSALSAFPDIMFSRATSLPDIAKTAVPTEASSPAQALPPQYQSII
VRQGIQNTALSPEESVDMVSPGGEGLDKYQGLPTKSSLSCPKIQRPFVPLEKECSEKECP
RLLLPLSCVCSQNCNRDCSLGDTQHGEKLRRNCTIYRPWFSPYSYFVCADKESQLEAYDF
PEVQQDEGKWDNCLSEDMAENICSSSSSPENTCPREATKKSRHGLDSITSQDILMASRWH
PAQQNGYKCVACCRMYPTLDFLKSHIKRGFREGFSCKVYYRKLKALWSKEQKARLGDRLS
SGSCQAFNSPAEHLRQIGGEAYLCL
>gi568815597r:162274051_162476790|GENSCAN_predicted_CDS_2|978_bp
atggagaacttctcactcctcagcatctctggacctccaatctcttcctccgccctgagt
gcttttcccgacattatgttctctcgtgccaccagcctgccagacattgcaaagacagca
gtacccactgaggcatccagcccagctcaggccctgccaccccagtaccaaagcatcatt
gtcaggcaagggatacagaacacagcgctctcaccagaggagtcagtggacatggtgtct
ccaggaggggaaggcctcgacaaataccaaggcttaccaactaaaagcagcctctcctgc
cccaagatccaacggccctttgtacccctagaaaaagaatgcagtgagaaggaatgccct
cggctgctcttaccactgtcttgcgtctgctctcagaactgcaatcgtgactgcagcttg
ggggatacccagcacggagagaagctgaggcggaactgcactatctaccggccctggttc
tccccctacagctacttcgtgtgtgcagacaaagagagccagctggaggcctatgacttc
ccagaggtgcagcaggatgagggcaagtgggacaactgcctttctgaggacatggctgag
aacatctgttcgtcctcttcctccccagagaacacttgccctcgagaagccaccaagaaa
tccaggcatggcctggactccatcacatcccaggacatcctaatggcttccaggtggcac
ccagcacagcagaatggctacaagtgcgtggcctgctgccgcatgtaccccaccctggac
ttcctcaagagccacatcaagaggggcttcagggagggcttcagctgcaaggtgtactac
cgcaagctcaaagccctctggagcaaggagcagaaggcccggctgggagacaggctctcc
tccggcagctgccaggccttcaatagtcctgctgaacaccttaggcaaattggcggtgaa
gcctacttatgtctctag
>gi568815597r:162274051_162476790|GENSCAN_predicted_peptide_3|279_aa
MAKFDHGMFENLNTALTPKLQASRSFPHLSKPVAPGSAPLGSGEPGGPGLWVGSSQHLKN
LGKAMGAKVNDFLRRKEPSSLGSVGVTEINKTAGAQLASGTDAAPEAWLEDERSVLQETF
PRLDPPPPITRKRTPRALKTTQDMLISSQPVLSSLEYGTEPSPGQAQDSAPTAQPDVPAD
ASQPEATMEREERGKVLPNGEVSLSVPDLIHKDSQDESKLKMTECRRASSPSLIERNGFK
LSLSPISLAESWEDGSPPPQARTSSLDNEGPHPDLLSFE
>gi568815597r:162274051_162476790|GENSCAN_predicted_CDS_3|840_bp
atggccaagtttgaccacggcatgtttgagaatttgaacacagccctcactccaaagctc
caggccagccgctccttcccccacttgtccaagcccgtggcccccggctctgcccctctg
ggctctggtgagcctggggggccaggactctgggtgggcagcagccagcacctcaagaac
ctgggcaaagccatgggggccaaagtgaatgacttcctgaggagaaaggagccctccagc
ctgggcagtgtgggtgtgacagagatcaacaagactgcaggagcacagctggccagtggg
actgacgcggctccagaggcttggctagaggatgaaaggtcagtcctgcaagaaacattt
cctcggctggatcctccacctcccataaccagaaagcgaacccctcgggccctgaagacc
acccaggacatgctgatttcatcacagcctgtcctcagcagtctggagtatgggacagag
ccatcacctgggcaggcccaggactccgctcccactgcccagcctgacgtcccagcagac
gcttcacagccagaggccaccatggaaagagaagagagaggcaaagttctgcccaatgga
gaggtttccctgtcagtacctgacctaatccacaaggatagccaggacgaatccaagcta
aagatgactgagtgcagaagggcctcctcccccagccttatcgagaggaatggcttcaaa
ctcagcttgagccccatcagcctggctgagtcctgggaggatggcagcccccctcctcag
gcacggacctccagcctcgacaatgagggccctcacccagacctgctgtcctttgaatag
>gi568815597r:162274051_162476790|GENSCAN_predicted_peptide_4|218_aa
MDLPYYHGRLTKQDCETLLLKEGVDGNFLLRDSESIPGVLCLCVSFKNIVYTYRIFREKH
GYYRIQTAEGSPKQVFPSLKELISKFEKPNQGMCIFIELLQYARHCAGNRGVRDEDGMVP
APELRDLGERSMRMKRLSMTHGEYRDGGTHRMLRERREGEDKGLANSQLCGSSGSLDLTL
AQGQQRGKGTPGQLFSEPPSLRRLTKENANILQPTSQP
>gi568815597r:162274051_162476790|GENSCAN_predicted_CDS_4|657_bp
atggatctgccttactaccatggacgtctgaccaagcaagactgtgagaccttgctgctc
aaggaaggggtggatggcaactttcttttaagagacagcgagtcgataccaggagtcctg
tgcctctgtgtctcgtttaaaaatattgtctacacataccgaatcttcagagagaaacac
gggtattacaggatacagactgcagaaggttctccaaaacaggtctttccaagcctaaag
gaactgatctccaaatttgaaaaaccaaatcaggggatgtgcatctttatcgagctcctg
caatatgccagacactgtgctgggaaccgaggagtcagggatgaagatggcatggtccct
gcccctgagctcagagacctgggggagagaagcatgaggatgaagcgactttctatgaca
catggagagtaccgtgatggaggaacacataggatgctacgggaacgcagagaaggggag
gataaaggcctggccaattctcagctgtgtgggtcctctggatccttggatctgacactg
gcccaggggcagcaaaggggcaaagggactccaggccagctcttctctgagccaccttct
ctccgcaggcttacaaaggaaaatgccaacatacttcagccaacaagtcagccttga
>gi568815597r:162274051_162476790|GENSCAN_predicted_peptide_5|274_aa
MPEPSPASVGSCAAPASPTSAAPCSKAPSPTDHPRADECERMARDWQAAPPAAPVQDPLD
EASWAPESGERIQFITWLCNGTSFAFLEPYEGKSPKIYVTHPKWQKRLSFTQSYSPQLSN
LEMENIGFYSAQIATETSAKLSSYTLRIFSSKAAEGTYCPVKWIFLGNRLLLLVFLGVLR
TWHIQAQECSSSPAMEQSWMENDFDELREEGFRRSNYFELKEEVQTHGKEVKNLEKRLDE
LLTRITNAEKSLKDLKKLKTTARELRDKCTSFSS
>gi568815597r:162274051_162476790|GENSCAN_predicted_CDS_5|825_bp
atgcctgagccttcccccgcctccgtgggctcctgtgcagccccagcctccccgacgagc
gccgccccctgctccaaggcgcccagtcccaccgaccacccaagggctgatgagtgcgag
cgcatggcgcgggactggcaggcagctccacctgcagccccagtgcaggatccactggat
gaagccagctgggctcctgagtctggagagaggatccagttcatcacttggctttgcaat
ggaacatcttttgccttcctagaaccctatgaaggcaaaagtccaaaaatctacgtgact
catccgaaatggcaaaagcgactgagcttcacccagtcctactccccgcagctcagcaac
ctggagatggaaaacataggcttttacagtgcccagatagccacagagacctctgcaaag
ctgtccagttacactctgaggatattcagttccaaggcagctgaaggcacctattgccca
gtgaaatggattttcctggggaacaggcttcttctccttgtgttccttggtgtcctacga
acttggcatattcaggcacaggaatgcagctcctcaccagcaatggaacaaagctggatg
gagaatgactttgatgagttgagagaagaaggcttcagacgatcaaactactttgagcta
aaggaggaagttcaaacccatggcaaagaagttaaaaaccttgaaaaaagattagatgaa
ttgctaactagaataacaaatgcagagaagtccttaaaagacctgaagaagctgaaaacc
acagcacgagaactacgtgacaaatgcacaagcttcagtagctga
>gi568815597r:162274051_162476790|GENSCAN_predicted_peptide_6|481_aa
MGGRAEALLTSQTGLPGRGAPHFPDRAAGQRCSSLPRRGGQAEALLTSQTMGGWAIMGDF
NTPLSTLDRSRRQKVNKDIRELNSALHQADLIDIYRTLHPFSIHTECISTEYTFFSAPHS
TYSKIDHIVGSKALLSKCKRTEITTNCLLDHSAIKLGLRIKKLTQNRSTTWKLNNLLLND
YWVHNEMKAEIKMFFETKENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTS
QLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRNWFFEKINRIDRLLAR
LIKKKREKNQIDAIENDKGDITTNPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLP
RLNQEEVESLNTPITGSEIEAIINSLPNKKCPGPEAFTAEFYQRYKKELVPFLLKVLQSI
EKEGILPISFYEASIILIPKSGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKAYP
P
>gi568815597r:162274051_162476790|GENSCAN_predicted_CDS_6|1446_bp
atgggcggccgggcagaggcgctcctcacctcccagacggggctgccgggcagaggcgct
cctcacttcccagacagggcagccgggcagaggtgctcctcacttcccagacggggcggc
caggcagaggcgctcctcacatcccagacgatgggtggctgggcaataatgggagacttt
aacaccccactgtcaacattagacagatcaaggagacagaaagttaacaaggatatccgg
gaattgaactcagctctgcaccaagcagacctaatagacatctacagaactctccaccca
ttctccatccatacagaatgtatatcaacagaatatacattcttctcagcaccacacagc
acttattccaaaattgaccacatagttggtagtaaagcactcctcagcaaatgtaaaaga
acagaaattacaacaaactgtctcttagaccacagtgcaatcaagctaggactcaggatt
aagaaactcactcaaaaccgctcaactacatggaaactgaacaaccttctcctgaatgac
tactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaaagagaac
aaagacacaacataccagaatctctgggacacattcaaagcagtgtgtagagggaaattt
atagcactaaatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatca
caattaaaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaa
ataactaagatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatc
aatgaatccaggaactggtttttcgaaaagatcaacagaattgatagactgctagcaaga
ctaataaagaagaaaagagagaagaatcaaatagatgcaatagaaaatgataaaggggat
atcaccaccaatcccacagaaatacaaactaccatcagagaatactataaacacctctac
gcaaataaacttgaaaatctagaagaaatggataaattcctcgacacatacaccctccca
agactaaaccaggaagaagttgaatctctgaatacaccaataacaggctctgaaattgag
gcaataattaatagcctaccaaacaaaaaatgtccaggaccagaagcattcacagccgaa
ttctaccagaggtacaagaaggagctggtaccattccttctgaaagtattacaatcaata
gaaaaagagggaatcctccctatctcattttatgaggccagcatcatcctgataccaaag
tcaggcagagacacaacaaaaaaagagaattttagaccaatatccctgatgaacatcgat
gcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaagcttatcca
ccatga
>gi568815597r:162274051_162476790|GENSCAN_predicted_peptide_7|215_aa
MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQPFLYTNNRQTESQIMSELPFTIAS
KRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIVEMAILPKVIY
RFNAIPIKIPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKPTV
TKTAWYWYQNRDTDQWNRTESSEINNATYLQLSDL
>gi568815597r:162274051_162476790|GENSCAN_predicted_CDS_7|648_bp
atgattgtatatctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagc
aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaaccattcttatac
accaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca
aagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggag
aactacaaaccactgctcaacgaaataaaagaggatacaaacaaatggaagaacattcca
tgctcatggataggaagaatcaatatcgtggaaatggccatactgcccaaggtaatttat
agattcaatgccatccccatcaagataccaatgactttcttcacagaattggaaaaaact
actttaaagttcatatggaaccaaaaaagagcccgcattgccaagtcaatcctaagccaa
aagaacaaagctggaggcatcacactacctgacttcaaactatactacaagcctacagta
accaaaacagcatggtactggtaccaaaacagagatacagaccaatggaacagaacagag
tcctcagaaataaataatgccacatatctacaactatctgatctttga