GENSCAN 1.0	Date run: 7-Nov-116	Time: 03:21:10

Sequence gi568815596r:171224262_171492185 : 267924 bp : 39.67% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6604   6700   97  0  1   65   47    92 0.233   3.32
 1.02 Intr +   9353   9505  153  0  0   76   68    99 0.039   5.82
 1.03 Intr +  14467  14530   64  2  1   43   83    74 0.001  -0.74
 1.04 Intr +  30883  31091  209  1  2   42   14   168 0.032   2.60
 1.05 Term +  53247  53548  302  1  2   15   42   303 0.612  12.80
 1.06 PlyA +  53681  53686    6                               1.05

 2.04 PlyA -  56859  56854    6                               1.05
 2.03 Term -  61783  61761   23  2  2  109   55    19 0.033  -1.80
 2.02 Intr -  76781  76751   31  2  1   97  100    20 0.672   0.89
 2.01 Init -  77411  77262  150  2  0   62   55   268 0.648  20.45
 2.00 Prom -  87324  87285   40                              -5.75

 3.00 Prom +  87754  87793   40                             -10.05
 3.01 Sngl +  88234  88758  525  0  0  104   38   256 0.379  18.00
 3.02 PlyA +  91416  91421    6                               1.05

 4.07 PlyA -  91785  91780    6                               1.05
 4.06 Term - 100101  99911  191  1  2  113   43   163 0.974  10.93
 4.05 Intr - 101887 101781  107  0  2   52   52    63 0.794  -1.96
 4.04 Intr - 106437 106298  140  0  2   52   99    83 0.706   4.14
 4.03 Intr - 107606 107543   64  1  1   96   91    47 0.453   3.60
 4.02 Intr - 115293 114894  400  1  1   84   46   272 0.424  15.24
 4.01 Init - 125569 125518   52  1  1   60   91    54 0.671   4.27
 4.00 Prom - 126186 126147   40                              -3.65

 5.02 PlyA - 126949 126944    6                               1.05
 5.01 Sngl - 128936 128127  810  2  0   47   41   350 0.490  21.73
 5.00 Prom - 129409 129370   40                              -6.15

 6.02 PlyA - 129575 129570    6                               1.05
 6.01 Sngl - 130647 130084  564  0  0   41   40   383 0.491  24.69
 6.00 Prom - 133755 133716   40                              -3.75

 7.03 PlyA - 134494 134489    6                               1.05
 7.02 Term - 136252 136075  178  0  1   64   38   169 0.401   5.68
 7.01 Init - 151017 150818  200  0  2   75   34   269 0.010  18.62
 7.00 Prom - 152342 152303   40                              -3.35

 8.00 Prom + 156901 156940   40                              -5.15
 8.01 Init + 157081 157278  198  0  0   71   63   121 0.830   6.85
 8.02 Intr + 167332 167400   69  0  0   65   84    62 0.009   1.96
 8.03 Intr + 177500 177556   57  1  0   54   97    61 0.009   1.76
 8.04 Intr + 187969 188191  223  0  1  104   99     9 0.153   0.38
 8.05 Intr + 191898 192047  150  1  0   61   92   108 0.609   7.71
 8.06 Term + 192495 192604  110  1  2   33   49    94 0.547  -2.31
 8.07 PlyA + 192839 192844    6                               1.05

 9.00 Prom + 192879 192918   40                              -7.35
 9.01 Init + 195996 196139  144  2  0   79  103    73 0.698   8.08
 9.02 Intr + 199733 200207  475  1  1   42   64   303 0.127  15.01
 9.03 Term + 207650 207810  161  0  2   61   52   127 0.630   3.52
 9.04 PlyA + 208160 208165    6                               1.05

10.00 Prom + 209549 209588   40                             -10.94
10.01 Init + 210317 210442  126  1  0   82   60   160 0.771  10.82
10.02 Intr + 210822 210925  104  2  2   75   93    76 0.880   4.85
10.03 Intr + 219262 219352   91  1  1   95   83   -17 0.023  -2.42
10.04 Intr + 224420 224556  137  1  2  112   68    45 0.039   3.35
10.05 Intr + 225618 225696   79  0  1   80   74    43 0.049   0.73
10.06 Intr + 233710 233814  105  0  0   95   80    99 0.901   9.29
10.07 Intr + 234111 234216  106  2  1   87   80    32 0.859   1.17
10.08 Intr + 244627 244769  143  2  2  111   59    65 0.822   5.05
10.09 Intr + 249605 249714  110  1  2   83   89    43 0.814   2.06
10.10 Intr + 252599 252689   91  2  1   92  106    77 0.997   8.98
10.11 Intr + 255777 255932  156  2  0   68   89   122 0.998   9.59
10.12 Term + 267161 267577  417  1  0   92   38   366 0.999  26.29
10.13 PlyA + 267602 267607    6                              -0.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  13980  13921   60  0  0   75   94    56 0.894   4.26
S.002 Term - 223826 223665  162  2  0  110   38    96 0.869   3.75
S.003 Term + 256713 256853  141  2  0   78   42   132 0.885   4.55

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:171224262_171492185|GENSCAN_predicted_peptide_1|274_aa
MEQRTDFSFHYQVYPYIWSQKQLRDIKGHFTEGRSVFIDPGLEILAESQGCVEAATQHLS
HRADMLVWSSPELLVTPESCYFGSGFPNLQATDWYQLGTGPHGRRKTSLVPPKVLLILPT
SGTGQDCRSLGIAASVNWNVAFPPYQNGFSQRLLLVYKLWILVLVDLIDESQITFHGPVP
VDVGGLWDQLVIYGIGTRSLVGSMVYFGQKMAKLSGDEVQQKDGSTSEVPSELSEPPKGF
YVETIVTDKEDFVPITEKILNNWKSWTGGPGAEP
>gi568815596r:171224262_171492185|GENSCAN_predicted_CDS_1|825_bp
atggaacagagaactgacttcagtttccactatcaagtgtacccgtatatctggagccaa
aagcagctgagagacattaaagggcattttactgaaggtagatctgtatttattgaccct
ggacttgagattcttgcagagtcccagggctgtgtggaggctgccactcagcacctatct
catagggccgacatgctggtttggtcttcccctgagctgttggtaactcctgaatcttgc
tactttggctcagggttccccaacctgcaggccacagactggtaccagttaggaactggg
ccacatggcaggaggaagacttcacttgtaccacccaaggtcctgctaatactcccaaca
tcaggcactggacaggactgtaggagccttggcattgccgcctctgtaaactggaatgtc
gcttttccaccttaccagaatggattcagccagcgcctgctacttgtatacaagctttgg
attctggtactggtggatctgatagatgaatcccagatcacgtttcacggcccggtacca
gtggatgttggtggtctatgggatcagttggtgatctatgggattggcacccggtctctg
gtgggctcaatggtttactttggccagaaaatggcgaagttgtcaggtgatgaagtacag
caaaaggatggctcaacaagtgaagtacccagtgaactatctgaacccccaaaagggttt
tatgtggaaacaattgtcacagataaggaagattttgttccaattactgaaaagatcctc
aacaactggaaatcatggactggtggccctggtgcagaaccatga
>gi568815596r:171224262_171492185|GENSCAN_predicted_peptide_2|67_aa
MLLSQCPELVLLPAAAEAPAAEHCWDRPSPPPWLLVRCSVDARRDSQCPERAQAEAIIIK
GQTPELS
>gi568815596r:171224262_171492185|GENSCAN_predicted_CDS_2|204_bp
atgctcctgtcccagtgcccagagctggtgctgcttcctgctgctgctgaagccccagct
gcagagcactgttgggatcggccatcaccaccaccttggttgctggtgcgatgttcggtt
gatgcccgtagagactcacagtgccctgaaagagcccaggcagaagctattatcatcaaa
ggccagacccctgaactttcctga
>gi568815596r:171224262_171492185|GENSCAN_predicted_peptide_3|174_aa
MASGGASPKPLQHPHGVKPAGAQKPRTEVWEPLPRFQRMYGHTWMSRQRCAAGAEPSWRT
SARAVQKGNVEWEPPHRVPTGTLPSGAGRRGPPSSRPQNDRSTNSLHCAPGKATDTQHQP
VKTTGREAVPCKATGAELLKTMGTYPFHQDDLNVRYGVKGDHFAALRFDCHIGF
>gi568815596r:171224262_171492185|GENSCAN_predicted_CDS_3|525_bp
atggcttcaggaggtgcaagccccaagcctttgcagcatccacatggtgttaagcctgca
ggtgcacagaagccaagaactgaggtctgggaacctctgcctagatttcagaggatgtat
ggacacacctggatgtccaggcagaggtgtgctgcaggggcagagccctcatggagaacc
tctgctagggcagtgcagaagggaaatgtggagtgggagcccccacacagagtccctact
gggacactgcctagtggagctgggagaagaggaccaccatcctccagaccccagaatgat
agatccaccaacagcttgcactgtgcgcctggaaaagccacagacactcaacaccagccc
gtgaaaacaactgggagagaggctgtaccctgcaaagccacaggggcagagctgctcaag
accatgggaacctaccccttccatcaggatgacttgaatgtgagatatggagttaaagga
gatcatttcgcagctttaagatttgactgccacattggattttag
>gi568815596r:171224262_171492185|GENSCAN_predicted_peptide_4|317_aa
MTGIDKALEKLEPLCIVVKYEREASKYWDTFYKIHKNKFFKDRNWLLREFPEILPVDQKP
EEKARESSWDHVKTSATNRFSRMHCPTVPDEKNHYEKSSGSSEGQSKTESDFSNLDSEKH
KKGPMETGLFPGSNATFRILEVLCSRDKNGRNSPESFLYCCDFASGAVELVKSHSSYRAT
QCFAFVHDVCDDGLPYPFPDGILDVILLVFVLSSIHPDRMQGVVNRLSKLLKPGGMLLFR
DYGRYDKTQLRFKKGEVHSMFCKASLDEKQNLVDRRLQVNRKKQVKMHRVWIQGKFQKPL
HQTQNSSNMVSTLLSQD
>gi568815596r:171224262_171492185|GENSCAN_predicted_CDS_4|954_bp
atgacaggcattgacaaggctttggagaaattggagcccttgtgtattgttgttaagtat
gagagagaagctagtaaatactgggacacattttacaagattcataagaataagtttttc
aaggatcgtaattggctgttgagggaatttcctgaaattcttccagttgatcaaaaacct
gaagagaaggcgagagaatcatcatgggatcatgtaaaaactagtgctacaaatcgtttc
tcaagaatgcactgtcctactgtgcctgatgaaaaaaatcattatgagaaaagttctggt
tcttcagaaggtcaaagcaaaacagaatctgatttttccaacctagactctgaaaaacac
aaaaaaggacctatggagactggattgtttcctggtagcaatgccactttcaggatacta
gaggtattgtgctcaagggacaaaaatgggaggaactctccggagtcctttctgtattgt
tgtgattttgcttctggagctgtggagctcgtaaagtcacactcgtcctacagagcaacc
cagtgttttgcctttgttcatgatgtatgtgatgatggcttaccttacccttttccagat
gggatcctggatgtcattctccttgtctttgtgctctcttctattcatcctgacaggatg
caaggtgttgtaaaccgactgtccaagttactgaaacctgggggaatgctgttatttcga
gactatggaagatatgataagactcagcttcgttttaaaaagggggaagtccacagtatg
ttctgcaaagccagtttagatgaaaagcaaaatctggttgatcgccgcttacaagttaat
aggaaaaaacaagtgaaaatgcaccgagtgtggattcaaggcaaattccagaaaccattg
caccagactcagaatagctccaatatggtatctacactcctttcacaagactga
>gi568815596r:171224262_171492185|GENSCAN_predicted_peptide_5|269_aa
MFFETNENKDTTYQNLWDTFKAVCRGKFIALKAHKRKQERSKIDTLTSQLKELEKQEQTH
SKASRRQEITKIRAELKEIETQKTLQKINESRSWFFEKINKIDRPLARLIKKKREKNQTD
AIKNDKGDITTDPTEIQTTIRVYCKHLYANKLENLEEMDKFLDTYTLSRLNQEEVESLNR
PITGSEIEAIIKSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYE
ASIILIPKPGRDTTKKENFRPISLIPDEH
>gi568815596r:171224262_171492185|GENSCAN_predicted_CDS_5|810_bp
atgttctttgaaaccaatgagaacaaagatacaacataccagaatctctgggacacattt
aaagcagtgtgtagagggaaatttatagcactaaaggcccacaagagaaagcaggaaaga
tctaaaattgacaccctaacatcacaattaaaagaactagagaagcaagagcaaacacat
tcaaaagctagcagaaggcaagaaataactaagatcagagcagaactgaaggagatagag
acacaaaaaacccttcaaaaaatcaatgaatccaggagctggttttttgaaaagatcaac
aaaattgatagaccgctagcaagactaataaagaagaaaagagagaagaatcaaacagat
gcaataaaaaatgataaaggggatatcaccactgatcccacagaaatacaaactaccatc
agagtatactgtaaacacctctatgcaaataaactagaaaatctagaagaaatggataaa
ttcctggacacttataccctctcaagactaaaccaggaagaagttgaatccctgaataga
ccaataacaggttctgaaattgaggcaataattaagagcctaccaaccaaaaaaagtcca
ggaccagatggattcacagccgaattctaccagaggtacaaggaggagctggtaccattc
cttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttatgag
gccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagagaattttaga
ccaatatccctgatccctgatgaacattga
>gi568815596r:171224262_171492185|GENSCAN_predicted_peptide_6|187_aa
MFEPIAKKLKTLKKKLDKWLTRINSVEKSLKDLMELKTMARELCDACTSFSSRFDQVEER
VSVIEDQMNEMKRKEKFREKRVKRNEQSLQVIWDYVKRPNLRLIGVPESDRENGTKLENT
LQDIIQENFPNLARQANIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREK
GQVTYKG
>gi568815596r:171224262_171492185|GENSCAN_predicted_CDS_6|564_bp
atgttcgagcccattgcaaagaaactaaaaaccttgaaaaaaaaattagacaaatggcta
actagaataaacagcgtagagaagtccttaaaagacctgatggagctgaaaaccatggca
cgagaactatgtgacgcatgtacaagcttcagtagccgatttgatcaagtggaagaaagg
gtatcagtgattgaagatcaaatgaatgaaatgaagcgaaaagagaagtttagagaaaaa
agagtaaaaagaaacgaacaaagcctccaagtaatatgggactatgtgaaaagaccaaat
ctacgtctgattggtgtacctgaaagtgacagggagaatggaaccaagttggaaaacact
cttcaggatattatccaggagaacttccccaacctagcaaggcaggccaacattcaaatt
caggaaatacagagaacgccacaaagatactcctcgagaagagcaactccaagacacata
attgtcagattcaccaaggttgaaatgaaagaaaaaatgttaagggcagccagagagaaa
ggtcaggttacctacaaagggtaa
>gi568815596r:171224262_171492185|GENSCAN_predicted_peptide_7|125_aa
MTKKRRNNGHAKKGRGHVHPIRCTNCTRCVPKDKAIKKFVIRNVVEAAAVRDISEVSVFN
AYVLPKLDHMQWSKEEEAAARKKVKENSAVRVLLEEQGSQLRAVAPRARALVTFPPPRVI
ASESS
>gi568815596r:171224262_171492185|GENSCAN_predicted_CDS_7|378_bp
atgacaaagaaaagaaggaacaacggtcatgccaaaaagggccgcggccacgtgcatcct
attcgctgcactaactgtacccgatgcgtgcccaaggacaaggccattaagaaattcgtc
attcgaaacgtagtggaggccgcagcagtcagggacatttctgaagtgagcgtcttcaat
gcctatgtgcttcccaagctggatcacatgcagtggtctaaggaagaagaagcagcagcc
agaaaaaaagtaaaagaaaactcagctgtgcgagtccttctggaagagcaaggtagtcaa
ctgcgtgctgtagctcctagagccagagctttagtgacttttcctcctcctcgtgtcatt
gcttctgagagttcttag
>gi568815596r:171224262_171492185|GENSCAN_predicted_peptide_8|268_aa
MGKDFTTKTPKATATKAKVDKWDLIKLKGFCTAKEISIKVNRQPTEWEKIFAIYPSDKGL
ISRIYKGGFPESIDENHTNMISLVLSKTQTSMYGRRKKAWILDGIIEQHHLHPKAKSLCQ
LQNGCSSPKHHTDMATSSKRKGPSLSVSLFREEETYPQLPSQTCSQISLARIWSQTIPKS
ISVLVNHQTENTDGTSQVCPALLLKLESAPRKARAPSAGHLYTRTRGSVSKGDAGAEARA
LGHIATRAHSPSRLLTPDSMFYLVRYSI
>gi568815596r:171224262_171492185|GENSCAN_predicted_CDS_8|807_bp
atgggcaaagacttcacgacaaaaacaccaaaagcaactgcaacaaaagccaaagttgac
aaatgggatctaattaaactaaagggcttctgcacagccaaagaaattagcatcaaagtg
aacaggcaacctacagaatgggagaaaatttttgcaatctacccatctgacaaagggcta
atatccagaatctacaagggtggattccctgaaagcatagacgagaatcataccaatatg
atctccctggtgctcagtaagacacagacttctatgtacggaagaaggaaaaaagcctgg
attcttgatggcatcattgaacagcatcaccttcatcctaaggctaagtccctctgtcaa
ttacaaaatggctgcagcagtcccaagcatcacacagacatggcaacatccagcaaaaga
aaaggaccatctctttctgtgtctctttttagggaggaggaaacttatccacaacttccc
agccagacttgctctcagatttcactggccagaatttggtcacaaactattcccaaatca
atctctgtacttgttaatcatcagacagaaaacacagatggtaccagtcaggtctgccca
gccttgctgctgaagctggagtcagctcctaggaaagctcgtgcaccctctgctggtcat
ctctacaccaggacaagaggatctgtgtccaaaggagatgcaggagctgaggctcgagca
cttggtcacatagccacaagagcccactctccaagcaggctcctgactccagattctatg
ttctacttagtacgctattccatctga
>gi568815596r:171224262_171492185|GENSCAN_predicted_peptide_9|259_aa
MVLALELVAFDSWYDALPARLFFCLFSREWHGYAERQEEEKEDEADMVPQDLVPCTPAAP
AVAKRGQGTAQAMASEGTSSKPWQLPRVIGPAGAQKSRTEFEEPLPRFQKMYRNAWMSKQ
KFAAWVEPSWRTSAMAVRKGNVGLEPPHRVPTGTLPSGDVRRGPPSSRLQNCRSTDSLHH
VPGKASDTLHQPMKAARTGAVPCKATVWTPEPTVGVTCDMPGPAASGARSQLLCRHLEQP
AGYHTTLHSPAQTPSPARG
>gi568815596r:171224262_171492185|GENSCAN_predicted_CDS_9|780_bp
atggtgctggcactagaacttgtagcttttgactcctggtatgatgctcttcccgctaga
ctattcttttgtcttttcagtagagagtggcatggttatgcagagagacaagaagaagaa
aaagaagatgaggcagacatggtgcctcaggacttggtgccctgcaccccagctgctcca
gctgtggctaaaaggggccaaggtacagcacaggccatggcttcagaggggacaagctcc
aagccttggcagcttccacgtgttattgggcctgcgggtgcacagaagtcaagaactgag
tttgaggaacctctgcctagatttcagaagatgtatagaaatgcctggatgtccaagcag
aagtttgctgcatgggtggagccctcatggagaacctctgctatggcagtgcggaaggga
aatgtgggattggagcccccacacagagtccctactgggacactgcctagtggagatgtg
agaagagggcccccatcctccagactccagaactgtagatccactgacagcttgcaccat
gtgcctggaaaagcctcagacactctacaccagcccatgaaagcagccaggacaggggct
gtaccctgcaaagccacagtctggactccagagcccactgtgggagtcacttgtgacatg
cctggtccagccgcaagcggtgcacggagccagctcctgtgccggcacttggagcaacca
gctggataccacacaactctgcactcaccagctcagacaccctctcctgccaggggctga
>gi568815596r:171224262_171492185|GENSCAN_predicted_peptide_10|554_aa
MGPTRKPNVCSRLSRRALGCFSRDAGVVQRTNLGILRALVCQESTKFKNVWTTHSRSPIA
YERGRIYFDNYRRCVSSVASEPRKLYEMPKCSKSEKIEDALLWECPVGDILPNSSDYKSS
LIALTAHNWLLRISATTGKILEKIYLAPYCKFRYLSWDTPQEVIAVKSAQNRGSAVARQI
FGNVTDATLSHGILIVMYSSGLVRLYSFQTIAEQFMQQKLDLGCACRWGGTTGTVGEAPF
GIPCNIKITDMPPLLFEVSSLENAFQIGGHPWHYIVTPNKKKQKGVFHICALKDNSLAKN
GIQEMDCCSLESDWIYFHPDASGRIIHVGPNQVKVLKLTEIENNSSQHQISEDFVILANR
ENHKTFKIVDYEDELDLLSVVAVTQIDAEGKAHLDFHCNEYGTLLKSIPLVESWDVGITR
VRNTTDAAGIVLKELKRQSSLGVFHLLVAVDGINALWGRTALKREDKCPIAPEELALVHN
LRKMMKNDWHGGAIVLTLNQTGSLFKPRKAYLLQELLVKEGFDALDPFIPILVSNYNPKE
FESRIQYCLENNWL
>gi568815596r:171224262_171492185|GENSCAN_predicted_CDS_10|1665_bp
atgggcccgacccggaagcccaacgtgtgcagccggctgagtcgccgggcgctgggctgc
ttctcgcgcgacgcaggcgtggtgcagaggaccaacctgggcatcctgcgggcgctggtg
tgccaggaaagtactaaatttaagaatgtctggacaactcattccaggtcacctatagcc
tatgagagaggaagaatatattttgacaattatcggcgctgtgtcagcagtgttgcatct
gagccaagaaaactttatgaaatgccaaaatgttccaaatcagaaaaaatagaggatgct
ttattatgggaatgcccagtgggagatatacttcccaattcatcagattataagtcctca
ctcatagcactgactgctcataattggctacttcgtatatcagcaactacgggaaaaatc
cttgagaaaatatatcttgcaccttattgcaaattcagatacttgagctgggacactcct
caagaagtcattgcagttaagtcagctcagaacagaggctcagcagtggcccggcagatt
tttgggaacgttacagatgctaccttgtctcatggaatactgattgtgatgtacagctca
ggactggtcagactctatagcttccaaaccatcgctgaacagttcatgcaacagaaactt
gacttagggtgtgcatgcagatggggtgggactactggaactgtaggagaggctcctttt
ggcattccttgtaatattaaaatcacagacatgccaccactgctctttgaggtgtcatcc
ctggagaatgcttttcagattggaggccatccttggcactacatcgtcacacctaataag
aagaaacagaaaggagttttccatatttgtgccctaaaagacaattccctggcaaaaaat
gggatccaagaaatggattgttgttctctagaatctgactggatctatttccatcctgat
gcttctggtagaataatacatgttggtccaaatcaagtcaaagttttgaagctaactgaa
atagaaaataatagttctcagcatcagatctctgaagattttgtcattttggccaacagg
gagaaccataaaactttcaaaattgtggactatgaagatgagttagatttgctttctgtg
gtagctgttactcaaatagatgctgaaggaaaagctcacctggatttccactgtaatgaa
tatggaactttacttaaaagcattccactagtggagtcatgggatgtgggcataacacgg
gtgaggaacaccacagatgcagctggaattgtgctgaaagagctaaagaggcaaagttct
ttgggtgtttttcacctgctggtggccgtggatggaatcaatgctctttggggaaggact
gctctgaaaagagaagataaatgcccgattgccccagaggaattagcacttgttcacaac
ctgaggaaaatgatgaaaaatgattggcatggaggtgccattgtgttgactttgaaccag
actgggtctctctttaagccccggaaagcctatctgctccaggagttgctggtaaaggaa
ggatttgatgccctggatccctttattcccatcctggtttccaactataacccaaaggaa
tttgaaagtcgtattcagtattgtttggaaaacaattggctttaa