GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:28:43 Sequence gi568815590r:42653182_42868430 : 215249 bp : 44.39% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6582 6729 148 0 1 91 61 48 0.151 2.64 1.02 Intr + 25360 25425 66 0 0 105 66 44 0.040 3.00 1.03 Intr + 33627 33786 160 2 1 -4 86 161 0.061 6.26 1.04 Term + 38134 38432 299 2 2 94 49 164 0.864 8.53 1.05 PlyA + 39750 39755 6 1.05 2.00 Prom + 41628 41667 40 -4.56 2.01 Init + 44366 44417 52 1 1 71 115 45 0.476 6.47 2.02 Intr + 55536 55687 152 1 2 77 89 47 0.440 3.58 2.03 Intr + 77413 77522 110 0 2 43 106 56 0.182 1.98 2.04 Intr + 78486 79368 883 0 1 49 109 688 0.925 58.33 2.05 Term + 83303 83437 135 1 0 117 49 33 0.895 0.32 2.06 PlyA + 85349 85354 6 1.05 3.00 Prom + 87391 87430 40 -2.56 3.01 Init + 91167 91323 157 2 1 75 79 38 0.574 1.67 3.02 Intr + 92627 92794 168 0 0 50 68 100 0.379 4.12 3.03 Term + 100589 100743 155 0 2 87 34 55 0.124 -1.92 3.04 PlyA + 101002 101007 6 -0.45 4.07 PlyA - 101836 101831 6 1.05 4.06 Term - 102619 102566 54 1 0 77 54 50 0.421 -1.94 4.05 Intr - 103643 102665 979 1 1 72 84 472 0.530 36.14 4.04 Intr - 103856 103747 110 2 2 60 27 101 0.609 0.28 4.03 Intr - 108653 108375 279 2 0 2 53 236 0.599 9.17 4.02 Intr - 112023 111884 140 1 2 122 89 143 0.951 17.98 4.01 Init - 115249 115171 79 1 1 72 100 43 0.918 3.12 4.00 Prom - 117825 117786 40 -4.86 5.07 PlyA - 119648 119643 6 1.05 5.06 Term - 119678 119658 21 2 0 104 49 4 0.008 -3.59 5.05 Intr - 134670 134484 187 2 1 62 55 110 0.010 4.79 5.04 Intr - 161320 161240 81 0 0 100 89 40 0.417 4.05 5.03 Intr - 161557 161474 84 0 0 92 33 109 0.389 4.74 5.02 Intr - 163555 163488 68 1 2 45 97 51 0.365 -0.60 5.01 Init - 170131 170057 75 1 0 52 108 47 0.477 4.19 5.00 Prom - 176711 176672 40 -5.36 6.06 PlyA - 177634 177629 6 1.05 6.05 Term - 185089 184781 309 1 0 121 42 275 0.515 21.26 6.04 Intr - 186200 186005 196 1 1 69 85 8 0.775 -1.98 6.03 Intr - 189939 189843 97 1 1 47 61 168 0.064 9.07 6.02 Intr - 190598 190574 25 2 1 77 95 36 0.053 0.80 6.01 Intr - 197225 197097 129 2 0 64 98 55 0.325 4.99 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 25360 25452 93 0 0 105 49 89 0.857 4.53 S.002 Sngl + 32193 32456 264 2 0 69 43 183 0.887 7.10 S.003 Term - 72629 72363 267 2 0 10 41 253 0.858 8.29 S.004 Init - 189913 189843 71 1 2 80 61 178 0.916 14.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:42653182_42868430|GENSCAN_predicted_peptide_1|224_aa XCLPDFSKATGPLLQDFSHGAAEEGDLWGDGTVCSGHQVASRFWGDSARRDQIQHAGDYS PIPRRYLTVVQKRIKYVGIQIKRDVKDLFKENYKPLLNEIKEDTNKHSMLMDRKNQYCEN GHTAQALGVHLLWTLGWLTGARSPIHPQPVWPIRTQSCWDKASSNPKPIVVEKQTLVPGQ SEQDTSWSSDQAAALRNGGGGEDKVWTLLSVRQDEGKPQKRNND >gi568815590r:42653182_42868430|GENSCAN_predicted_CDS_1|675_bp nnctgcttgcctgactttagcaaagccactggcccccttctccaggacttttctcatgga gctgctgaggaaggtgacctctggggtgatggtactgtgtgctcaggccatcaggtggct tctcgcttctggggtgacagtgctagaagggatcaaatccagcatgcaggtgactacagt cccatcccccggcgttatctcacagtggtacagaagagaataaaatacgtaggaatccaa attaaaagggatgtgaaggacctcttcaaggagaactacaaaccgctgctcaacgaaata aaggaggacacaaacaaacattccatgctcatggataggaagaatcaatattgtgaaaat ggccatactgcccaagctctgggcgtccatctgctgtggacactggggtggctgaccggg gccagaagcccaatacacccgcagcctgtttggcccatccgcacccagagttgctgggac aaagcttcttcaaacccaaagcctatcgtggtggaaaagcagactttagtgcctggccag tctgagcaggacacttcctggtcctccgaccaggctgcagccctgaggaacggtggcggc ggcgaggacaaagtgtggacgctcctgtcagtcagacaagatgaggggaagcctcaaaaa agaaacaacgactga >gi568815590r:42653182_42868430|GENSCAN_predicted_peptide_2|443_aa MLPDFMLVLIVLGIPSSATTGFNSIAENEDALLRHLFQGYQKWVRPVLHSNDTIKVYFGL KISQLVDVEWTDHKLRWNPDDYGGIHSIKVPSESLWLPDIVLFENADGRFEGSLMTKVIV KSNGTVVWTPPASYKSSCTMDVTFFPFDRQNCSMKFGSWTYDGTMVDLILINENVDRKDF FDNGEWEILNAKGMKGNRRDGVYSYPFITYSFVLRRLPLFYTLFLIIPCLGLSFLTVLVF YLPSDEGEKLSLSTSVLVSLTVFLLVIEEIIPSSSKVIPLIGEYLLFIMIFVTLSIIVTV FVINVHHRSSSTYHPMAPWVKRLFLQKLPKLLCMKDHVDRYSSPEKEESQPVVKGKVLEK KKQKQLSDGEKVLVAFLEKAADSIRYISRHVKKEHFISQVVQDWKFVAQVLDRIFLWLFL IVSVTGSVLIFTPALKMWLHSYH >gi568815590r:42653182_42868430|GENSCAN_predicted_CDS_2|1332_bp atgctcccagattttatgctggttctcatcgtccttggcatcccttcctcagccaccaca ggtttcaactcaatcgccgaaaatgaagatgccctcctcagacatttgttccaaggttat cagaaatgggtccgccctgtattacattctaatgacaccataaaagtatattttggattg aaaatatcccagcttgtagatgtggaatggacagaccacaagttacgctggaatcctgat gattatggtgggatccattccattaaagttccatcagaatctctgtggcttcctgacata gttctctttgaaaatgctgacggccgcttcgaaggctccctgatgaccaaggtcatcgtg aaatcaaacggaactgttgtctggacccctcccgccagctacaaaagctcctgcaccatg gacgtcacgtttttcccgttcgaccgacagaactgctccatgaagtttggatcctggact tatgatggcaccatggttgacctcattttgatcaatgaaaatgtcgacagaaaagacttc ttcgataacggagaatgggaaatactgaacgcaaaggggatgaaggggaacagaagggac ggcgtgtactcctatccctttatcacgtattccttcgtcctgagacgcctgcctttattc tataccctctttctcatcatcccctgcctggggctgtctttcctaacagttcttgtgttc tatttaccttcggatgaaggagaaaaactttcattatccacatcggtcttggtttctctg acagttttccttttagtgattgaagaaatcatcccatcgtcttccaaagtcattcctctc attggagagtacctgctgttcatcatgatttttgtgaccctgtccatcattgttaccgtg tttgtcattaacgttcaccacagatcttcttccacgtaccaccccatggccccctgggtt aagaggctctttctgcagaaacttccaaaattactttgcatgaaagatcatgtggatcgc tactcatccccagagaaagaggagagtcaaccagtagtgaaaggcaaagtcctcgaaaaa aagaaacagaaacagcttagtgatggagaaaaagttctagttgcttttttggaaaaagct gctgattccattagatacatttcgagacatgtgaagaaagaacattttatcagccaggta gtacaagactggaaatttgtagctcaagttcttgaccgaatcttcctgtggctctttctg atagtgtcagtaacaggctcggttctgatttttacccctgctttgaagatgtggctacat agttaccattag >gi568815590r:42653182_42868430|GENSCAN_predicted_peptide_3|159_aa MASPVENNWYLNVFRMLYKNTIDWVASKLQKFIADSSGDWEAQGKVLAGLVCGAVFFPDP PSSGLINLDDQDTPQSFILVIILSLKNQMIKHLRPHPAHGFEGSRMELEGRCTASVSASD SRVCGKALTPWVTRDFPNCKKRSREATAQRARLILCNGG >gi568815590r:42653182_42868430|GENSCAN_predicted_CDS_3|480_bp atggcatctccagttgagaataactggtatctaaatgtattcaggatgctatataaaaat accatagattgggtggcttctaaactgcagaaatttattgctgacagttctggagactgg gaagcccaaggaaaggtgctggcaggtctggtgtgtggtgcggtgtttttcccagacccg ccctcaagtggcttaataaacctcgatgatcaagacacgcctcagtccttcattttggtc atcattctttctctgaaaaatcagatgatcaagcacctgaggccacatccagcacatggc tttgagggctcccgaatggagctggagggcagatgcacagcctctgtgagtgcctcagac agcagagtctgcggcaaagccctgactccgtgggtcaccagggactttcccaactgcaag aaacgctccagagaagcaactgctcagagagctagactcattctttgcaacggtgggtag >gi568815590r:42653182_42868430|GENSCAN_predicted_peptide_4|546_aa MLTSKGQGFLHGGLCLWLCVFTPFFKGCVGCATEERLFHKLFSHYNQFIRPVENVSDPVT VHFEVAITQLANVPLMQLLCQLIDFSKKEVQGDGTRNGVDAVGLWTPPAEMSDSSAMPIG GIWNWMEREYDLPDRSCSIGSADISEGVHIPMGHVHTTWSFSEPAEIWNDYKLRWDPMEY DGIETLRVPADKIWKPDIVLYNNAVGDFQVEGKTKALLKYNGMITWTPPAIFKSSCPMDI TFFPFDHQNCSLKFGSWTYDKAEIDLLIIGSKVDMNDFWENSEWEIIDASGYKHDIKYNC CEEIYTDITYSFYIRRLPMFYTINLIIPCLFISFLTVLVFYLPSDCGEKVTLCISVLLSL TVFLLVITETIPSTSLVVPLVGEYLLFTMIFVTLSIVVTVFVLNIHYRTPTTHTMPRWVK TVFLKLLPQVLLMRWPLDKTRGTGSDAVPRGLARRPAKGKLASHGEPRHLKECFHCHKSN ELATSKRRLSHQPLQWVVENSEHSPEVEDVINSVQFIAENMKSHNETKEPMGTLDAQSDL QCDFCS >gi568815590r:42653182_42868430|GENSCAN_predicted_CDS_4|1641_bp atgctgaccagcaaggggcagggattccttcatgggggcttgtgtctctggctgtgtgtg ttcacacctttctttaaaggctgtgtgggctgtgcaactgaggagaggctcttccacaaa ctgttttctcattacaaccagttcatcaggcctgtggaaaacgtttccgaccctgtcacg gtacactttgaagtggccatcacccagctggccaacgtgcctttgatgcagctgctgtgt caattaattgatttttccaagaaggaggtacagggtgatggcaccaggaatggagttgat gctgtgggtctttggactcctccggcagagatgtctgatagcagtgccatgcccattggt gggatctggaattggatggagagagagtatgacttgccagacaggtcctgctccataggg tcagctgacatctctgaaggtgtccacatccccatgggccatgtccataccacctggagc ttctccgagcctgcggaaatctggaatgattataaattgcgctgggatccaatggaatat gatggcattgagactcttcgcgttcctgcagataagatttggaagcccgacattgttctc tataacaatgctgttggtgacttccaagtagaaggcaaaacaaaagctcttcttaaatac aatggcatgataacctggactccaccagctatttttaagagttcctgccctatggatatc acctttttcccttttgatcatcaaaactgttccctaaaatttggttcctggacgtatgac aaagctgaaattgatcttctaatcattggatcaaaagtggatatgaatgatttttgggaa aacagtgaatgggaaatcattgatgcctctggctacaaacatgacatcaaatacaactgt tgtgaagagatatacacagatataacctattctttctacattagaagattgccgatgttt tacacgattaatctgatcatcccttgtctctttatttcatttctaaccgtgttggtcttt taccttccttcggactgtggtgaaaaagtgacgctttgtatttcagtcctgctttctctg actgtgtttttgctggtcatcacagaaaccatcccatccacatctctggtggtcccactg gtgggtgagtacctgctgttcaccatgatctttgtcacactgtccatcgtggtgactgtg tttgtgttgaacatacactaccgcaccccaaccacgcacacaatgcccaggtgggtgaag acagttttcctgaagctgctgccccaggtcctgctgatgaggtggcctctggacaagaca aggggcacaggctctgatgcagtgcccagaggccttgccaggaggcctgccaaaggcaag cttgcaagccatggggaacccagacatcttaaagaatgcttccattgtcacaaatcaaat gagcttgccacaagcaagagaagattaagtcatcagccattacagtgggtggtggaaaat tcggagcactcgcctgaagttgaagatgtgattaacagtgttcagttcatagcagaaaac atgaagagccacaatgaaaccaaggagcctatgggcactctggatgcacagtcagacctt cagtgtgacttttgttcctga >gi568815590r:42653182_42868430|GENSCAN_predicted_peptide_5|171_aa MHRRTSLSNGSSINMIGPKHVLKAQRDLDKCFRLEEAHDTSPWVKICKAVCSGGITPSQR ECELKCTGSLDRAGAELFALQVWRLPASDSYFGTGAHGLTSTCGVRGLAGSGVKLLTLTV SVTALKAGRLELFILPGGFVVSLASGVKLQTFVVSVTAHKGSVDPKTSKLK >gi568815590r:42653182_42868430|GENSCAN_predicted_CDS_5|516_bp atgcatagaagaacttcattatcaaatggaagtagtataaacatgattgggcccaagcac gtcctaaaggcacaaagagatctagacaaatgcttccgattagaagaggctcatgatact tctccgtgggtcaagatttgtaaagcggtgtgctcgggcggcatcacacccagccagcgc gagtgtgaactcaagtgcacgggcagccttgaccgggctggagctgagctgtttgcactg caggtttggcgcctgcctgcatctgattcctactttgggacaggagcccatgggctgacg agcacgtgtggggttcgtggtctcgctggctcaggagtgaagctgctgaccctcactgtg agtgttacagctcttaaggcggggcgtctggagttgttcattcttcctggtgggttcgtg gtctcactggcttcaggagtgaagctacagaccttcgtggtgagtgttacagctcataaa ggcagtgtggacccaaaaacctccaaattgaaatga >gi568815590r:42653182_42868430|GENSCAN_predicted_peptide_6|251_aa DHSDLPGSLPGTKCLAPYMQRSPMIISSVTLLESLLSLSHRVKKLVIPEDTEVKCLRTGR MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP DCFKRECNNKLLKENAVPTIFLCTEPHDKVDAAIGLLMPPLQTPVNLSVFCDHNYTVEDT MHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQLEKLKEVVHFQKEKDDVSERGYVILP NDYFEIVEVPA >gi568815590r:42653182_42868430|GENSCAN_predicted_CDS_6|756_bp gatcacagcgaccttccaggcagtctcccaggcaccaagtgcctggctccctacatgcaa cgcagcccgatgatcatctcaagtgtgactctgctggaaagcctgctctccctttcccac agagtaaagaaacttgtgatacctgaagacacagaagtgaagtgcctgaggaccggaagg atggtgcagtcctgctccgcctacggctgcaagaaccgctacgacaaggacaagcccgtt tctttccacaagtttcctcttactcgacccagtctttgtaaagaatgggaggcagctgtc agaagaaaaaactttaaacccaccaagtatagcagtatttgttcagagcactttactcca gactgctttaagagagagtgcaacaacaagttactgaaagagaatgctgtgcccacaata tttctttgtactgagccacatgacaaggttgatgctgctattggattactaatgccgcct cttcagacccctgttaatctctcagttttctgtgaccacaactatactgtggaggataca atgcaccagcggaaaaggattcatcagctagaacagcaagttgaaaaactcagaaagaag ctcaagaccgcacagcagcgatgcagaaggcaagaacggcagcttgaaaaattaaaggag gttgttcacttccagaaagagaaagacgacgtatcagaaagaggttatgtgattctacca aatgactactttgaaatagttgaagtaccagcataa