GENSCAN 1.0	Date run: 4-Nov-116	Time: 23:03:07

Sequence gi568815592r:89229682_89443757 : 214076 bp : 43.02% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.14 PlyA -    289    284    6                                1.05
 1.13 Term -   4949   4689  261  2  0   78   36   194 0.687   8.73
 1.12 Intr -  20617  20479  139  0  1  107   50    39 0.106   2.47
 1.11 Intr -  28300  28050  251  1  2   76  -17   162 0.141   0.74
 1.10 Intr -  34927  34731  197  2  2   24  100   531 0.948  47.03
 1.09 Intr -  35479  35405   75  2  0   55  100    34 0.631   0.79
 1.08 Intr -  36084  35932  153  1  0   87   33   117 0.983   6.14
 1.07 Intr -  38138  37998  141  0  0   83   30   153 0.997   9.32
 1.06 Intr -  38415  38333   83  2  2   31   87    80 0.853   1.48
 1.05 Intr -  39553  39330  224  1  2   77   55   348 0.858  27.33
 1.04 Intr -  42041  41974   68  0  2  110   68   148 0.956  13.62
 1.03 Intr -  49417  49246  172  1  1   62   77    63 0.773   2.12
 1.02 Intr -  51582  51463  120  0  0   63   80    85 0.886   5.89
 1.01 Init -  52365  52312   54  0  0   88   71    14 0.672   0.98
 1.00 Prom -  52931  52892   40                               -7.96

 2.00 Prom +  53868  53907   40                               -3.66
 2.01 Init +  58692  58835  144  2  0   67   44   107 0.544   4.33
 2.02 Intr +  60983  61192  210  1  0   22   93    85 0.619   1.51
 2.03 Intr +  72147  72274  128  2  2  -15   44   264 0.951  11.18
 2.04 Intr +  72357  72485  129  0  0   68    6   164 0.721   6.01
 2.05 Intr +  72528  72618   91  0  1  -37   33   174 0.662  -0.60
 2.06 Intr +  72959  73270  312  1  0   53   48   553 0.825  44.18
 2.07 Term +  73284  73433  150  2  0  -28   48   358 0.968  18.11
 2.08 PlyA +  73714  73719    6                                1.05

 3.08 PlyA -  77830  77825    6                                1.05
 3.07 Term -  79368  79337   32  1  2  139   41     8 0.262  -0.78
 3.06 Intr - 100221 100102  120  1  0   38   64    80 0.087   1.07
 3.05 Intr - 108862 108778   85  1  1   87   86    58 0.591   4.89
 3.04 Intr - 112774 112643  132  1  0   70  107    36 0.897   4.54
 3.03 Intr - 114075 114002   74  1  2   54   88    41 0.704  -0.17
 3.02 Intr - 122595 122548   48  1  0   95   60    38 0.313   0.35
 3.01 Init - 122888 122858   31  2  1  113   88    65 0.749   8.90
 3.00 Prom - 135742 135703   40                               -4.36

 4.08 PlyA - 138215 138210    6                                1.05
 4.07 Term - 138526 138375  152  2  2  119   43   104 0.977   7.07
 4.06 Intr - 142904 142756  149  1  2   61   88    83 0.998   5.48
 4.05 Intr - 148132 147990  143  1  2   75  106    62 0.998   5.85
 4.04 Intr - 149657 149543  115  1  1   88   89    50 0.984   5.55
 4.03 Intr - 150686 150487  200  2  2   68   94   151 0.987  11.85
 4.02 Intr - 157909 157614  296  2  2   73  103   275 0.989  24.23
 4.01 Init - 182312 182165  148  2  1   93   83   348 0.999  35.05
 4.00 Prom - 207248 207209   40                               -2.16

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 159388 159379   10  1  1   50  105     5 0.854  -0.80
S.002 Term + 176703 176830  128  1  2   84   42   168 0.959  10.24

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:89229682_89443757|GENSCAN_predicted_peptide_1|645_aa
MEGGGWTDELKLSDCPHQSQLLLSTDFKLLEEGTAAYASPYCELQSAQGLVWWTQSGQPM
HRAWRFLGLIIHFLVLVILGTGPYCEPQQTEAPQPLESWLGKPGVGNSGARFLSPGPAIP
VGVDVQVESLDSISEVDMDFTMTLYLRHYWKDERLAFSSASNKSMTFDGRLVKKIWVPDV
FFVHSKRSFTHDTTTDNIMLRVFPDGHVLYSMRITVTAMCNMDFSHFPLDSQTCSLELES
YAYTDEDLMLYWKNGDESLKTDEKISLSQFLIQKFHTTSRLAFYSSTGWYNRLYINFTLR
RHIFFFLLQTYFPATLMVMLSWVSFWIDRRAVPARVSLDPEFQSESLQHVIVNLDNIRLG
FEKGITTVLTMTTIITGVNASMPRVSYVKAVDIYLWVSFVFVFLSVLEYAAVNYLTTVQE
RKERKLREKFPCMCGMLHSKTMMLDGSYSESEANSLAGYPRSHILTEEERQDKIVVHLGL
SGEANAARKKGLLKGQTGFRIFQNTHAIDKYSRKHRCSTSARKLAAAASTSPTLSWCLLY
HGIGEAVEWHLATDLSHPPGRAWDLQPAMPEPLQPSVGSCVARASPTSTAPCSTAPSPIN
HPRAEEYGHTVQDWQAAPPAALVQDPLGEASWAPESGGEVENLYV

>gi568815592r:89229682_89443757|GENSCAN_predicted_CDS_1|1938_bp
atggagggaggtggctggacggatgagttgaagctgagtgactgcccccaccagagtcag
ttgctcctctccacggattttaagctcctggaggaaggaactgcggcctatgcatctcca
tattgcgaattgcagtctgctcagggcttggtctggtggacacagtcaggacagcccatg
cacagggcatggcggtttctcgggcttatcatccacttcttggtattggtcattcttggt
actggcccatactgtgagccacagcaaacagaggcccctcaaccactggagagctggctg
gggaagccaggagtaggaaattcaggggcccgcttcctgagtccaggccctgccatcccg
gtgggcgtggacgtacaggtggagagcctggacagcatctccgaggtggacatggacttc
actatgaccctgtacctgcggcattactggaaggatgagaggctagctttctccagcgcc
agcaacaagagcatgaccttcgatggccggctggtgaagaagatctgggtccctgatgtc
ttctttgttcactccaaaagatcgttcactcatgacaccaccactgacaacatcatgctg
agggtgttcccagatggacacgtgctgtacagcatgaggattacggtcactgccatgtgc
aacatggacttcagccactttcccctggactcccagacctgttctttggagctggagagc
tatgcctatacagatgaagatctaatgctgtactggaagaatggggatgaatccctaaaa
acagatgagaagatctccttgtctcagtttctgattcagaaatttcacacaacttccagg
ctggccttctacagcagcactggctggtacaaccgtctgtacattaacttcacgttgcgt
cgccacatcttcttcttcttgctccaaacatatttccctgccactctgatggtcatgctg
tcctgggtgtccttctggatcgaccgcagagctgtgcctgccagagtttcactggaccct
gaattccagagtgaaagtttacaacatgtcattgtaaatttagacaacattaggttgggc
tttgagaaaggtatcacgacggtgctgaccatgaccaccatcatcacgggcgtgaatgcc
tccatgccgcgcgtctcctacgtcaaggccgtggacatctacctctgggtcagctttgtg
ttcgtgttcctctcggtgctggagtatgcggctgtcaactacctgaccaccgtgcaggag
cgcaaggaacggaagctgcgggagaagttcccgtgcatgtgtggaatgcttcattcaaaa
accatgatgctggatggaagctacagtgagtctgaggccaacagcctggctgggtacccc
agaagccatatcctgacagaagaagaaaggcaagacaaaatagtggtccacctgggcctg
agtggtgaagccaacgctgccagaaagaaggggcttctgaagggccagacgggttttcgt
atcttccagaatacccatgccattgacaaatactctagaaagcaccgctgttccacctcc
gctaggaagctggcagctgctgcctccacatctccaactctttcctggtgtctgctctac
catggcattggagaggctgtagagtggcacctagccacagacctttctcatcctcctggc
agggcttgggacctgcagcccgccatgcctgagcctctccagccctccgtgggctcctgt
gtggcccgagcctccccaacgagcaccgccccctgctccacggcgcccagtcccatcaac
cacccaagggctgaggagtatgggcacacggtgcaggactggcaggcagctccacctgca
gccctggtgcaggatccactgggtgaagccagctgggctcctgagtctggtggggaggtg
gagaacctttatgtctag

>gi568815592r:89229682_89443757|GENSCAN_predicted_peptide_2|387_aa
MPGCLRSPSFGVFSPGTRHKHENTFECFHKHENTHQLPPSDYNCMRAPVLALHLLLGSAN
ESASRYQKAGGIGREEEDEGNGFSCIALGPATGCILLLKATAPIGRLFPRATALFQAQIG
AKFWEVISDEHGIDPSGNYVGNSDLELEQISIYYNEASSHKPDNLIFGQSGAGNNWARGH
YTEGAELVDSLLDVRKKCENCDGLSGMGTLLISKMHEEYPNCIMNTFSVVPSPKMFDAKN
MMAARDRHHGCYLAVATVFRGCLSMKEVDEQMLSIQSKNSSYFVEWIPNNMKVDVCDIPP
PSLKMSSTLISNSTGIQELFKHLRAVHGHVPAQGLPTLGMDEMEITEAKSNMNDLVSEYQ
QYQDSMAQEEGEMFADEEEELEAQGPK

>gi568815592r:89229682_89443757|GENSCAN_predicted_CDS_2|1164_bp
atgccaggatgcctgaggagccccagctttggggtcttcagcccaggcaccagacacaag
catgaaaacaccttcgagtgttttcacaagcatgaaaacacccatcagctgccaccatct
gactacaactgcatgagagccccagtccttgccctccacctcctgcttggttctgccaat
gaaagtgccagtcggtatcagaaggcaggaggtatcggaagggaggaggaggatgagggg
aatggcttctcctgcatcgctcttggcccagctacgggctgcatccttcttttgaaggcc
acagctcccatcgggcggctttttccaagagctacagctcttttccaggctcagatcggg
gccaagttctgggaagtcatcagtgatgagcatggcatagaccccagcggcaactacgtg
gggaactcggacttggagctggagcagatcagcatctactacaacgaggcctcttctcat
aagcctgacaatttaatctttggtcagagtggggccggcaataactgggccaggggtcac
tacacggagggtgcggagctggtggattccctcctggatgtgcggaagaagtgtgagaac
tgcgacggtctctcgggtatgggcacgctgctcatcagcaagatgcatgaggagtatccc
aactgcatcatgaacaccttcagcgtagtgccctcgcccaagatgtttgatgccaagaac
atgatggctgcccgcgaccggcaccacggctgctacctggcagtggccaccgtgttccgg
ggctgcctgtccatgaaggaggtggacgagcagatgctgtccatccagagcaagaacagc
agctacttcgtggagtggatccccaacaacatgaaggtggacgtgtgtgacatcccaccc
cccagcctcaagatgtcttccaccctcatcagcaacagcacgggcatccaggagctgttc
aagcatctcagagcagttcacggacatgttccagcacaaggccttcctacactgggcatg
gacgagatggagatcaccgaggccaagagcaacatgaatgacctggtgtccgagtaccag
cagtaccaggactccatggcccaggaggagggtgagatgttcgcagatgaggaggaggaa
ttggaggcccagggccccaagtga

>gi568815592r:89229682_89443757|GENSCAN_predicted_peptide_3|173_aa
METRYNLKSPELGQKLIPENRCKGDVAVKRLMKEAAELKDPTDHYHAQPLEDNLFEWHFT
VRGPPDSDFDGGVYHGRIVLPPEYPMKPPSIILLTANGRFEVGKKICLSISGHHPETWQP
SWSTKNTSMSPRQRRAQQQSQRRLSTSPDVIQGHQPRDNHTDHGLRRITSHLI

>gi568815592r:89229682_89443757|GENSCAN_predicted_CDS_3|522_bp
atggagacccgctacaacctgaagagtccggagctgggtcagaaactcatccctgagaac
aggtgtaaaggtgatgtcgctgttaaacgtttaatgaaagaagcggcagaattgaaagat
ccaacagatcattaccatgcgcagcctttagaggataacctttttgaatggcacttcacg
gttagagggcccccagactccgattttgatggaggagtttatcacgggcggatagtactg
ccaccagagtatcccatgaaaccaccaagcattattctcctaacggctaatggtcgattt
gaagtgggcaagaaaatctgtttgagcatctcaggccatcatcctgaaacttggcagcct
tcgtggagtactaagaatacctccatgagccctcgacagcgccgggcccagcagcagagt
cagagaaggttgtctacttcaccagatgtaatccagggccaccagccaagagacaaccac
actgatcatggactcagaagaatcaccagtcacctcatctaa

>gi568815592r:89229682_89443757|GENSCAN_predicted_peptide_4|400_aa
MSQVLGKPQPQDEDDAEEEEEEDELVGLADYGDGPDSSDADPDSGTEEGVLDFSDPFSTE
VKPRILLMGLRRSGKSSIQKVVFHKMSPNETLFLESTNKICREDVSNSSFVNFQIWDFPG
QIDFFDPTFDYEMIFRGTGALIFVIDSQDDYMEALARLHLTVTRAYKVNTDINFEVFIHK
VDGLSDDHKIETQRDIHQRANDDLADAGLEKIHLSFYLTSIYDHSIFEAFSKVVQKLIPQ
LPTLENLLNIFISNSGIEKAFLFDVVSKIYIATDSTPVDMQTYELCCDMIDVVIDISCIY
GLKEDGAGTPYDKESTAIIKLNNTTVLYLKEVTKFLALVCFVREESFERKGLIDYNFHCF
RKAIHEVFEVRMKVVKSRKVQNRLQKKKRATPNGTPRVLL

>gi568815592r:89229682_89443757|GENSCAN_predicted_CDS_4|1203_bp
atgagccaggtgctggggaagccgcagccgcaggacgaggacgacgcggaggaggaggag
gaggaggatgagctggtggggctagcggactacggagacgggcccgactcctccgacgcc
gatccggacagcggcacagaggagggagttctggacttcagtgaccccttcagcactgaa
gtgaagccgagaatcctgctcatgggcctgaggagaagcggcaagtcgtctattcagaaa
gttgtctttcacaaaatgtctcccaacgaaactctgttcttggagagcactaataagata
tgccgggaagatgtttccaacagctcctttgtcaattttcagatttgggacttcccagga
cagattgacttttttgaccctacatttgactatgagatgatcttccggggaacaggagca
ctgatatttgtcattgactcacaggatgattacatggaagccctggccaggctccacctc
acggtgaccagggcctacaaagtgaatactgacatcaacttcgaggtgtttattcataaa
gtggatggtctgtcagatgaccacaaaattgaaacccaaagagatattcaccagagggca
aacgatgaccttgcagatgctggattagaaaaaattcacctcagcttttatctgacaagc
atatatgatcattcaatatttgaagcttttagcaaagttgttcagaaactgattccacaa
ctcccaactctggagaatttgctgaacatctttatctcaaattctggaattgaaaaggca
tttctatttgatgtggtcagtaaaatttatattgcaactgatagtactccggtggatatg
caaacctatgagctctgctgtgatatgatagatgtggttattgacatctcttgtatttat
ggtctcaaagaagatggagcaggaaccccctatgacaaggaatccacagccatcataaag
cttaataatacaaccgtgctttatttaaaagaggtgacaaagttcctggctctcgtttgc
tttgtcagagaggaaagctttgaaagaaaagggctaattgactataattttcattgcttc
cggaaggccattcatgaagtttttgaggtgagaatgaaagtagtaaaatctcgaaaggtt
cagaatcggctgcagaagaaaaagagagccacccctaatgggacccctagagtgctgctg
tag