GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:23:49 Sequence gi568815581r:46621585_46822167 : 200583 bp : 43.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2660 2745 86 1 2 76 81 39 0.255 0.62 1.02 Intr + 5029 5128 100 1 1 81 93 22 0.600 2.11 1.03 Intr + 8827 8866 40 0 1 124 100 -21 0.761 0.50 1.04 Intr + 15792 15958 167 1 2 99 68 239 0.860 22.68 1.05 Intr + 18470 18577 108 1 0 79 65 108 0.996 8.08 1.06 Intr + 19059 19134 76 2 1 84 91 43 0.999 3.29 1.07 Intr + 21520 21675 156 2 0 33 106 152 0.993 11.48 1.08 Intr + 52830 53029 200 1 2 126 57 188 0.495 18.57 1.09 Intr + 71319 71484 166 2 1 86 76 255 0.992 23.63 1.10 Intr + 72263 72337 75 0 0 63 82 41 0.583 0.49 1.11 Intr + 72891 73078 188 1 2 37 77 139 0.702 7.11 1.12 Intr + 83175 83270 96 2 0 80 81 101 0.921 8.81 1.13 Intr + 89379 89535 157 2 1 82 100 223 0.997 22.48 1.14 Intr + 92269 92402 134 2 2 96 64 142 0.852 12.96 1.15 Intr + 93695 93740 46 1 1 86 63 -16 0.685 -6.32 1.16 Term + 96189 96496 308 1 2 -6 45 372 0.580 18.78 1.17 PlyA + 98870 98875 6 1.05 2.02 PlyA - 99984 99979 6 1.05 2.01 Sngl - 100549 99998 552 1 0 68 41 521 0.997 39.52 2.00 Prom - 106277 106238 40 -6.26 3.00 Prom + 109206 109245 40 -6.06 3.01 Init + 109301 109377 77 1 2 70 99 55 0.292 5.36 3.02 Intr + 117073 117107 35 1 2 105 89 15 0.169 1.17 3.03 Intr + 120839 120930 92 0 2 32 92 53 0.178 -0.19 3.04 Intr + 128189 128323 135 1 0 58 83 206 0.984 17.86 3.05 Intr + 129919 130032 114 0 0 71 110 118 0.998 12.94 3.06 Term + 132108 132251 144 2 0 110 38 61 0.944 1.21 3.07 PlyA + 132408 132413 6 1.05 4.12 PlyA - 133892 133887 6 1.05 4.11 Term - 147215 146736 480 2 0 92 49 758 0.993 67.10 4.10 Intr - 148464 148199 266 1 2 94 101 408 0.999 39.93 4.09 Intr - 152325 152084 242 2 2 115 79 394 0.875 38.39 4.08 Intr - 160911 160833 79 1 1 127 49 27 0.001 1.31 4.07 Intr - 179677 179630 48 2 0 68 94 30 0.032 0.25 4.06 Intr - 183519 183471 49 0 1 47 101 44 0.082 -0.15 4.05 Intr - 194355 194301 55 2 1 103 51 74 0.410 4.18 4.04 Intr - 197221 196934 288 0 0 74 83 111 0.335 5.56 4.03 Intr - 198419 198337 83 2 2 106 37 49 0.191 0.04 4.02 Intr - 198727 198580 148 0 1 110 47 74 0.277 5.74 4.01 Intr - 199629 199524 106 1 1 79 86 22 0.567 0.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:46621585_46822167|GENSCAN_predicted_peptide_1|700_aa SMQAARCPTDELSLTNCAVVNEKDFQSGQHVIVRTSPNHRYTFTLKTHPSVVPGSIAFSL PQRKWAGLSIGQEIEVSLYTFDKAKQCIGTMTIEIDFLQKKSIDSNPYDTDKMAAEFIQQ FNNQAFSVGQQLVFSFNEKLFGLLVKDIEAMDPSILKGEPATGKRQKIEVGLVVGNSQVA FEKAENSSLNLIGKAKTKENRQSIINPDWNFEKMGIGGLDKEFSDIFRRAFASRVFPPEI VEQMGCKHVKGILLYGPPGCGKTLLARQIGKMLNAREPKVVNGPEILNKYVGESEANIRK LFADAEEEQRRLGANSGLHIIIFDEIDAICKQRGSMAGSTGVHDTVVNQLLSKIDGVEQL NNILVIGMTNRPDLIDEALLRPGRLEVKMEIGLPDEKGRLQILHIHTARMRGHQLLSADV DIKELAVETKNFSGAELEGLVRAAQSTAMNRHIKASTKVEVDMEKAESLQVTRGDFLASL ENDIKPAFGTNQEDYASYIMNGIIKWGDPVTRVLDDGELLVQQTKNSDRTPLVSVLLEGP PHSGKTALAAKIAEESNFPFIKICSPDKMIGFSETAKCQAMKKENLNRITGTSDFPRVGK ELEQCQRQANKVTEITLNNFDKVLEHDGKLTELEQRSDQLLDMSSAFSKTTKTLAQKKCW ENIHCQIYLGLVVGGSLLIILIEQLAIFLPQSDTSNAPQT >gi568815581r:46621585_46822167|GENSCAN_predicted_CDS_1|2103_bp agcatgcaagcggcaagatgtcctacagatgaattatctttaaccaattgtgcagttgtg aatgaaaaggatttccagtctggccagcatgtgattgtgaggacctctcccaatcacagg tacacatttacactgaagacacatccatcggtggttccagggagcattgcattcagttta cctcagagaaaatgggctgggctttctattgggcaagaaatagaagtctccttatataca tttgacaaagccaaacagtgtattggcacaatgaccatcgagattgatttcctgcagaaa aaaagcattgactccaacccttatgacaccgacaagatggcagcagaatttattcagcaa ttcaacaaccaggccttctcagtgggacaacagcttgtctttagcttcaatgaaaagctt tttggcttactggtgaaggacattgaagccatggatcctagcatcctgaagggagagcct gcgacagggaaaaggcagaagattgaagtaggactggttgttggaaacagtcaagttgca tttgaaaaagcagaaaattcgtcacttaatcttattggcaaagctaaaaccaaggaaaat cgccaatcaattatcaatcctgactggaactttgaaaaaatgggaataggaggtctagac aaggaattttcagatattttccgacgagcatttgcttcccgagtatttcctccagagatt gtggagcagatgggttgtaaacatgttaaaggcatcctgttatatggacccccaggttgt ggtaagactctcttggctcgacagattggcaagatgttgaatgcaagagagcccaaagtg gtcaatgggccagaaatccttaacaaatatgtgggagaatcagaggctaacattcgcaaa ctttttgctgatgctgaagaggagcaaaggaggcttggtgctaacagtggtttgcacatc atcatctttgatgaaattgatgccatctgcaagcagagagggagcatggctggtagcacg ggagttcatgacactgttgtcaaccagttgctgtccaaaattgatggcgtggagcagcta aacaacatcctagtcattggaatgaccaatagaccagatctgatagatgaggctcttctt agacctggaagactggaagttaaaatggagataggcttgccagatgagaaaggccgacta cagattcttcacatccacacagcaagaatgagagggcatcagttactctctgctgatgta gacattaaagaactggccgtggagaccaagaatttcagtggtgctgaattggagggtctg gtgcgagcagcccagtccactgctatgaatagacacataaaggccagtactaaagtggaa gtggacatggagaaagcagaaagcctgcaagtgacgagaggagacttccttgcttctttg gagaatgatatcaaaccagcctttggcacaaaccaagaagattatgcaagttacattatg aacggtatcatcaaatggggtgacccagttactcgagttctagatgatggggagctgctg gtgcagcagactaagaacagtgaccgcacaccattggtcagcgtgcttctggaaggccct cctcacagtgggaagactgctttagctgcaaaaattgcagaggaatccaacttcccgttc atcaagatctgttctcctgataaaatgattggcttttctgaaacagccaaatgtcaggcc atgaagaaggaaaatctgaatagaataactgggaccagtgatttccctagagtagggaaa gagttggagcagtgccagcggcaagcgaacaaggtgacggaaatcacgcttaacaacttt gacaaggtcctggagcatgatggaaagctgaccgaactggagcagcgttcagaccaactc ctggatatgagctcagccttcagcaagacaacaaagaccctggcccagaagaagtgctgg gagaacatccattgccagatctacttggggctagtggtgggtggtagcctgctcatcatc ctgattgagcagctggccatctttctccctcagagtgacaccagtaatgccccacagacc tag >gi568815581r:46621585_46822167|GENSCAN_predicted_peptide_2|183_aa MARSRTSSSPAISQALLELEMNSDLKAQLRELNITAAKETEVGGGRKAIIIFVPVPQLKS FQKIQVRLVRELEKKFSGKHVVFIAQRRILPKPTRKSRTKNKQKCPRSRTLTAVHDAFLE DLVFPSEIVGKRIPVKLDSSRLIKVHLDKAQQNNVEHKVETFSGVYKKLTGKDVNFEFPE FQL >gi568815581r:46621585_46822167|GENSCAN_predicted_CDS_2|552_bp atggcgagaagccggacgagttcgagtccggccatctcccaggctcttctggagctggag atgaactcggacctcaaggctcagctcagggagctgaatattacggcagccaaggaaact gaagttggtggtggtcggaaagctatcataatctttgttcccgttcctcaactgaaatct ttccagaaaatccaagtccggctagtacgcgaattggagaaaaagttcagtgggaagcat gtcgtctttatcgctcagaggagaattctgcctaagccaactcgaaaaagccgtacaaaa aataagcaaaagtgtcccaggagccgtactctgacagctgtgcacgatgccttccttgag gacttggtcttcccaagcgaaattgtgggcaagagaatccccgtcaaactagatagcagc cggctcataaaggttcatttggacaaagcacagcagaacaatgtggaacacaaggttgaa actttttctggtgtctataagaagctcacgggcaaggatgttaattttgaattcccagag tttcaattgtaa >gi568815581r:46621585_46822167|GENSCAN_predicted_peptide_3|198_aa MAIIKKSGNDKRWCRCGEIGTCMHCCGFAISCYLQNPSSMLALETSVKPNRLKSCPDEGE LARNTVNIGRKLLIIGTTSRKDVLQEMEMLNAFSTTIHVPNIATGEQLLEALELLGNFKD KERTTIAQQVKGKKVWIGIKKLLMLIEMSLQVSDQVNASYACGIESESGALRPDLQRLTF HRPLSLNPFMSNGFRAAE >gi568815581r:46621585_46822167|GENSCAN_predicted_CDS_3|597_bp atggctatcatcaaaaagtcaggtaatgacaagcgctggtgcagatgtggagaaattgga acctgtatgcactgctgtggttttgctatcagctgttacttgcagaatccaagcagtatg ctagccctggaaacatcagtaaagccaaaccgacttaagtcctgtcctgatgaaggagaa ctagcaagaaatacagtaaatataggccgcaagcttcttatcattgggaccactagccgc aaagatgtccttcaggagatggaaatgcttaacgctttcagcaccaccatccacgtgccc aacattgccacaggagagcagctgttggaagctttggagcttttgggcaacttcaaggat aaggaacgcaccacaattgcacagcaagtcaaagggaagaaggtctggataggaatcaag aagttactaatgctgatcgagatgtccctacaggtcagtgatcaagttaatgcttcttat gcatgtgggatagagagtgagagtggggcactcaggcctgatcttcagcgactgacattt cataggcctctgagtttgaaccccttcatgtcaaatggatttcgtgcagctgagtga >gi568815581r:46621585_46822167|GENSCAN_predicted_peptide_4|614_aa XAHGQLLDSTLQPTCFLPDAHDLRLSSPGCLEEAFSPAMGWERSRSHDKPRRLSRPLVPP RPFPRAPCAGSSRVRRGLADQKGQQFPTQRSLLPTGSASFTPDRGCAESWCLRPRALIGC SLTSSNPAAPRWAREGGGCGWRCASDKPESHFQSQVDFVPTIGGVAPPLHGRGQTSSSAP LLMEPHLLGLLLGLLLGGTRVLAGYPIWWCWAQGCVAVNIFADIYLWWVRGLTDFKNEAT YLCAWPLPPCTRSSPNPNMEVSVEGGALLGPGTSWASVQYCPGASRSLALGQQYTSLGSQ PLLCGSIPGLVPKQLRFCRNYIEIMPSVAEGVKLGIQECQHQFRGRRWNCTTIDDSLAIF GPVLDKATRESAFVHAIASAGVAFAVTRSCAEGTSTICGCDSHHKGPPGEGWKWGGCSED ADFGVLVSREFADARENRPDARSAMNKHNNEAGRTTILDHMHLKCKCHGLSGSCEVKTCW WAQPDFRAIGDFLKDKYDSASEMVVEKHRESRGWVETLRAKYSLFKPPTERDLVYYENSP NFCEPNPETGSFGTRDRTCNVTSHGIDGCDLLCCGRGHNTRTEKRKEKCHCIFHWCCYVS CQECIRIYDVHTCK >gi568815581r:46621585_46822167|GENSCAN_predicted_CDS_4|1845_bp ngcgcacatggccagctcctagattccacccttcaacccacttgtttcctgcctgatgca catgacctgcgtctgagttctccaggctgcctggaggaggcattcagtccagcaatgggc tgggaacgcagcaggagccatgacaagcccaggcggctctcccgacccttggtgcccccg aggccatttccccgcgctccctgtgccggcagcagccgcgtgcggagagggctcgccgac cagaaggggcagcagttccctacacagcggtccctgctccccaccggcagtgcttccttc accccagaccggggctgcgcagagtcctggtgcctcaggccgcgggcgctgattggctgc tcgctgacatcctcaaacccggctgctccgcgctgggctcgggaggggggcggctgcggg tggaggtgcgcttctgacaagcccgaaagtcatttccaatctcaagtggactttgttcca actattgggggcgtcgctccccctcttcatggtcgcgggcaaacttcctcctcggcgcct cttctaatggagccccacctgctcgggctgctcctcggcctcctgctcggtggcaccagg gtcctcgctggctacccaatttggtggtgctgggcccagggctgtgtggctgtgaatatc tttgcagacatctacctgtggtgggttcgtggtctcactgacttcaagaatgaagccacg tacctttgcgcctggcccctgccgccctgcacccgctcctctcccaaccccaacatggaa gtttccgtggagggtggagcccttctgggcccaggaacaagttgggcctctgtccagtac tgcccaggagccagcaggtccctggccctgggccagcagtacacatctctgggctcacag cccctgctctgcggctccatcccaggcctggtccccaagcaactgcgcttctgccgcaat tacatcgagatcatgcccagcgtggccgagggcgtgaagctgggcatccaggagtgccag caccagttccggggccgccgctggaactgcaccaccatagatgacagcctggccatcttt gggcccgtcctcgacaaagccacccgcgagtcggccttcgttcacgccatcgcctcggcc ggcgtggccttcgccgtcacccgctcctgcgccgagggcacctccaccatttgcggctgt gactcgcatcataaggggccgcctggcgaaggctggaagtggggcggctgcagcgaggac gctgacttcggcgtgttagtgtccagggagttcgcggatgcgcgcgagaacaggccggac gcgcgctcggccatgaacaagcacaacaacgaggcgggccgcacgactatcctggaccac atgcacctcaaatgcaagtgccacgggctgtcgggcagctgtgaggtgaagacctgctgg tgggcgcagcctgacttccgtgccatcggtgacttcctcaaggacaagtatgacagcgcc tcggagatggtagtagagaagcaccgtgagtcccgaggctgggtggagaccctccgggcc aagtactcgctcttcaagccacccacggagagggacctggtctactacgagaactccccc aacttttgtgagcccaacccagagacgggttcctttggcacaagggaccggacttgcaat gtcacctcccacggcatcgatggctgcgatctgctctgctgtggccggggccacaacacg aggacggagaagcggaaggaaaaatgccactgcatcttccactggtgctgctacgtcagc tgccaggagtgtattcgcatctacgacgtgcacacctgcaagtag