GENSCAN 1.0	Date run: 5-Nov-116	Time: 23:12:51

Sequence gi568815588r:98329538_98543240 : 213703 bp : 45.02% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  22521  22833  313  2  1   75   35   139 0.558   4.80
 1.02 Intr +  25534  25662  129  2  0   88   89   103 0.855  11.07
 1.03 Term +  38658  38755   98  1  2  129   52    85 0.712   7.13
 1.04 PlyA +  39522  39527    6                               1.05

 2.18 PlyA -  41521  41516    6                               1.05
 2.17 Term -  54630  54485  146  1  2   92   53    51 0.756   0.07
 2.16 Intr -  55530  55410  121  0  1  112  105   103 0.988  14.47
 2.15 Intr -  57770  57664  107  0  2   65   63   139 0.999   9.03
 2.14 Intr -  58971  58817  155  2  2  101   88   145 0.996  15.52
 2.13 Intr -  60950  60873   78  1  0   42  110    63 0.838   2.57
 2.12 Intr -  61217  61061  157  0  1  112   91   145 0.996  16.27
 2.11 Intr -  61545  61473   73  0  1   86   95    67 0.994   6.18
 2.10 Intr -  63029  62895  135  2  0   48   94   138 0.998  11.16
 2.09 Intr -  63546  63405  142  2  1   65   61    86 0.518   3.96
 2.08 Intr -  65756  65659   98  2  2  110   92    34 0.997   4.81
 2.07 Intr -  65915  65854   62  0  2   89  113    40 0.959   5.05
 2.06 Intr -  67961  67808  154  2  1    4  111   176 0.656  11.15
 2.05 Intr -  70720  70565  156  1  0   90   91   157 0.975  16.41
 2.04 Intr -  78118  78045   74  2  2   81   97    60 0.997   5.33
 2.03 Intr -  78460  78367   94  1  1   64   99   163 0.988  14.54
 2.02 Intr -  81421  81402   20  2  2  114   68    38 0.935   0.73
 2.01 Init -  85598  85472  127  2  1   90  103   124 0.990  12.47
 2.00 Prom -  86702  86663   40                              -7.46

 3.20 PlyA -  87512  87507    6                               1.05
 3.19 Term -  88189  88027  163  0  1  129   47   294 0.999  26.81
 3.18 Intr -  88720  88638   83  1  2  117   66   114 0.995  10.54
 3.17 Intr -  90621  90508  114  0  0   93   78   190 0.807  19.14
 3.16 Intr -  92984  92832  153  2  0    8   86   166 0.442   8.67
 3.15 Intr -  93222  93150   73  2  1   74   57    39 0.396  -1.19
 3.14 Intr -  94350  94216  135  2  0  105  115   101 0.678  14.18
 3.13 Intr -  94837  94776   62  1  2  104   64     9 0.561  -2.37
 3.12 Intr -  96183  96004  180  0  0   89   94   227 0.879  23.46
 3.11 Intr -  96448  96281  168  1  0  108   53   139 0.957  12.54
 3.10 Intr -  97727  97678   50  0  2   74  109    43 0.960   3.40
 3.09 Intr - 100105 100036   70  1  1  104   94    34 0.962   4.35
 3.08 Intr - 100352 100254   99  2  0   71   78    80 0.980   5.61
 3.07 Intr - 101133 100965  169  2  1   82   52   181 0.726  13.85
 3.06 Intr - 101754 101594  161  0  2   87   71   231 0.954  20.09
 3.05 Intr - 104554 104446  109  0  1   95   47   211 0.992  17.99
 3.04 Intr - 105877 105735  143  1  2   95   64   267 0.999  24.15
 3.03 Intr - 106235 106098  138  2  0   91  105   267 0.999  29.36
 3.02 Intr - 113703 113587  117  0  0  123  105   163 0.993  22.16
 3.01 Init - 117505 117350  156  1  0   70  104    63 0.768   4.01
 3.00 Prom - 118557 118518   40                              -7.06

 4.07 PlyA - 119415 119410    6                               1.05
 4.06 Term - 130202 130037  166  1  1  105   43   204 0.992  14.99
 4.05 Intr - 138531 138296  236  0  2   33   75   122 0.220   1.89
 4.04 Intr - 142914 142785  130  2  1   12   68    92 0.044   0.30
 4.03 Intr - 153245 153099  147  1  0   60   70   163 0.137  11.05
 4.02 Intr - 160659 160514  146  0  2  141  108    88 0.992  15.18
 4.01 Init - 187180 187166   15  1  0  102   94    -1 0.071   2.27
 4.00 Prom - 191830 191791   40                              -6.46

 5.00 Prom + 192125 192164   40                              -1.76
 5.01 Init + 212834 213545  712  1  1   88   58   443 0.843  36.37

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 153245 153083  163  1  1   60   48   200 0.856  10.61

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:98329538_98543240|GENSCAN_predicted_peptide_1|179_aa
MSSGESPVLKDVMARATEFCSSKDEKTFCVCPNQEGKQQVWRVLNFDKFQRLSDKNQGEI
PGGRGNQGKCLSPGRRGEVNGRTRLPRMFVSELDHGRDLRVNLIGEVKDRVPTCEAVSVT
NQRFHFHLHNFQGTDSSSDKYDQRPSYGWHCTEMEPDETAPFERLLYRVGSASCSCAPR

>gi568815588r:98329538_98543240|GENSCAN_predicted_CDS_1|540_bp
atgagctcaggcgaaagtcctgtcttgaaagatgtgatggctcgagctacggaattctgt
tccagcaaggatgagaaaacattctgtgtctgtccaaaccaagaaggaaagcaacaagta
tggagggttttaaactttgacaagttccagaggctcagtgataagaaccagggagagatt
ccagggggtagaggcaatcaagggaaatgtctgagcccaggaagaagaggggaggtgaat
ggcaggacccggttaccaaggatgttcgtgtctgagttagaccatggaagagacctccgt
gtgaacctgataggtgaggtcaaggatagagtccccacatgtgaggctgtgtcagtcacc
aaccagagattccacttccacctccacaacttccagggcacagactcctcatctgacaaa
tatgaccaaagacctagctatggctggcactgcacagagatggaacctgatgaaacggcc
ccatttgagcgcctgctgtatcgcgttggctctgcttcctgcagctgtgctccaagatga

>gi568815588r:98329538_98543240|GENSCAN_predicted_peptide_2|632_aa
MAASGRGLCKAVAASPFPAWRRDNTEARGGLKPEYDAVVIGAGHNGLVAAAYLQRLGVNT
AVFERRHVIGGAAVTEEIIPGFKFSRASYLLSLLRPQIYTDLELKKHGLRLHLRNPYSFT
PMLEEGAGSKVPRCLLLGTDMAENQKQIAQFSQKDAQVFPKYEEFMHRLALAIDPLLDAA
PVDMAAFQHGSLLQRMRSLSTLKPLLKAGRILGAQLPRYYEVLTAPITKVLDQWFESEPL
KATLATDAVIGAMTSPHTPGSGYVLLHHVMGGLEGMQGAWGYVQGGMGALSDAIASSATT
HGASIFTEKTVAKVQVNSEGCVQGVVLEDGTEVRSKMVLSNTSPQITFLKLTPQEWLPEE
FLERISQLDTRSPVTKINVAVDRLPSFLAAPNAPRGQPLPHHQCSIHLNCEDTLLLHQAF
EDAMDGLPSHSDLGSAKIHTPKDRKCIPGEKSEDEGRPVIELCIPSSLDPTLAPPGCHVV
SLFTQYMPYTLAGGKAWDEQERDAYADRVFDCIEVYAPGFKDSVVGRDILTPPDLERIFG
LPGGNIFHCAMSLDQLYFARPVPLHSGYRCPLQGLYLCGSGAHPAPYSEILTSLTKRVPI
QHVHPQPLRLWAVDLAPPQTFGAGSLEGARIY

>gi568815588r:98329538_98543240|GENSCAN_predicted_CDS_2|1899_bp
atggctgcaagtggccgaggtctctgcaaggctgtggccgcctctcccttcccggcgtgg
agacgagataacacggaagccaggggaggtctgaagcctgagtatgatgcggtggtgata
ggagcaggacacaacggactggtggctgcagcgtacctgcagagactgggggtgaacacc
gccgtcttcgagaggcgccatgtgatcgggggtgcagctgtcactgaggagatcatccca
gggtttaagttctcccgcgcgtcctacctgctcagcctgctgaggccgcagatttacact
gatctggagctgaagaaacatgggctgaggcttcatcttcgaaacccctactccttcacc
cccatgctggaagagggtgcaggcagcaaggtgcccaggtgccttctgctgggcacagac
atggcagaaaaccagaagcagatcgcccagttctcccagaaggatgcccaggtctttccc
aaatatgaggagttcatgcatcgcttggcattagccattgaccctctgctggatgcggcc
cccgtggacatggcggccttccagcatggctccttgctgcaaaggatgaggtcgctctcc
accctcaagcccctgctgaaggcaggccgcatcctgggagcccagcttccccgatattat
gaggtcctcacagctcccattaccaaggtgctggatcagtggttcgagtctgagccttta
aaagccactctagccacagatgcagtgattggagccatgacaagtccccacactccgggg
agtgggtatgtgctgctgcaccatgtgatggggggcctggagggaatgcagggggcctgg
ggctacgtccaggggggcatgggtgccctctctgatgcgatcgcaagctcagccaccaca
catggagcaagcatcttcactgaaaagacagtggcgaaggtgcaggtgaacagtgaaggc
tgtgttcaaggagttgtgctggaagatggcacagaggtgagaagcaaaatggtgctgtcc
aacacatcaccgcagatcaccttcctgaagctgacgccacaggagtggcttcctgaggag
ttcctggagagaatctctcagctggacacccggtcgcctgtcaccaagatcaatgtggcc
gtagacaggctgcccagcttcctggcggcccccaatgctcccaggggccagccgctgccc
catcaccaatgctccatccacctgaactgtgaagacaccctcctccttcatcaggccttt
gaagatgccatggatggcctgccttcccacagtgacttaggatctgccaaaatccatact
cctaaggacaggaagtgcatcccaggtgaaaaatctgaggatgagggcaggcctgtgatt
gagctctgcatcccttcctcgctggaccccaccctggctccccctggctgccatgtagtc
tccctcttcactcagtacatgccctatacgctggctggaggcaaggcctgggacgagcag
gagagagacgcttatgcagacagagtgtttgattgcatcgaggtctatgcccctggcttc
aaggactctgtggttggcagagacatcctcacaccaccagatttggagagaatcttcggg
cttcctggagggaacatattccactgcgccatgtccctggaccagctctacttcgcccgc
cccgtgcccctgcattctggctaccgctgccctctccagggcctgtatctctgtggaagt
ggggctcatcctgccccttattctgaaattttaacttcacttactaagcgtgtgcccatc
cagcacgtccaccctcagccgttgagactgtgggctgtggacctggcccctccccagacc
tttggtgcaggaagcctggagggagccagaatctattga

>gi568815588r:98329538_98543240|GENSCAN_predicted_peptide_3|780_aa
MGKAWQGRGSNPRKATAAPAPPGAGGHVRDHAVLPGSAPGLLQAGRCARRDPMKCVLVAT
EGAEVLFYWTDQEFEESLRLKFGQSENEEEELPALEDQLSTLLAPVIISSMTMLEKLSDT
YTCFSTENGNFLYVLHLFGECLFIAINGDHTESEGDLRRKLYVLKYLFEVHFGLVTVDGH
LIRKELRPPDLAQRVQLWEHFQSLLWTYSRLREQEQCFAVEALERLIHPQLCELCIEALE
RHVIQAVNTSPERGGEEALHAFLLVHSKLLAFYSSHSASSLRPADLLALILLVQDLYPSE
STAEDDIQVSGQLSGQREAIREVVQKTCLKMPSPRRARSSQNIPVQQAWSPHSTGPTGGS
SAETETDSFSLPEEYFTPAPSPGDQSSGSTIWLEGGTPPMDALQIAEDTLQTLVPHCPVP
SGPRRIFLDANVKESYCPLVPHTMYCLPLWQGINLVLLTRSPSAPLALVLSQLMDGFSML
EKKLKEGPEPGASLRSQPLVGDLRQRMDKFVKNRGAQEIQSTWLEFKAKAFSKSEPGSSW
ELLQACGKLKRQLCAIYRLNFLTTAPSRGGPHLPQHLQDQVQRLMRARTARVKCASVEDL
TAFQGLGSLETFTYLEDFPGLVHFIYVDRTTGQMVAPSLNCSQKTSSELGKGPLAAFVKT
KVWSLIQLARRYLQKGYTTLLFQEGDFYCSYFLWFENDMGYKLQMIEVPVLSDDSVPIGM
LGGDYYRKLLRYYSKNRPTEAVRCYELLALHLSVIPTDLLVQQAGQLARRLWEASRIPLL

>gi568815588r:98329538_98543240|GENSCAN_predicted_CDS_3|2343_bp
atgggcaaggcctggcaggggcggggctcgaacccgcggaaggcgacggcggctccggcc
cctcccggggcgggcggtcacgtgcgcgatcacgcggtcctacccggaagcgcgcccggg
ctcctgcaggcggggcgctgtgcgcgccgcgatccgatgaagtgcgtcttggtggccact
gagggcgcagaggtcctcttctactggacagatcaggagtttgaagagagtctccggctg
aagttcgggcagtcagagaatgaggaagaagagctccctgccctggaggaccagctcagc
accctcctagccccggtcatcatctcctccatgacgatgctggagaagctctcggacacc
tacacctgcttctccacggaaaatggcaacttcctgtatgtccttcacctgtttggagaa
tgcctgttcattgccatcaatggtgaccacaccgagagcgagggggacctgcggcggaag
ctgtatgtgctcaagtacctgtttgaagtgcactttgggctggtgactgtggacggtcat
cttatccgaaaggagctgcggcccccagacctggcgcagcgtgtccagctgtgggagcac
ttccagagcctgctgtggacctacagccgcctgcgggagcaggagcagtgcttcgccgtg
gaggccctggagcgactgattcacccccagctctgtgagctgtgcatagaggcgctggag
cggcacgtcatccaggctgtcaacaccagccccgagcggggaggcgaggaggccctgcat
gccttcctgctcgtgcactccaagctgctggcattctactctagccacagtgccagctcc
ctgcgcccggccgacctgcttgccctcatcctcctggttcaggacctctaccccagcgag
agcacagcagaggacgacattcaggtctcagggcagctttctgggcagagggaggccatt
agggaggtagttcagaaaacctgcctgaagatgccttccccgcggagggcccggagcagc
cagaacatccccgtgcagcaggcctggagccctcactccacgggcccaactggggggagc
tctgcagagacggagacagacagcttctccctccctgaggagtacttcacaccagctcct
tcccctggcgatcagagctcaggtagcaccatctggctggaggggggcaccccccccatg
gatgcccttcagatagcagaggacaccctccaaacactggttccccactgccctgtgcct
tccggccccagaaggatcttcctggatgccaacgtgaaggaaagctactgccccctagtg
ccccacaccatgtactgcctgcccctgtggcagggcatcaacctggtgctcctgaccagg
agccccagcgcgcccctggccctggttctgtcccagctgatggatggcttctccatgctg
gagaagaagctgaaggaagggccggagcccggggcctccctgcgctcccagcccctcgtg
ggagacctgcgccagaggatggacaagtttgtcaagaatcgaggggcacaggagattcag
agcacctggctggagtttaaggccaaggctttctccaaaagtgagcccggatcctcctgg
gagctgctccaggcatgtgggaagctgaagcggcagctctgcgccatctaccggctgaac
tttctgaccacagcccccagcaggggaggcccacacctgccccagcacctgcaggaccaa
gtgcagaggctcatgcgggccagaactgcccgtgtcaaatgtgcaagtgttgaggatctt
acagcttttcaaggcctggggagcttggagaccttcacctacctagaagacttcccaggc
ttggtgcacttcatctatgtggaccgcaccactgggcagatggtggcgccttccctcaac
tgcagtcaaaagacctcgtcggagttgggcaaggggccgctggctgcctttgtcaaaact
aaggtctggtctctgatccagctggcgcgcagatacctgcagaagggctacaccacgctg
ctgttccaggagggggatttctactgctcctacttcctgtggttcgagaatgacatgggg
tacaaactccagatgatcgaggtgcccgtcctctccgacgactcagtgcctatcggcatg
ctgggaggagactactacaggaagctcctgcgctactacagcaagaaccgcccaaccgag
gctgtcaggtgctacgagctgctggccctgcacctgtctgtcatccccactgacctgctg
gtgcagcaggccggccagctggcccggcgcctctgggaggcctcccgtatccccctgctc
tag

>gi568815588r:98329538_98543240|GENSCAN_predicted_peptide_4|279_aa
MGAIQDYWLSLLYKRLIGPKVLAVHVAGLQRKPRPGRVIRDKLRIYAHCTNHHNHNYVRG
SITLFIINLHRSRKKIKLAGTLRDKLVHQYLLQPYGQEGLKSKSTTSAAALLKGHSEPLS
RKDKLQRLRAPNGKKVPGIREIYELARGGREPAKHKQVRGRPDWRPGAAGDPEPGRAQRR
VVAGAADPAPALGPPVLCTLAGKPAPLQELQSRFARFLNIRPHSRSVQLNGQPLVMVDDG
TLPELKPRPLRAGRTLVIPPVTMGFYVVKNVNALACRYR

>gi568815588r:98329538_98543240|GENSCAN_predicted_CDS_4|840_bp
atgggggcaattcaggactactggctctctctcctctacaagcgcctgatcggccccaaa
gtcttggctgtgcatgtggctgggctccagcggaagccacggcctggccgagtgatccgg
gacaaactaaggatttatgctcactgcacaaaccaccacaaccacaactacgttcgtggg
tccattacactttttatcatcaacttgcatcgatcaagaaagaaaatcaagctggctggg
actctcagagacaagctggttcaccagtacctgctgcagccctatgggcaggagggccta
aagtccaaatcaaccacatcagctgcagcccttttgaaaggccattcagagcccctcagc
aggaaagacaagctgcagaggctgagagcaccgaatggcaagaaagtgcctgggatcagg
gagatttacgagctggctcgagggggccgggagcctgcaaagcacaaacaggtgcggggc
cgacccgactggcggccgggggcagccggggacccggagcccgggagagcccagcgccga
gtcgtggcgggcgccgcagaccccgcacctgccctgggaccccccgttctctgcacgctg
gccgggaagccggcacccctgcaggaattgcagtcccgtttcgcacgtttccttaatatt
aggccacattccaggtcagtgcaactgaatggccagcccttagtgatggtggacgacggg
accctcccagaattgaagccccgcccccttcgggccggccggacattggtcatccctcca
gtcaccatgggcttttatgtggtcaagaatgtcaatgctttggcctgccgctaccgataa

>gi568815588r:98329538_98543240|GENSCAN_predicted_peptide_5|238_aa
MGKKQSRKTGNSKKQSASPPPKERSSSPAMEQSWMENDFDELREEGFRQSNYSELWEDIQ
TKGKEVENFEKNLEEYVTRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA
MEDEMNEMKREGKFREKRIKRNEQSLQETWDYVKRPNLCLIGVPESDGENGTKLENTLQD
IIQENFPNLARQANIQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKX

>gi568815588r:98329538_98543240|GENSCAN_predicted_CDS_5|714_bp
atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcgcctctcctcct
ccaaaggaacgcagttcctcaccagcaatggaacaaagctggatggagaatgactttgac
gagctgagagaagaaggcttcagacaatcaaattactccgagctatgggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatatgtaactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagcg
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa
agaaacgagcaaagcctccaagaaacatgggactatgtgaaaagaccaaatctatgtctg
attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacattcagattcaggaaata
cagagaatgccacaaagatactcctcgagaagagcaactccaagacacatcattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaagnn