GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:04:52 Sequence gi568815593f:176989603_177197673 : 208071 bp : 47.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 116 111 6 1.05 1.02 Term - 801 743 59 1 2 83 42 73 0.155 0.05 1.01 Init - 3414 3366 49 0 1 86 58 53 0.264 1.22 1.00 Prom - 5091 5052 40 0.74 2.03 PlyA - 9255 9250 6 1.05 2.02 Term - 16353 16147 207 0 0 85 48 110 0.599 4.04 2.01 Init - 17227 16937 291 1 0 52 102 130 0.739 5.95 2.00 Prom - 25280 25241 40 -5.76 3.00 Prom + 30321 30360 40 -4.16 3.01 Init + 33137 33295 159 1 0 95 80 203 0.973 18.98 3.02 Intr + 33558 33632 75 2 0 90 119 50 0.969 8.01 3.03 Term + 34105 34263 159 0 0 119 42 91 0.899 5.54 3.04 PlyA + 41155 41160 6 1.05 4.00 Prom + 41251 41290 40 -5.36 4.01 Init + 50278 50347 70 0 1 74 98 36 0.919 2.41 4.02 Intr + 51524 51627 104 0 2 105 85 80 0.993 9.29 4.03 Intr + 52176 52268 93 2 0 88 90 55 0.980 5.86 4.04 Intr + 54718 54931 214 0 1 51 79 210 0.802 14.69 4.05 Intr + 61101 61334 234 1 0 84 94 184 0.605 16.26 4.06 Intr + 72456 72549 94 1 1 65 100 101 0.706 8.02 4.07 Intr + 75904 76007 104 1 2 91 51 73 0.165 3.72 4.08 Intr + 99948 100091 144 1 0 64 52 141 0.164 8.35 4.09 Intr + 100788 101051 264 1 0 124 83 181 0.921 18.68 4.10 Intr + 101143 101223 81 2 0 46 86 54 0.576 0.61 4.11 Intr + 101336 101502 167 0 2 52 100 268 0.941 24.08 4.12 Intr + 102083 102206 124 1 1 77 80 178 0.963 16.06 4.13 Intr + 102719 102909 191 0 2 77 93 398 0.999 38.50 4.14 Intr + 103044 103182 139 2 1 71 99 219 0.778 21.34 4.15 Intr + 103536 103729 194 1 2 -7 84 228 0.846 12.11 4.16 Intr + 103804 103949 146 0 2 20 85 63 0.972 -1.72 4.17 Intr + 104052 104173 122 0 2 91 113 133 0.875 16.24 4.18 Intr + 105728 105838 111 0 0 96 81 228 0.776 23.25 4.19 Intr + 105931 106121 191 2 2 125 87 249 0.988 27.80 4.20 Intr + 106455 106577 123 2 0 76 64 218 0.841 18.98 4.21 Term + 106685 106759 75 1 0 120 47 81 0.999 5.04 4.22 PlyA + 106788 106793 6 -5.32 5.06 PlyA - 106862 106857 6 -8.24 5.05 Term - 107089 106945 145 0 1 54 48 83 0.696 -1.72 5.04 Intr - 107536 107241 296 1 2 -44 5 903 0.682 64.61 5.03 Intr - 108472 108403 70 0 1 105 48 96 0.604 6.48 5.02 Intr - 127477 127316 162 0 0 60 40 162 0.133 7.69 5.01 Init - 142791 142475 317 0 2 31 -38 345 0.026 11.01 5.00 Prom - 145344 145305 40 -11.63 6.00 Prom + 145417 145456 40 -8.66 6.01 Init + 145502 146428 927 1 0 81 52 555 0.662 45.09 6.02 Intr + 179724 179826 103 2 1 15 106 93 0.184 3.55 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 89780 89902 123 1 0 97 48 75 0.820 2.78 S.002 Sngl - 142791 142258 534 0 0 31 42 332 0.967 16.97 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:176989603_177197673|GENSCAN_predicted_peptide_1|35_aa MGFRHVGHAGLQLLISVLKSGKCKIKVPDEGLHCA >gi568815593f:176989603_177197673|GENSCAN_predicted_CDS_1|108_bp atggggtttcgccatgttggccatgctggtctccaactcctgatctcagttctgaagtct ggtaaatgtaagatcaaggtgcctgatgagggcttgcactgtgcttaa >gi568815593f:176989603_177197673|GENSCAN_predicted_peptide_2|165_aa MAPPRRGLVKSPRAAPAPAPRACLWHRTRAPLPLPPRSQRVAARDPRQPGEGLQSGLRQR VSTRRLRTAPPTPPCLLFFLSGSAARYSPGGALPDAKGRRVSRCLGSAAGFSSATAGGLV PCRSVNGARVPGSSGAGFESQQLDSSVLSFNHYIAPRYRNDTRKK >gi568815593f:176989603_177197673|GENSCAN_predicted_CDS_2|498_bp atggctcccccgcggcgggggttggttaagtctccgcgcgctgcgcctgcgcccgccccg agagcgtgtctctggcatcggacgcgcgcgcccctccccctccccccgcgctcccaacgt gtggcggctcgcgacccccggcaacccggagaaggtctacagagcggcctgcgccagcga gtgagtacccgccgcctgcgcacagctccgcccacccctccctgcctccttttcttcctc agcgggtccgcggcccgctactctccgggaggggcgcttcccgacgccaagggccggcgc gtttccaggtgcttaggcagcgccgcaggcttctcgagcgccacagccggcgggctggtg ccctgccgctcagttaacggggcgcgagtcccgggcagtagtggagctggatttgaatcc cagcagctggactccagtgttctttcatttaatcactacatcgcaccgagatatagaaat gacacccggaaaaagtga >gi568815593f:176989603_177197673|GENSCAN_predicted_peptide_3|130_aa MEYPAPATVQAADGGAAGPYSSSELLEGQEPDGVRFDRERARRLWEAVSGAQPRSFSHIR GLSSLYSSCRRPTVFAIPSQLLDANPDVQELYLGFSANLVRYLEGNQISPVICLLAEQTC LRKFNMLFVN >gi568815593f:176989603_177197673|GENSCAN_predicted_CDS_3|393_bp atggagtatcccgcgccggccacggtgcaggccgcggacggcggagcggccgggccttac agcagctcggagttgctggagggccaggagccggacggggtgcgctttgaccgcgagagg gcgcgccgcctgtgggaagccgtgtccggtgcccagccgcgcagtttttcccacattcga ggactgtcatccctatactcgtcctgtcggagacctacagtctttgccatcccgagccag ctgcttgatgctaacccagatgtacaagaactctatctgggcttctctgctaaccttgtc aggtatctggagggaaatcagatttctccagtgatttgtttgcttgctgagcagacttgt cttcgaaagtttaatatgctatttgtgaactga >gi568815593f:176989603_177197673|GENSCAN_predicted_peptide_4|994_aa MIELMPWPGAVAHACNPSTLEGQVEHMIQKNQCLFTNTQCKVCCALLISESQKLAHYQSK KHANKVKRYLAIHGMETLKGETKKLDSDQWGTIEDFLVGQWYDQTCVLSSNQKSSRSKDK NQCCPICNMTFSSPVVAQSHYLGKTHAKNLKLKQQSTKVEALSKRLTNPFLVASTLALHQ NREMIDPDKFCSLCHATFNDPVMAQQHYVGKKHRKQETKLKLMARYGRLADPAVTDFPAG KGYPCKTCKIVLNSIEQYQAHVSGFKHKNHGPLAQFAVVLAFPIIHRVISGSQGKKERAD AGFYRRQLVGSPAWVPESCEKEMRLLLALLGVLLSVPGPPVLSLEASEEVELEPCLAPSL EQQEQELTVALGQPVRLCCGRAERGGHWYKEGSRLAPAGRVRGWRGRLEIASFLPEDAGR YLCLARGSMIVLQNLTLITGDSLTSSNDDEDPKSHRDPSNRHSYPQQAPYWTHPQRMEKK LHAVPAGNTVKFRCPAAGNPTPTIRWLKDGQAFHGENRIGGIRLRHQHWSLVMESVVPSD RGTYTCLVENAVGSIRYNYLLDVLERSPHRPILQAGLPANTTAVVGSDVELLCKVYSDAQ PHIQWLKHIVINGSSFGADGFPYVQVLKTADINSSEVEVLYLRNVSAEDAGEYTCLAGNS IGLSYQSAWLTVLPEEDPTWTAAAPEARYTDIILYASGSLALAVLLLLAGLYRGQALHGR HPRPPATVQKLSRFPLARQFSLESGSSGKSSSSLVRGVRLSSSGPALLAGLVSLDLPLDP LWEFPRDRLVLGKPLGEGCFGQVVRAEAFGMDPARPDQASTVAVKMLKDNASDKDLADLV SEMEVMKLIGRHKNIINLLGVCTQEGPLYVIVECAAKGNLREFLRARRPPGPDLSPDGPR SSEGPLSFPVLVSCAYQVARGMQYLESRKCIHRDLAARNVLVTEDNVMKIADFGLARGVH HIDYYKKTSNGRLPVKWMAPEALFDRVYTHQSDV >gi568815593f:176989603_177197673|GENSCAN_predicted_CDS_4|2985_bp atgatagaactcatgccttggccgggcgcggtggctcacgcctgtaatcccagcactttg gaaggccaagtggagcacatgatccagaagaaccaatgtctcttcaccaacacccagtgt aaggtttgctgcgccttgcttatttctgagtcccagaagctggcacattaccagagcaaa aaacatgccaacaaagtgaagagatacctagcaatccatggaatggagacattaaagggg gaaacgaagaagctagactcagatcagtggggaaccattgaagattttctggtggggcag tggtatgatcagacctgtgttctttcttccaaccagaagagcagcagaagcaaagacaag aaccagtgctgccccatctgtaacatgaccttttcctcccctgtcgtggcccagtcgcac tacctggggaagacccacgcaaagaacttaaagctgaagcagcagtccactaaggtggaa gctctgtcaaaacgccttacaaatcctttccttgtggcctccaccttagccttgcaccag aatagagagatgatagacccagacaagttctgcagcctctgccatgcaactttcaacgac cctgtcatggctcaacaacattatgtgggcaagaaacacagaaaacaggagaccaagctc aaactaatggcacgctatgggcggctggcggaccctgctgtcactgactttccagctgga aagggctacccctgcaaaacatgtaagatagtgctgaactccatagaacagtaccaagct catgtcagcggcttcaaacacaagaaccacggaccacttgcccagtttgctgtggtgcta gccttccccatcatccaccgggtgatttctgggtcccagggaaagaaagagagagctgat gcaggtttctacagaaggcagttggtgggaagtccagcttgggtccctgagagctgtgag aaggagatgcggctgctgctggccctgttgggggtcctgctgagtgtgcctgggcctcca gtcttgtccctggaggcctctgaggaagtggagcttgagccctgcctggctcccagcctg gagcagcaagagcaggagctgacagtagcccttgggcagcctgtgcgtctgtgctgtggg cgggctgagcgtggtggccactggtacaaggagggcagtcgcctggcacctgctggccgt gtacggggctggaggggccgcctagagattgccagcttcctacctgaggatgctggccgc tacctctgcctggcacgaggctccatgatcgtcctgcagaatctcaccttgattacaggt gactccttgacctccagcaacgatgatgaggaccccaagtcccatagggacccctcgaat aggcacagttacccccagcaagcaccctactggacacacccccagcgcatggagaagaaa ctgcatgcagtacctgcggggaacaccgtcaagttccgctgtccagctgcaggcaacccc acgcccaccatccgctggcttaaggatggacaggcctttcatggggagaaccgcattgga ggcattcggctgcgccatcagcactggagtctcgtgatggagagcgtggtgccctcggac cgcggcacatacacctgcctggtagagaacgctgtgggcagcatccgctataactacctg ctagatgtgctggagcggtccccgcaccggcccatcctgcaggccgggctcccggccaac accacagccgtggtgggcagcgacgtggagctgctgtgcaaggtgtacagcgatgcccag ccccacatccagtggctgaagcacatcgtcatcaacggcagcagcttcggagccgacggt ttcccctatgtgcaagtcctaaagactgcagacatcaatagctcagaggtggaggtcctg tacctgcggaacgtgtcagccgaggacgcaggcgagtacacctgcctcgcaggcaattcc atcggcctctcctaccagtctgcctggctcacggtgctgccagaggaggaccccacatgg accgcagcagcgcccgaggccaggtatacggacatcatcctgtacgcgtcgggctccctg gccttggctgtgctcctgctgctggccgggctgtatcgagggcaggcgctccacggccgg cacccccgcccgcccgccactgtgcagaagctctcccgcttccctctggcccgacagttc tccctggagtcaggctcttccggcaagtcaagctcatccctggtacgaggcgtgcgtctc tcctccagcggccccgccttgctcgccggcctcgtgagtctagatctacctctcgaccca ctatgggagttcccccgggacaggctggtgcttgggaagcccctaggcgagggctgcttt ggccaggtagtacgtgcagaggcctttggcatggaccctgcccggcctgaccaagccagc actgtggccgtcaagatgctcaaagacaacgcctctgacaaggacctggccgacctggtc tcggagatggaggtgatgaagctgatcggccgacacaagaacatcatcaacctgcttggt gtctgcacccaggaagggcccctgtacgtgatcgtggagtgcgccgccaagggaaacctg cgggagttcctgcgggcccggcgccccccaggccccgacctcagccccgacggtcctcgg agcagtgaggggccgctctccttcccagtcctggtctcctgcgcctaccaggtggcccga ggcatgcagtatctggagtcccggaagtgtatccaccgggacctggctgcccgcaatgtg ctggtgactgaggacaatgtgatgaagattgctgactttgggctggcccgcggcgtccac cacattgactactataagaaaaccagcaacggccgcctgcctgtgaagtggatggcgccc gaggccttgtttgaccgggtgtacacacaccagagtgacgtgtga >gi568815593f:176989603_177197673|GENSCAN_predicted_peptide_5|329_aa MCVRAGLGAGTGRAAPRTADAAVLGTQADGPLVPPPPRAGRLLRSPVPRGVRASPLRAGH LDPSGAAPGHGTARLAGTSPLAQPSAWGLTACILHRARSTESSTTASGFRSRPHAGRIPG RTPPIGPFGQVAATGQSAAAGAVGRAKGSAAVEPQKRSSLLPASLGSGANAPVKTLGQQQ QGEAEEEEEEEEEEEQEEEEEEKEEEEKEEEEEQEEKEEEEEEEEEEEEEEEEEEKEEEE EQEEKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEELFSENSSSTGMPGYGEPPRV KISHSRIPKDHLEVGPGLSVAQRPPDDRG >gi568815593f:176989603_177197673|GENSCAN_predicted_CDS_5|990_bp atgtgcgtgcgcgccgggctgggggccgggacgggacgggccgcgcctcgcaccgcggac gctgcagttctcggcacccaggcagatggccctctcgtgccaccgccgcccagagcgggc cgcctgctccgcagcccggttccccggggagtgcgcgcctcgcccctgagggccgggcat ctggaccccagcggagctgcgccgggacacgggaccgcccggctggccgggacaagcccg ctggcgcagccctcggcctggggcctcacggcctgcatcctgcaccgggcccgcagcacc gaatctagcacgaccgcgtcgggcttccgctcccggccacacgcgggccgcattcctggg agaacgcctcctattggtccgttcggtcaggtggctgccacgggccaatcagcggcggct ggtgccgtgggccgcgcgaagggctctgcggcggtggagcctcagaaaagatcatctttg cttccagcttctctgggctcaggggccaatgctcccgtcaagacgctggggcagcagcag cagggggaggctgaggaggaggaagaggaggaggaggaagaggagcaggaggaggaggag gaagagaaggaggaggaagagaaggaggaggaagaggagcaggaggagaaggaggaagag gaggaggaggaagaggaggaggaggaagaggaggaggaggaagagaaggaggaggaagag gagcaggaggagaaggaggaagaggaggaggaggaagaggaggaggaggaagaggaggag gaagaggaggaggaggaagaggaggaggaggaagaggaagaggaggaggaggacgaggag ttgttcagcgagaacagctcctccaccgggatgccaggatacggggagcccccgagggtg aagatctcccatagcaggatcccaaaagaccacctggaggtagggccagggctcagtgtg gctcagcgccctcccgacgaccggggctag >gi568815593f:176989603_177197673|GENSCAN_predicted_peptide_6|344_aa MDQTCELPRRNCLLPFSNPVNLDAPEDKDSPFGNGQSNFSEPLNGCTMQLSTVSGTSQNA YGQDSPSCYIPLRRLQDLASMINVEYLNGSADGSESFQDPEKSDSRAQTPIVCTSLSPGG PTALAMKQEPSCNNSPELQVKVTKTIKNGFLHFENFTCVDDADVDSEMDPEQPVTEDESI EEIFEETQTNATCNYETKSENGVKVAMGSEQDSTPESRHGAVKSPFLPLAPQTETQKNKQ RNEVDGSNEKAALLPAPFSLGDTNITIEEQLNSINLSFQDDPDSSTSTLGNMLELPGTSS SSTSQELPFLMRHPLIELFHFSNLLQMLNDRRMVDVELFDNFLX >gi568815593f:176989603_177197673|GENSCAN_predicted_CDS_6|1032_bp atggatcagacctgtgaactacccagaagaaattgtctgctgcccttttccaatccagtg aatttagatgcccctgaagacaaggacagccctttcggtaatggtcaatccaatttttct gagccacttaatgggtgtactatgcagttatcgactgtcagtggaacatcccaaaatgct tatggacaagattctccatcttgttacattccactgcggagactacaggatttggcctcc atgatcaatgtagagtatttaaatgggtctgctgatggatcagaatcctttcaagaccct gaaaaaagtgattcaagagctcagacgccaattgtttgcacttccttgagtcctggtggt cctacagcacttgctatgaaacaggaaccctcttgtaataactcccctgaactccaggta aaagtaacaaagactatcaagaatggctttctgcactttgagaattttacttgtgtggac gatgcagatgtagattctgaaatggacccagaacagccagtcacagaggatgagagtata gaggagatctttgaggaaactcagaccaatgccacctgcaattatgagactaaatcagag aatggtgtaaaagtggccatgggaagtgaacaagacagcacaccagagagtagacacggt gcagtcaaatcgccattcttgccattagctcctcagactgaaacacagaaaaataagcaa agaaatgaagtggacggcagcaatgaaaaagcagcccttctcccagcccccttttcacta ggagacacaaacattacaatagaagagcaattaaactcaataaatttatcttttcaggat gatccagattccagtaccagtacattaggaaacatgctagaattacctggaacttcatca tcatctacttcacaggaattgccatttctgatgaggcacccacttattgagctttttcac ttttccaatttgcttcaaatgctgaacgaccgtagaatggtcgacgttgagctcttcgac aacttcttggnn